ERGODIC AUTOMORPHISMS
LOUIS AUSLANDER, City University of New York 


Introduction. In recent years a great deal of progress has been made in 
determining when measure preserving mappings of measure spaces are ergodic. 
We shall examine some of the new methods developed in [2], [3], and [4] that 
have enabled us to make this progress. Not only should this serve as an introduction
to the subject, but it should also help clarify the power of the results in
these papers, because many of the computations given here, specializations
of the general results, have never been presented before, either explicitly or
implicitly.

To keep technical problems to a minimum, we have centered our study about 
automorphisms of tori and an example that is a natural generalization of this 
concept. The example will show why the classical function theoretic methods 
cannot yet be generalized. In our final part of this paper, we shall look again at 
automorphisms of tori, but this time we shall use the most general proof. Thus 
we shall see how the method of proof has been altered by our need to treat more 
general problems. To show the success of our method, we shall finish the paper 
with the study of ergodic properties of affine motions of tori first presented in 
[5]. 

Part I. Torus automorphisms. 

1. Basic definitions. Let $V^n = \{(x_1, \dots, x_n)\}$ be the $n$-dimensional real
vector space and let $L = \{(m_1, \dots, m_n) \mid m_i \text{ an integer}\}$. Then $L$ is a subgroup
of $V^n$, called a lattice subgroup, and the quotient group $V^n/L = T^n$ is the
$n$-dimensional torus group. Let $A$ be a linear transformation of $V^n$ which maps $L$
onto itself. Then relative to the above implicit basis of $V^n$, $A$ can be represented
as a matrix with integer entries and determinant, $|A|$, equal to $\pm 1$. Then $A$
induces an automorphism $A_L$ of $T^n$ onto itself; conversely, if $B$ is any automorphism
of $T^n$, there exists an automorphism $A$ of $V^n$ onto itself which maps $L$ onto
itself and such that
\[ B = A_L. \]
Let us define a measure on $V^n$ by the nonsingular $n$-form
\[ dx_1 \wedge \cdots \wedge dx_n. \]
Then $A$ induces the mapping $dA$ on $n$-forms, and we have
\[ |(dA)(dx_1 \wedge \cdots \wedge dx_n)| = \big|\, |A| \, dx_1 \wedge \cdots \wedge dx_n \big| = |dx_1 \wedge \cdots \wedge dx_n|. \]
Therefore $A$ acts as a measure-preserving transformation on the measure space


Professor Auslander is a University of Chicago student of S. S. Chern. He has taught at a
number of universities and been a visitor at the Institute for Advanced Study, Yale, and Berkeley. 
His books include Introduction to the Theory of Manifolds, Differential Geometry, What are Numbers, 
$(V^n, dx_1 \wedge \cdots \wedge dx_n)$. Since $dx_1 \wedge \cdots \wedge dx_n$ is invariant under translations by
$L$, the $n$-form $dx_1 \wedge \cdots \wedge dx_n$ projects to an $n$-form on $V^n/L = T^n$ and determines
a measure, $\mu$, on $T^n$. Clearly $A_L$ preserves the measure $\mu$. Thus $A_L$ determines
a measure preserving automorphism of the torus $T^n$.
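The construction can be checked by hand on a small case. The following is a minimal sketch, assuming the illustrative matrix $A = \begin{pmatrix} 2 & 1 \\ 1 & 1 \end{pmatrix}$ (our choice, not taken from the text) and hypothetical helper names: it verifies that $|A| = 1$ and that the induced map permutes the finite set of torus points whose coordinates have denominator $N$, as a bijection of $T^2$ must.

```python
from fractions import Fraction
from itertools import product

def det2(A):
    """Determinant of a 2x2 integer matrix."""
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]

def torus_map(A, p):
    """Apply the integer matrix A to a point p of the torus, reducing mod 1."""
    return tuple(sum(a * x for a, x in zip(row, p)) % 1 for row in A)

A = [[2, 1], [1, 1]]   # illustrative choice, not from the text
N = 5                  # denominator of the rational test points

# All points (i/N, j/N) of the torus with 0 <= i, j < N.
grid = [tuple(Fraction(i, N) for i in idx) for idx in product(range(N), repeat=2)]
images = {torus_map(A, p) for p in grid}

print("det A =", det2(A))
print("A_L permutes the N-torsion points:", images == set(grid))
```

Exact rational arithmetic is used so that reduction mod 1 introduces no rounding.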


DEFINITION 1. Let $(X, \mu)$ be a measure space with $\mu(X) < \infty$ and let $B$ be a
measure preserving mapping of $X$ onto itself. We say that $B$ acts ergodically on
$(X, \mu)$ if the only measurable subsets $U$ of $X$ invariant under $B$, i.e., such that
$BU = U$, have the property that
\[ \mu(U) = 0 \quad \text{or} \quad \mu(U) = \mu(X). \]

Since the complement of a measurable set $U$ is measurable, another way of
stating the condition of ergodicity is as follows: $B$ acts ergodically if there are
not two disjoint measurable subsets $U_1$ and $U_2$ of $X$ invariant under $B$ such that
\[ X = U_1 \cup U_2 \quad \text{and} \quad \mu(U_1) > 0 \quad \text{and} \quad \mu(U_2) > 0. \]


2. Nonergodic automorphisms of a torus. It is now a classical result that a
necessary and sufficient condition for an automorphism $A_L$ of a torus $T^n$ to be
ergodic is that none of the eigenvalues of $A$ be a root of unity. Let us begin by
showing geometrically that this condition is necessary.


LEMMA 1. Let $A_L$ be an automorphism of the torus $T^n$ and let $A$ have an eigenvalue
equal to 1. Then $T^n$ contains invariant subsets under $A_L$ of arbitrarily small
positive measure.


Proof. Let $I$ be the identity operator on $V^n$ and let $W$ be the range of $A - I$.
Now $W$ is a proper subspace of $V^n$ since $A - I$ is singular by hypothesis. Because
$A - I$ is a matrix with integer entries it maps $L$ into itself. But $W$ is spanned by
$(A - I)L$, since $L$ contains a basis of $V^n$. Thus $W \cap L$ contains a basis of $W$; hence
\[ W/(W \cap L) \]
is compact.
Now let us choose a basis $\xi_1, \dots, \xi_n$ of $V^n$ such that $\xi_1, \dots, \xi_k$ span $W$.
Then (relative to the basis $\xi_1, \dots, \xi_n$) if we consider $A$ operating on column
vectors, we have
\[ A = \begin{pmatrix} * & * \\ 0 & I \end{pmatrix}, \]
where $I$ is the $(n-k) \times (n-k)$ identity matrix and $0$ is the $(n-k) \times k$ zero
matrix. Thus $A$ maps the set $U_\epsilon$, defined by
\[ U_\epsilon = \{(\xi_1, \dots, \xi_n) \mid |\xi_i| \le \epsilon < C,\ i = k+1, \dots, n\}, \]
onto itself for all $\epsilon$, when $C$ is sufficiently small. Now
\[ U_\epsilon/(W \cap L) = W/(W \cap L) \times [-\epsilon, \epsilon] \times \cdots \times [-\epsilon, \epsilon] \]
(where there are $n-k$ factors $[-\epsilon, \epsilon]$) is compact. The image $U_\epsilon^*$ of $U_\epsilon$ in $T^n$ is
compact and has measure less than $K\epsilon^{n-k}$, where $K$ is a fixed constant. This
proves our assertion.


THEOREM 2. Let $A_L$ be an automorphism of the torus $T^n$ such that $A$ has an
eigenvalue which is a root of unity. Then $A_L$ is not ergodic on $T^n$.


Proof. Let $a$ be an integer such that $A^a$ has an eigenvalue 1, and let $U_\epsilon^*$ be an
invariant set for $A_L^a$ as in Lemma 1. Then $\bigcup_{i=0}^{a-1} A_L^i(U_\epsilon^*)$ is an invariant set for
$A_L$ of positive measure $D$, where
\[ D \le aK\epsilon^{n-k}. \]
By choosing $\epsilon$ sufficiently small we can prove our assertion.


3. Ergodic automorphisms of the torus. Before proving the converse of 
Theorem 2 it will be convenient to look at the concept of ergodicity from another 
point of view. 

Let $(X, \mu)$ be a measure space, $\mu(X) < \infty$, and let $B$ be a measure preserving
transformation of $X$. Let $L^2(X, \mu)$ be the Hilbert space of square summable
functions on $X$, and let
\[ U_B(f)(x) = f(B(x)), \qquad f \in L^2(X, \mu),\ x \in X. \]
Then it is easy to verify that $U_B$ is a unitary operator on $L^2(X, \mu)$.


THEOREM 3. A necessary and sufficient condition for a measure-preserving
transformation $B$ to be ergodic on $(X, \mu)$ is that
\[ U_B f = f \]
implies $f$ = constant a.e.


Proof. Let $B$ not be ergodic on $X$. Then there exist two disjoint measurable
sets $U_1$ and $U_2$, each of positive measure and each invariant under $B$, such that
\[ X = U_1 \cup U_2. \]
If $\chi$ is the characteristic function of $U_1$, then $U_B(\chi) = \chi$ since $B(U_1) = U_1$.
But $\chi \ne$ constant a.e.


Conversely, let $U_B f = f$, $f \in L^2(X, \mu)$, $f \ne$ constant a.e. Then there exists a
set of measure zero, say $K$, such that
\[ f(x) = f(Bx), \qquad x \in X - K. \]
Let us choose $C_0$ such that
\[ f^{-1}\{r \mid r < C_0\} = U_1 \quad \text{and} \quad f^{-1}\{r \mid C_0 \le r\} = U_2 \]
have positive measure. Then both $U_1 - (U_1 \cap K)$ and $U_2 - (U_2 \cap K)$ have positive
measure and are invariant under $B$. Thus $B$ is not ergodic. These results
prove our assertion.
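Theorem 3 can be illustrated on the torus $T^2$. The sketch below assumes the shear matrix $\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$ (our illustrative choice; it has eigenvalue 1) and checks numerically that the nonconstant function $f(x, y) = e^{2\pi i y}$ is fixed by the induced operator, so that the shear cannot be ergodic. The function names are hypothetical.

```python
import cmath

def shear(p):
    """Torus map induced by the matrix [[1, 1], [0, 1]]: (x, y) -> (x + y mod 1, y)."""
    x, y = p
    return ((x + y) % 1.0, y)

def f(p):
    """A nonconstant function on the torus depending only on the second coordinate."""
    return cmath.exp(2j * cmath.pi * p[1])

samples = [(0.1, 0.2), (0.75, 0.5), (0.3, 0.9)]

# f(B(p)) == f(p) exactly, because the shear leaves the second coordinate untouched.
invariant = all(f(shear(p)) == f(p) for p in samples)
nonconstant = f(samples[0]) != f(samples[1])
print(invariant, nonconstant)
```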


THEOREM 4. Let $A_L$ be an automorphism of the torus $T^n$ with $A$ having no
eigenvalues which are roots of unity. Then $A_L$ is ergodic.

Proof. Let $\ell \in L$, let $x \in V^n$, and let $\ell \cdot x$ denote the dot product in $V^n$. Then
the functions $e^{2\pi i \ell \cdot x}$ can be viewed first as functions on $T^n$ and second as an
orthonormal basis for $L^2(T^n)$. Thus if $f \in L^2(T^n)$, then
\[ f = \sum_{\ell \in L} b_\ell e^{2\pi i \ell \cdot x}. \]
We next observe that
\[ U_{A_L}(e^{2\pi i \ell \cdot x}) = e^{2\pi i \ell \cdot (Ax)} = e^{2\pi i (A'\ell) \cdot x}, \]
where $A'$ is the transpose of $A$ acting on the lattice $L$. Hence
\[ U_{A_L} f = \sum_{\ell \in L} b_\ell e^{2\pi i \ell \cdot (Ax)} = \sum_{\ell \in L} b_\ell e^{2\pi i (A'\ell) \cdot x}. \]
Since $A'$ is a 1-1 mapping of $L$ onto itself, setting $B = (A')^{-1}$ we have
\[ U_{A_L} f = \sum_{\ell \in L} b_{B\ell} e^{2\pi i \ell \cdot x}. \]
Thus $U_{A_L} f = f$ is equivalent to $b_{B\ell} = b_\ell$ for all $\ell \in L$.
Now $B$ acting on $L$ cannot have a finite orbit provided $\ell \ne 0$, i.e., the points
\[ \ell, B\ell, \dots, B^r\ell, \dots \]
are all distinct. For if $B^r\ell = B^s\ell$, where $r > s$, then
\[ (1) \qquad (B^{r-s} - I)(B^s(\ell)) = 0, \]
and $B^{r-s}$ has an eigenvalue equal to 1. But $B$ has an eigenvalue which is a root
of unity if and only if $A$ has an eigenvalue which is a root of unity. Thus (1)
would contradict the fact that $A$ has no eigenvalues that are roots of unity.

Suppose $A_L$ is not ergodic. By Theorem 3, there is a function $f \in L^2(T^n)$ such
that $U_{A_L} f = f$ and $f \ne$ constant a.e. Thus there exists an $\ell_0 \ne 0$ such that $b_{\ell_0} \ne 0$.
But then
\[ \sum_{\ell \in L} |b_\ell|^2 \le \int_{T^n} |f(x)|^2 \, dx < \infty \]
and
\[ \sum_{\ell \in L} |b_\ell|^2 \ge |b_{\ell_0}|^2 + |b_{B\ell_0}|^2 + \cdots + |b_{B^m\ell_0}|^2 + \cdots = \infty, \]
which is impossible. Thus Theorem 4 has been proved.
Thus Theorems 2 and 4 together have established the following: a necessary
and sufficient condition for an automorphism $A_L$ of a torus to be ergodic is that
$A$ has no eigenvalues which are roots of unity.
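The criterion lends itself to an exact computational check. The sketch below is one possible test, not a method taken from the papers cited: it uses the fact that $A$ has a root of unity as an eigenvalue exactly when $\det(A^k - I) = 0$ for some $k \ge 1$, and it assumes the standard bound $\varphi(k) \ge \sqrt{k/2}$ on Euler's function to restrict the search to $k \le 2n^2$ (a primitive $k$-th root of unity can be an eigenvalue of an $n \times n$ integer matrix only if $\varphi(k) \le n$). All function names are hypothetical.

```python
from fractions import Fraction

def mat_mul(A, B):
    """Product of two square integer matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def det(M):
    """Exact determinant by Gaussian elimination over the rationals."""
    n = len(M)
    M = [[Fraction(x) for x in row] for row in M]
    d = Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if M[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            M[c], M[pivot] = M[pivot], M[c]
            d = -d
        d *= M[c][c]
        for r in range(c + 1, n):
            factor = M[r][c] / M[c][c]
            for j in range(c, n):
                M[r][j] -= factor * M[c][j]
    return d

def has_root_of_unity_eigenvalue(A):
    """True if some eigenvalue of the integer matrix A is a root of unity."""
    n = len(A)
    power = [row[:] for row in A]          # invariant: power == A^k
    for k in range(1, 2 * n * n + 1):
        shifted = [[power[i][j] - (1 if i == j else 0) for j in range(n)]
                   for i in range(n)]
        if det(shifted) == 0:              # A^k has eigenvalue 1
            return True
        power = mat_mul(power, A)
    return False

def is_ergodic_torus_automorphism(A):
    """Criterion of Theorems 2 and 4 for an integer matrix with determinant +-1."""
    return not has_root_of_unity_eigenvalue(A)

print(is_ergodic_torus_automorphism([[2, 1], [1, 1]]))    # hyperbolic example
print(is_ergodic_torus_automorphism([[1, 1], [0, 1]]))    # shear, eigenvalue 1
print(is_ergodic_torus_automorphism([[0, -1], [1, 0]]))   # rotation by 90 degrees
```

Working with $\det(A^k - I)$ in rational arithmetic avoids deciding numerically whether a floating-point eigenvalue lies exactly on the unit circle at a root of unity.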


Part II. Generalizations of tori 


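4. Basic definitions. There exists a natural generalization of the torus which
involves non-abelian nilpotent Lie groups. Rather than looking at this in its
full generality, we shall content ourselves with a detailed examination of the
simplest special case.
Let $x_1, x_2, x_3$ be real numbers and consider the set of matrices
\[ N_3 = \left\{ \begin{pmatrix} 1 & x_1 & x_3 \\ 0 & 1 & x_2 \\ 0 & 0 & 1 \end{pmatrix} \right\}. \]
Then $N_3$ is a group, since
\[ \begin{pmatrix} 1 & x_1 & x_3 \\ 0 & 1 & x_2 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 1 & y_1 & y_3 \\ 0 & 1 & y_2 \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & x_1 + y_1 & x_3 + y_3 + x_1 y_2 \\ 0 & 1 & x_2 + y_2 \\ 0 & 0 & 1 \end{pmatrix} \]
and
\[ \begin{pmatrix} 1 & x_1 & x_3 \\ 0 & 1 & x_2 \\ 0 & 0 & 1 \end{pmatrix}^{-1} = \begin{pmatrix} 1 & -x_1 & -x_3 + x_1 x_2 \\ 0 & 1 & -x_2 \\ 0 & 0 & 1 \end{pmatrix}. \]
If we denote the element
\[ \begin{pmatrix} 1 & x_1 & x_3 \\ 0 & 1 & x_2 \\ 0 & 0 & 1 \end{pmatrix} \]
of $N_3$ by the triple $(x_1, x_2, x_3)$ then the group multiplication becomes
\[ (x_1, x_2, x_3)(y_1, y_2, y_3) = (x_1 + y_1,\ x_2 + y_2,\ x_3 + y_3 + x_1 y_2); \]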


thus instead of having a linear rule for multiplying group elements we have a 
polynomial rule. 
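The polynomial rule is easy to check against matrix multiplication. The following is a small sketch with hypothetical helper names; it encodes an element of $N_3$ as a triple, compares the triple product with the product of the corresponding $3 \times 3$ matrices, and confirms the inverse formula and the failure of commutativity on one example.

```python
def mul(x, y):
    """Group law of N_3 on triples: (x1, x2, x3)(y1, y2, y3)."""
    return (x[0] + y[0], x[1] + y[1], x[2] + y[2] + x[0] * y[1])

def inv(x):
    """Inverse in N_3: (-x1, -x2, -x3 + x1*x2)."""
    return (-x[0], -x[1], -x[2] + x[0] * x[1])

def to_matrix(x):
    """Upper triangular 3x3 matrix corresponding to the triple x."""
    return [[1, x[0], x[2]], [0, 1, x[1]], [0, 0, 1]]

def mat_mul(A, B):
    """Ordinary product of two 3x3 matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

x, y = (1, 2, 3), (4, 5, 6)
print(mul(x, y))                                              # product as a triple
print(to_matrix(mul(x, y)) == mat_mul(to_matrix(x), to_matrix(y)))
print(mul(x, inv(x)))                                         # identity element
print(mul(x, y) == mul(y, x))                                 # N_3 is not abelian
```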

We define a measure on $N_3$ by means of the nonsingular form $dx_1 \wedge dx_2 \wedge dx_3$.
We shall see that this form on $N_3$ is invariant under $N_3$ acting on itself by left
multiplication. Since
\[ (a_1, a_2, a_3)(x_1, x_2, x_3) = (x_1 + a_1,\ x_2 + a_2,\ x_3 + a_3 + a_1 x_2), \]
we have
\[ dx_1 \wedge dx_2 \wedge dx_3 \mapsto d(x_1 + a_1) \wedge d(x_2 + a_2) \wedge d(x_3 + a_3 + a_1 x_2) \]
\[ = dx_1 \wedge dx_2 \wedge (dx_3 + a_1 \, dx_2) = dx_1 \wedge dx_2 \wedge dx_3. \]

A similar computation shows that the form $dx_1 \wedge dx_2 \wedge dx_3$ is right invariant
under $N_3$ acting on itself. Let us denote this measure space by $(N_3, dx)$. Let
\[ L = \{(m_1, m_2, \tfrac{1}{2} m_3) \in N_3 \mid m_i \text{ integers},\ i = 1, 2, 3\}. \]


Then it is straightforward to verify that $L$ is a subgroup of $N_3$, so we may form
the homogeneous space $N_3/L$ of right cosets $nL$, $n \in N_3$. Since $L$ is not normal in
$N_3$, the space $N_3/L$ is not a group. But it has several important properties that
follow from the lemma below.


LEMMA 5. Let $N_3$ and $L$ be as above, and let
\[ C = \{n = (x_1, x_2, x_3) \in N_3 \mid 0 \le x_1, x_2 < 1 \text{ and } 0 \le x_3 < \tfrac{1}{2}\}. \]
Then: (1) For distinct $n_1, n_2 \in C$ there exists no $\ell \in L$ such that $n_1 \ell = n_2$. (2) For any $n \in N_3$
there exist $n_1 \in C$ and $\ell \in L$ such that $n\ell = n_1$.


Proof. Let $(x_1, x_2, x_3)$ and $(y_1, y_2, y_3) \in C$, and let $(m_1, m_2, \tfrac{1}{2} m_3) \in L$ satisfy
\[ (x_1, x_2, x_3)(m_1, m_2, \tfrac{1}{2} m_3) = (y_1, y_2, y_3). \]
Since $(x_1, x_2, x_3)(m_1, m_2, \tfrac{1}{2} m_3) = (x_1 + m_1,\ x_2 + m_2,\ x_3 + \tfrac{1}{2} m_3 + x_1 m_2)$, the restrictions
$0 \le y_1, y_2 < 1$ and $0 \le x_1, x_2 < 1$ imply that $m_1 = m_2 = 0$. Also $0 \le x_3 + \tfrac{1}{2} m_3 < \tfrac{1}{2}$ with
$0 \le x_3 < \tfrac{1}{2}$ and $m_3$ an integer. This implies that $m_3 = 0$ and proves (1).

To prove assertion (2) let $n = (y_1, y_2, y_3)$. Let $y_i = x_i + m_i$, $i = 1, 2$, where $m_i$ is an integer
and $0 \le x_1, x_2 < 1$. Then
\[ (y_1, y_2, y_3)(-m_1, -m_2, 0) = (y_1 - m_1,\ y_2 - m_2,\ y_3 - y_1 m_2). \]
Let $y_3 - y_1 m_2 = \tfrac{1}{2} m_3 + x_3$ for $0 \le x_3 < \tfrac{1}{2}$ and $m_3$ an integer. Then
\[ (y_1, y_2, y_3)(-m_1, -m_2, -\tfrac{1}{2} m_3) = (x_1, x_2, x_3) \quad \text{and} \quad (x_1, x_2, x_3) \in C. \]
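The proof of assertion (2) is constructive and can be traced in code. The sketch below assumes the reading $L = \{(m_1, m_2, \tfrac{1}{2} m_3)\}$ and $C = \{0 \le x_1, x_2 < 1,\ 0 \le x_3 < \tfrac{1}{2}\}$ of the garbled source; the function names are hypothetical. Given $n \in N_3$ it returns the element $\ell \in L$ and the representative $n\ell \in C$.

```python
from fractions import Fraction
from math import floor

HALF = Fraction(1, 2)

def mul(x, y):
    """Group law of N_3 on triples."""
    return (x[0] + y[0], x[1] + y[1], x[2] + y[2] + x[0] * y[1])

def in_C(x):
    """Membership in the fundamental domain C of Lemma 5 (assumed reading)."""
    return 0 <= x[0] < 1 and 0 <= x[1] < 1 and 0 <= x[2] < HALF

def reduce_to_C(n):
    """Return (n * ell, ell) with ell in L and n * ell in C."""
    y1, y2, y3 = n
    m1, m2 = floor(y1), floor(y2)      # integer parts of the first two coordinates
    z3 = y3 - y1 * m2                  # third coordinate of n * (-m1, -m2, 0)
    m3 = floor(z3 / HALF)              # number of half-integer steps to remove
    ell = (-m1, -m2, -HALF * m3)
    return mul(n, ell), ell

n = (Fraction(7, 3), Fraction(-5, 4), Fraction(11, 5))
rep, ell = reduce_to_C(n)
print(rep, ell, in_C(rep))
```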


The set $C$ is often called a fundamental domain for the group $L$ acting on the
space $N_3$. The existence of $C$ has certain important implications which we list
below:

P.1. $N_3/L$ is a differentiable manifold with the natural covering mapping
\[ P: N_3 \to N_3/L \]
a differentiable mapping and a local diffeomorphism.

P.2. $N_3/L$ is compact. (This follows from the fact that the closure $\bar{C}$ of $C$
in $N_3$ is compact and $P$ maps $\bar{C}$ onto $N_3/L$ by Lemma 5.)

P.3. Let $A$ be an automorphism of $N_3$ which maps $L$ onto itself. Then $A$
determines a 1-1 mapping of $N_3/L$ onto itself which we call an automorphism of
$N_3/L$ and denote by $A_L$.

The left and right invariance of the form $dx_1 \wedge dx_2 \wedge dx_3$ on $N_3$ also has several
important consequences:

P.4. $P$ induces a mapping of the 3-form $dx_1 \wedge dx_2 \wedge dx_3$ to a 3-form on $N_3/L$
and thus determines a measure on $N_3/L$ which we denote by $\mu$.

P.5. $N_3$ acts on $N_3/L$ by $n_0(nL) = (n_0 n)L$, $n_0 \in N_3$.

This defines $N_3$ acting on $N_3/L$, and it acts as a group of measure preserving
transformations on the measure space $(N_3/L, \mu)$.


5. Automorphisms of $N_3$ and $N_3/L$. In order to facilitate the discussion of
automorphisms of $N_3$ and $N_3/L$, it is very convenient to introduce $L(N_3)$, the
Lie algebra of $N_3$. Since we shall not need the algebra structure of $L(N_3)$, we
shall content ourselves with defining the vector space structure on $L(N_3)$.

Consider the space $E(V^3)$ of all $3 \times 3$ real matrices as a 9-dimensional real
euclidean space. We can consider $N_3$ as a 3-dimensional submanifold of $E(V^3)$.
Hence, we may talk about the tangent space of $N_3$ at any of its points, $n$, and
denote this by $T_n(N_3)$, $n \in N_3$. Now the identity $e$ is an element of $N_3$, and so we
may consider $T_e(N_3)$. If we consider $T_e(N_3)$ as a subspace of $E(V^3)$, it is precisely
the subspace
\[ \begin{pmatrix} 0 & a & c \\ 0 & 0 & b \\ 0 & 0 & 0 \end{pmatrix}, \]
where $a, b, c$ are real numbers. Now every differentiable mapping of $N_3$ onto itself
which leaves $e$ fixed induces a linear transformation of $T_e(N_3)$ which is the
Jacobian of the mapping at $e$. Thus if $A$ is an automorphism of $N_3$, then $A$
determines a linear transformation of $T_e(N_3)$ onto itself, which we denote by
$\mathrm{ad}(A)$.

We next observe that there is a diffeomorphism, exp, of $T_e(N_3)$ onto $N_3$
defined by
\[ \exp \begin{pmatrix} 0 & a & c \\ 0 & 0 & b \\ 0 & 0 & 0 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} + \begin{pmatrix} 0 & a & c \\ 0 & 0 & b \\ 0 & 0 & 0 \end{pmatrix} + \frac{1}{2} \begin{pmatrix} 0 & a & c \\ 0 & 0 & b \\ 0 & 0 & 0 \end{pmatrix}^2 = \begin{pmatrix} 1 & a & c + \tfrac{1}{2} ab \\ 0 & 1 & b \\ 0 & 0 & 1 \end{pmatrix}, \]
since
\[ \begin{pmatrix} 0 & a & c \\ 0 & 0 & b \\ 0 & 0 & 0 \end{pmatrix}^n = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} \qquad (n \ge 3). \]
Thus $\exp(a, b, c) = (a, b, c + \tfrac{1}{2} ab)$. It is straightforward to verify that exp has an
inverse $\log: N_3 \to T_e(N_3)$ given by
\[ \log \begin{pmatrix} 1 & x_1 & x_3 \\ 0 & 1 & x_2 \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 0 & x_1 & x_3 - \tfrac{1}{2} x_1 x_2 \\ 0 & 0 & x_2 \\ 0 & 0 & 0 \end{pmatrix}. \]
It is interesting to note that $\log L$ is a lattice in the vector space $T_e(N_3)$. Let us
examine one other important property of the exponential mapping.
LEMMA 6. Consider the 1-dimensional vector subspace $V^1$,
\[ (at, bt, ct), \qquad -\infty < t < \infty, \]
of $T_e(N_3)$. Then $\exp(V^1)$ is a subgroup of $N_3$ isomorphic to the real line.
Proof. We have $\exp(at, bt, ct) = (at, bt, ct + \tfrac{1}{2} abt^2)$. We next note that
\[ (at_1, bt_1, ct_1 + \tfrac{1}{2} abt_1^2)(at_2, bt_2, ct_2 + \tfrac{1}{2} abt_2^2) \]
\[ = (a(t_1 + t_2),\ b(t_1 + t_2),\ c(t_1 + t_2) + \tfrac{1}{2} ab(t_1^2 + t_2^2) + abt_1 t_2) \]
\[ = (a(t_1 + t_2),\ b(t_1 + t_2),\ c(t_1 + t_2) + \tfrac{1}{2} ab(t_1 + t_2)^2). \]
Thus $\exp(V^1)$ is a subgroup of $N_3$ isomorphic to the reals.
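The computation in Lemma 6 can be confirmed with exact arithmetic. The sketch below uses the coordinate formulas $\exp(a, b, c) = (a, b, c + \tfrac{1}{2} ab)$ and $\log(x_1, x_2, x_3) = (x_1, x_2, x_3 - \tfrac{1}{2} x_1 x_2)$ from the text; the direction vector and parameter values are arbitrary illustrative choices, and the helper names are hypothetical.

```python
from fractions import Fraction

HALF = Fraction(1, 2)

def mul(x, y):
    """Group law of N_3 on triples."""
    return (x[0] + y[0], x[1] + y[1], x[2] + y[2] + x[0] * y[1])

def exp(v):
    """Exponential map T_e(N_3) -> N_3 in coordinates."""
    a, b, c = v
    return (a, b, c + HALF * a * b)

def log(x):
    """Inverse of exp."""
    return (x[0], x[1], x[2] - HALF * x[0] * x[1])

def one_parameter(v, t):
    """The point exp(t v) of the 1-parameter subgroup through the direction v."""
    return exp(tuple(t * c for c in v))

v = (2, 3, 5)                                  # arbitrary direction (a, b, c)
t1, t2 = Fraction(1, 3), Fraction(5, 7)

lhs = mul(one_parameter(v, t1), one_parameter(v, t2))
rhs = one_parameter(v, t1 + t2)
print(lhs == rhs)                              # exp(t1 v) exp(t2 v) = exp((t1 + t2) v)
print(log(exp(v)) == v)                        # log inverts exp
```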
Nontrivial subgroups of Lie groups that are continuous images of the line
are called one-parameter groups. Thus $\exp(V^1)$ is a 1-parameter subgroup of $N_3$.
The converse of Lemma 6 is also true, i.e., every 1-parameter subgroup of $N_3$
is $\exp(V^1)$ for some 1-dimensional subspace $V^1$ of $T_e(N_3)$. We neither need nor
prove this statement.
Let $A^*$ be a linear transformation of $T_e(N_3)$ such that (using column vectors)
\[ A^* = \begin{pmatrix} a_{11} & a_{12} & 0 \\ a_{21} & a_{22} & 0 \\ 0 & 0 & 1 \end{pmatrix}, \qquad \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix} = 1, \]
with integer entries $a_{ij}$. Then it is a straightforward, if somewhat lengthy, computation to verify the
following: If we define $A: N_3 \to N_3$ by
\[ A = \exp \circ A^* \circ \log, \]
then $A$ is an automorphism of $N_3$ which maps $L$ onto itself. Also, each automorphism
$A$ of $N_3$ which maps $L$ onto itself has the property that $\mathrm{ad}(A) = A^*$.
Thus the mappings exp and log enable us to pass back and forth from the
group $N_3$ to the vector space $T_e(N_3)$ and replace automorphisms of $N_3$ by certain
linear transformations of $T_e(N_3)$. Now
\[ \exp^*(da \wedge db \wedge dc) = da \wedge db \wedge d(c + \tfrac{1}{2} ab) = da \wedge db \wedge dc. \]
Since $A^*$ preserves the form $da \wedge db \wedge dc$ it is straightforward to verify that each
automorphism of $N_3$ of the form
\[ A = \exp \circ A^* \circ \log \]
preserves the form $dx_1 \wedge dx_2 \wedge dx_3$, and hence the measure $\mu$. Thus every automorphism
of $N_3$ which maps $L$ onto itself is measure preserving.
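The "somewhat lengthy" verification can at least be spot-checked. The sketch below assumes an illustrative $A^*$ whose upper left block is $\begin{pmatrix} 2 & 1 \\ 1 & 1 \end{pmatrix}$ (integer entries, determinant 1) and whose last diagonal entry is 1, and it assumes the reading $L = \{(m_1, m_2, \tfrac{1}{2} m_3)\}$; neither the matrix nor the helper names come from the text. It tests the homomorphism property on rational points and checks that one element of $L$ is sent into $L$.

```python
from fractions import Fraction

HALF = Fraction(1, 2)
A_STAR = ((2, 1, 0), (1, 1, 0), (0, 0, 1))   # hypothetical example of A*

def mul(x, y):
    """Group law of N_3 on triples."""
    return (x[0] + y[0], x[1] + y[1], x[2] + y[2] + x[0] * y[1])

def exp(v):
    return (v[0], v[1], v[2] + HALF * v[0] * v[1])

def log(x):
    return (x[0], x[1], x[2] - HALF * x[0] * x[1])

def apply(M, v):
    """Matrix times column vector."""
    return tuple(sum(m * c for m, c in zip(row, v)) for row in M)

def A(x):
    """The map exp o A* o log on N_3."""
    return exp(apply(A_STAR, log(x)))

def in_L(x):
    """Membership in L = {(m1, m2, m3/2)} with m1, m2, m3 integers (assumed reading)."""
    return all(Fraction(c).denominator == 1 for c in (x[0], x[1], 2 * x[2]))

x = (Fraction(1, 3), Fraction(2, 5), Fraction(-1, 7))
y = (Fraction(3, 2), Fraction(-4, 3), Fraction(5, 6))
print(A(mul(x, y)) == mul(A(x), A(y)))       # A respects the group law

ell = (1, 2, Fraction(3, 2))
print(A(ell), in_L(A(ell)))                  # the image of a lattice element
```

A check on sample points is evidence, not a proof; the general statement rests on the determinant of the upper left block being equal to the last diagonal entry.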
DEFINITION. Let $A$ be an automorphism of $N_3$ which maps the group $L$ onto
itself. Then $A$ induces a measure-preserving diffeomorphism $A_L$ of $N_3/L$ onto itself
which we call an automorphism of $N_3/L$.


We shall show the following: If the eigenvalues of $A^*$ are $1, e^\lambda, e^{-\lambda}$, with real
$\lambda \ne 0$, then $A$ induces an ergodic automorphism $A_L$ of $N_3/L$. (The action of $A_L$ is
ergodic even if $\lambda$ is not real, but the proof of this assertion requires a technique
that would obscure the main point of our discussion.) The proof of the assertion
with real $\lambda$ will be our goal in the next three sections. Before going on to the next
sections, let us prove one important group theoretic lemma.


LEMMA 7. Let all notation be as above and let $v_1$ and $v_2$ be eigenvectors of $A^*$ in
$T_e(N_3)$ such that
\[ A^* v_1 = e^\lambda v_1, \qquad A^* v_2 = e^{-\lambda} v_2. \]
Then $N_3$ is generated as a group by the 1-parameter subgroups $\exp(t v_i) = q_i(t)$,
$i = 1, 2$. Furthermore $A(q_1(t)) = q_1(e^\lambda t)$ and $A(q_2(t)) = q_2(e^{-\lambda} t)$.


Proof. The last statements in the lemma follow trivially from our definition
of $q_i(t)$, $i = 1, 2$, and the fact that $A = \exp \circ A^* \circ \log$. It remains only to prove the
first part. The first part follows from the following general group theoretic facts:
(1) The subgroup $Z = \{(0, 0, x_3) \in N_3 \mid \text{all real } x_3\}$ is the commutator subgroup
and the center of $N_3$. (2) The images of $q_1(t)$ and $q_2(t)$ in $N_3/Z$ generate this
two-dimensional vector space as a group since they are linearly independent. (3)
Any element in $Z$ can be written as the commutator of some pair of elements
$a, b$, where $a \in q_1(t)$ and $b \in q_2(t)$.
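Fact (3) can be made concrete. In coordinates the commutator $xyx^{-1}y^{-1}$ in $N_3$ equals $(0, 0, x_1 y_2 - y_1 x_2)$, so two 1-parameter subgroups with linearly independent directions produce every element of $Z$. The sketch below uses two rational directions chosen for illustration only (the actual eigenvectors of $A^*$ are generally irrational), and hypothetical helper names.

```python
from fractions import Fraction

HALF = Fraction(1, 2)

def mul(x, y):
    return (x[0] + y[0], x[1] + y[1], x[2] + y[2] + x[0] * y[1])

def inv(x):
    return (-x[0], -x[1], -x[2] + x[0] * x[1])

def exp(v):
    return (v[0], v[1], v[2] + HALF * v[0] * v[1])

def q(v, t):
    """Point exp(t v) on the 1-parameter subgroup with direction v."""
    return exp(tuple(t * c for c in v))

def commutator(x, y):
    """x y x^{-1} y^{-1}."""
    return mul(mul(x, y), mul(inv(x), inv(y)))

V1 = (1, 2, 0)   # illustrative stand-in for v_1
V2 = (3, 1, 0)   # illustrative stand-in for v_2
DET = V1[0] * V2[1] - V2[0] * V1[1]      # nonzero because V1, V2 are independent

def central_element_as_commutator(z):
    """Return (a, b) with a on q_1, b on q_2 and [a, b] = (0, 0, z)."""
    return q(V1, Fraction(z) / DET), q(V2, 1)

a, b = central_element_as_commutator(7)
print(commutator(a, b))
```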


6. Flows and automorphisms. Since we do not have a basis of $L^2(N_3/L)$ on
which $A$ acts in a controllable fashion, we must find a different way of proving
our theorem. It turns out that this is most conveniently done by changing from
a discrete to a continuous problem. We shall first examine this in an abstract
setting, and then see that in our particular case it can be given a nice formulation.

Let $(X, \mu)$ be a finite measure space, and let $B$ be a measure preserving 1-1
mapping of $X$. We shall build a new measure space $(Y, \mu \times dx)$ on which the
reals, $R$, act as a group of measure-preserving transformations, and where $R$
acts ergodically on $(Y, \mu \times dx)$ if and only if $B$ acts ergodically on $(X, \mu)$. This
construction is often called the skew product construction.


DEFINITION. Let $Y_1 = X \times I$, where $I = [0, 1]$, and let $dx$ be the Lebesgue measure
on $I$. Take $\mu \times dx$ as the measure on $Y_1$. Then $X \times 0$ and $X \times 1$ have measure zero in
$X \times I$. Now define $Y$ as $Y_1$ with $X \times 0$ identified with $X \times 1$ by the mapping $B$. To
be precise
\[ Y = \{(x, a) \mid x \in X,\ 0 \le a \le 1, \text{ and } (x, 0) = (Bx, 1)\}. \]
Then $\mu \times dx$ defines a measure on $Y$.
Now define the action of the group $R$ on $Y$. Let $R$ be parameterized by
$-\infty < t < \infty$; define $t(y)$ for $y \in Y$ as follows:
If $0 \le m \le t < m + 1$, let
\[ t(x, a) = (B^m x,\ (a + t) \bmod 1) \quad \text{if } a + (t \bmod 1) < 1, \]
\[ t(x, a) = (B^{m+1} x,\ (a + t) \bmod 1) \quad \text{if } a + (t \bmod 1) \ge 1, \]
where $m$ is an integer. We call $R$ acting on the measure space $(Y, \mu \times dx)$ the
skew product flow.
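The two cases of the definition amount to applying $B$ once for every time the second coordinate passes 1. The sketch below implements the flow for $t \ge 0$ in that form, with $X$ a three-point set and $B$ a cyclic permutation chosen purely for illustration; the names are hypothetical. It then checks the group property $s(t(y)) = (s + t)(y)$ on one example.

```python
from fractions import Fraction
from math import floor

def flow(B, t, y):
    """Skew product flow for t >= 0: move up the fiber, applying B at each wrap."""
    x, a = y
    total = a + t
    wraps = floor(total)          # equals m or m + 1 in the notation of the text
    for _ in range(wraps):
        x = B(x)
    return (x, total - wraps)

def B(x):
    """Illustrative base map: a cyclic permutation of X = {0, 1, 2}."""
    return (x + 1) % 3

y = (0, Fraction(1, 2))
t, s = Fraction(3, 4), Fraction(5, 2)

print(flow(B, t, y))
print(flow(B, s, flow(B, t, y)) == flow(B, s + t, y))
```

Negative times would use $B^{-1}$ in the same way; they are omitted here because the text defines the action only for $0 \le m \le t$.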

It is easy to see that $R$ acts as a group of measure preserving transformations
on $(Y, \mu \times dx)$. Just as we can talk of a single measure-preserving transformation
acting ergodically on a measure space, so can we talk of a group of measure-preserving
transformations acting ergodically on a measure space.


DEFINITION. Let $(X, \mu)$ be a measure space and let $\mu(X) < \infty$. Let $G$ be a group
of measure preserving transformations of $X$. We say that $G$ acts ergodically on $X$
if the only measurable subsets $U$ of $X$ such that $g(U) = U$ for all $g \in G$ have the property
that
\[ \mu(U) = 0 \quad \text{or} \quad \mu(U) = \mu(X). \]

Of course, this definition can be reformulated as follows: $G$ acts ergodically
if there are not two disjoint measurable subsets $U_1$ and $U_2$ of $X$ such that
$g(U_1) = U_1$ and $g(U_2) = U_2$ for all $g \in G$, and with the additional properties
\[ X = U_1 \cup U_2, \qquad \mu(U_1) > 0, \qquad \mu(U_2) > 0. \]
The following theorem is the reason for introducing skew product flows.


THEOREM 8. Let $(X, \mu)$ be a measure space with $\mu(X) < \infty$ and let $B$ be a measure
preserving mapping of $X$. The mapping $B$ acts ergodically on $(X, \mu)$ if and
only if the reals $R$ act ergodically as the skew product flow on $(Y, \mu \times dx)$.


Proof. Assume $B$ does not operate ergodically on $X$, i.e., let $U$ be an invariant
subset of $X$ such that $0 < \mu(U) < \mu(X)$. Then $U \times [0, 1]$ is invariant in $Y$ and
clearly $0 < (\mu \times dx)(U \times [0, 1]) < (\mu \times dx)(Y)$. Thus $R$ does not act ergodically
on $Y$.

Conversely assume $R$ does not act ergodically on $Y$. Let $V \subset Y$ be invariant
under $R$ and satisfy
\[ 0 < (\mu \times dx)(V) < (\mu \times dx)(Y). \]
Then the pre-image of $V$ in $Y_1$ is a set of the form $U \times [0, 1]$, where $U$ is invariant
under $B$. But clearly
\[ 0 < \mu(U) < \mu(X), \]
so $B$ is not ergodic.
Let us now return to our special case of an automorphism $A_L$ acting on
$N_3/L$, and let $A$ be the automorphism of $N_3$ which determines $A_L$. Further let
$A = \exp \circ A^* \circ \log$, where $A^*$ is a linear transformation of $T_e(N_3)$. Recall that
we are assuming that
\[ A^* = \begin{pmatrix} a_{11} & a_{12} & 0 \\ a_{21} & a_{22} & 0 \\ 0 & 0 & 1 \end{pmatrix} \]
and that the matrix
\[ \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix} \]
has positive real eigenvalues. This implies that $A(0, 0, x_3) = (0, 0, x_3)$, since
$\exp(0, 0, Ct) = (0, 0, Ct)$, $C \ne 0$, and
\[ A^*(0, 0, Ct) = (0, 0, Ct). \]
Let a basis $v_1, v_2, v_3$ of $T_e(N_3)$ be chosen so that
\[ A^*(v_1) = e^\lambda v_1, \qquad A^*(v_2) = e^{-\lambda} v_2, \qquad A^*(v_3) = v_3, \]
where $v_3 = (0, 0, C)$ with $C \ne 0$.
We may define a 1-parameter group $A^*(t)$ acting on $T_e(N_3)$ by
\[ A^*(t)(v_1) = e^{\lambda t} v_1, \qquad A^*(t)(v_2) = e^{-\lambda t} v_2, \qquad A^*(t)(v_3) = v_3. \]
Let $A(t) = \exp \circ A^*(t) \circ \log$. Then we may verify that $A(t)$ is a 1-parameter
group of automorphisms of $N_3$.

We shall now introduce an algebraic construction that we shall ultimately see
is equivalent to the skew product construction.

We begin by defining the semidirect product of two groups $A$ and $B$, where
each element of $A$ may be considered an automorphism of $B$. The semidirect
product of $A$ and $B$, denoted by $A \cdot B$, is the set $A \times B$ with the multiplication
given by
\[ (a_1, b_1)(a_2, b_2) = (a_1 a_2,\ a_2(b_1) \cdot b_2). \]
Under this law of composition one can verify that $A \cdot B$ is a group. Note that $A$
and $B$ can be viewed as subgroups of $A \cdot B$ in an obvious way.
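A discrete instance of this construction is easy to experiment with. The sketch below takes $A$ to be the integers (written additively) acting on $B = Z^2$ through the powers of the illustrative matrix $\begin{pmatrix} 2 & 1 \\ 1 & 1 \end{pmatrix}$, and implements the multiplication $(a_1, b_1)(a_2, b_2) = (a_1 a_2, a_2(b_1) \cdot b_2)$ in that notation. The matrix and all names are our own assumptions, not taken from the text.

```python
M = ((2, 1), (1, 1))        # illustrative generator of the action
M_INV = ((1, -1), (-1, 2))  # its inverse, also an integer matrix

def act(m, v):
    """Action of the integer m on v in Z^2 through the m-th power of M."""
    step = M if m >= 0 else M_INV
    for _ in range(abs(m)):
        v = (step[0][0] * v[0] + step[0][1] * v[1],
             step[1][0] * v[0] + step[1][1] * v[1])
    return v

def sd_mul(g, h):
    """Semidirect product law: (a1, b1)(a2, b2) = (a1 + a2, a2(b1) + b2)."""
    (a1, b1), (a2, b2) = g, h
    moved = act(a2, b1)
    return (a1 + a2, (moved[0] + b2[0], moved[1] + b2[1]))

def sd_inv(g):
    """Inverse of (a, b): (-a, -(-a)(b))."""
    a, b = g
    moved = act(-a, b)
    return (-a, (-moved[0], -moved[1]))

g, h, k = (1, (1, 0)), (2, (0, 1)), (-1, (3, -2))
print(sd_mul(sd_mul(g, h), k) == sd_mul(g, sd_mul(h, k)))   # associativity
print(sd_mul(g, sd_inv(g)))                                  # identity element
print(sd_mul(g, h), sd_mul(h, g))                            # not commutative
```

Associativity relies on $(a_2 a_3)(b) = a_3(a_2(b))$, which holds here because the acting group is abelian, as it is for the group $A(t)$ of the text.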

Using this construction, we form the groups $A(t) \cdot N_3$ and $A(m) \cdot L$, where $m$
takes on integer values. It is clear that
\[ A(t) \cdot N_3 \supset A(m) \cdot L. \]
We introduce the notation $\Gamma = A(m) \cdot L$. Let us now examine the relation of
$A(t) \cdot N_3$ to the subgroup $\Gamma$.


LEMMA 9. Let all notation be as defined above. Then $\Gamma$ is a discrete subgroup of
$A(t) \cdot N_3$, and
\[ A(t) \cdot N_3/\Gamma \]
is compact and homeomorphic to the skew product space determined by $N_3/L$ and
the mapping $A_L$.


Proof. Since $A(t) \cdot N_3$ consists of all pairs $(t, n)$, $t \in R$, $n \in N_3$, we have that
$(t, x_1, x_2, x_3)$ (real parameters) serves as a coordinate system for $A(t) \cdot N_3$. Further
$\Gamma$ is the subgroup consisting of all the points $(m, m_1, m_2, \tfrac{1}{2} m_3)$, where $m$ and all $m_i$ are
integers. Thus $\Gamma$ is a discrete subgroup of $A(t) \cdot N_3$. We may argue essentially as
in Lemma 5 to see that the set
\[ C' = \{(t, x_1, x_2, \tfrac{1}{2} x_3) \in A(t) \cdot N_3 \mid 0 \le t < 1,\ 0 \le x_i < 1,\ i = 1, 2, 3\} \]
is a fundamental domain for $\Gamma$ acting on $A(t) \cdot N_3$. Thus again we have properties
P.1 and P.2 holding for $A(t) \cdot N_3/\Gamma$. In particular $A(t) \cdot N_3/\Gamma$ is compact.

It remains to verify our last assertion. This is best carried out in the language
of fiber bundles. The reader who is not familiar with this language can verify
our assertion by checking the identifications that $\Gamma$ makes on the
boundary of $C'$. In particular $L$ maps the sets
\[ \{(t_0, x_1, x_2, x_3)\} \]
onto themselves, and the identification space is $N_3/L$. Further $A(1)$ maps the
set $\{(0, x_1, x_2, x_3)\}$ onto $\{(1, x_1, x_2, x_3)\}$ exactly as $A$ does. Thus on the two
boundaries of $C'$ where $t = 0$ and $t = 1$, the identification is the same as $A_L$. This
shows that $A(t) \cdot N_3/\Gamma$ is the desired skew product space.

A more rigorous proof can be based on the following remarks:
\[ A(t) \cdot N_3/\Gamma \ \longrightarrow\ A(t) \cdot N_3/\Gamma N_3 \]
is a fiber bundle with fiber $N_3/L$ and base space $A(t)/A(m) = A(t) \cdot N_3/\Gamma N_3$,
which is topologically a circle. Thus $A(t) \cdot N_3/\Gamma$ may be pictured as
\[ N_3/L \times [0, 1] \]
with $N_3/L \times 0$ and $N_3/L \times 1$ identified by the mapping $A(1)$ on the fiber. This
again shows that $A(t) \cdot N_3/\Gamma$ is the desired skew product space.


LEMMA 10. Let all notation be as before. Then $A(t)$ acts on $(A(t) \cdot N_3/\Gamma, \mu \times dx)$ as
the skew product flow, where
\[ A(t) = (A(t), e) \in A(t) \cdot N_3. \]


Proof. That $A(t)$ acts on $(A(t) \cdot N_3/\Gamma, \mu \times dx)$ as a group of measure preserving
transformations we accept without proof. We shall, however, verify that
the action is correct.

Let $(t, n) \in C'$. Once we verify that $A(t)$ acts correctly on these points we
shall have verified it in general. Now let $(s, 0) \in A(t)$. Then
\[ (s, 0)(t, n) = (s + t, n) \]
by our definition of group multiplication for the semidirect product. Thus for
$0 \le s + t < 1$, we have $(s, 0)(t, n) \in C'$ and the action of $A(s)$ agrees with $s$ in the
skew product flow. For $s + t = 1$ we have
\[ (s + t, n)(-1, 0) = (0, A(1)n) = (0, A(n)). \]
But this also agrees with the skew product flow. The most general case then
follows since both actions are group actions. Theorem 8 and Lemmas 9 and 10
combine to yield a proof of the following theorem.


THEOREM 11. The transformation $A_L$ acts ergodically on $N_3/L$ if and only if
$A(t)$ acts ergodically on $(A(t) \cdot N_3/\Gamma, \mu \times dx)$.


7. Mautner phenomena. There are certain Lie groups whose unitary representations
have a strange property which we shall call the Mautner phenomenon.
Since this will play a crucial role in our discussion, we shall go into some detail
on the matter.

Let $H$ be a separable Hilbert space, and let $U(H)$ denote the group of unitary
operators on $H$ with the strong operator topology. Let $G$ be a Lie group. Then
a continuous homomorphism $U: G \to U(H)$ given by $U(g) = U_g$ is called a unitary
representation of $G$.


DEFINITION. Let $(G, p(t))$ be a pair consisting of a Lie group $G$ and a 1-parameter
subgroup $p(t)$. Let $U$ be any unitary representation of $G$ and let $\psi \in H$ satisfy
$U_{p(t)}\psi = \psi$ for all $t$. If this implies that $U_g\psi = \psi$ for all $g \in G$, we shall say that
$(G, p(t))$ exhibits the Mautner phenomenon.


The remarkable fact is that there are many Lie groups $G$ which have 1-parameter
subgroups $p(t)$ such that the pair $(G, p(t))$ exhibits the Mautner
phenomenon. We need to discuss only two such examples.


Example 1. Let $S_1$ be the group consisting of ordered pairs of real numbers
$(x_1, x_2)$ with multiplication given by
\[ (x_1, x_2)(y_1, y_2) = (x_1 + y_1,\ e^{-y_1} x_2 + y_2). \]
This is isomorphic to the group of matrices of the form
\[ \begin{pmatrix} e^{x_1} & x_2 \\ 0 & 1 \end{pmatrix}, \qquad -\infty < x_1, x_2 < \infty. \]
We let $p_1(t) = (t, 0) \in S_1$. We shall see that the pair $(S_1, p_1(t))$ has the Mautner
property.
Example 2. Let $t$ be a real number and let $z \in C$, the additive group of complex
numbers. Let $S_2$ be the ordered pairs $(t, z)$, $t \in R$, $z \in C$. We define multiplication
in $S_2$ by
\[ (t_1, z_1)(t_2, z_2) = (t_1 + t_2,\ e^{(i - \sigma)t_2} z_1 + z_2), \qquad \sigma > 0,\ i = \sqrt{-1}. \]
It is straightforward to verify that $S_2$ is a group under this law of composition.
We let $p_2(t) = (t, 0) \in S_2$. We shall see that the pair $(S_2, p_2(t))$ has the Mautner
property.
LEMMA 12. Let $g = (x_1, x_2) \in S_1$, and let $\phi_n(g) = p_1(-n) g p_1(n)$. Then
\[ \lim_{n \to \infty} \phi_n(g) = (x_1, 0). \]


Proof. $\phi_n(x_1, x_2) = (-n, 0)(x_1, x_2)(n, 0) = (x_1, e^{-n} x_2)$, hence $\lim_{n \to \infty} \phi_n(x_1, x_2)
= (x_1, 0)$.
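The contraction in Lemma 12 can be observed numerically. The sketch below assumes that the multiplication in $S_1$ reads $(x_1, x_2)(y_1, y_2) = (x_1 + y_1, e^{-y_1} x_2 + y_2)$; the source is garbled at that formula, and this is the reading under which the computation in the proof of Lemma 12 comes out as stated. The function names are hypothetical.

```python
import math

def s1_mul(x, y):
    """Assumed group law of S_1: (x1, x2)(y1, y2) = (x1 + y1, exp(-y1) x2 + y2)."""
    return (x[0] + y[0], math.exp(-y[0]) * x[1] + y[1])

def p1(t):
    """The 1-parameter subgroup p_1(t) = (t, 0)."""
    return (t, 0.0)

def phi(n, g):
    """Conjugation phi_n(g) = p_1(-n) g p_1(n)."""
    return s1_mul(s1_mul(p1(-n), g), p1(n))

g = (0.5, 3.0)
for n in (1, 5, 20):
    print(n, phi(n, g))     # second coordinate is exp(-n) * 3.0, shrinking to 0
```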


COROLLARY 13. $\lim_{n \to \infty} \phi_n(0, x_2) = e$, where $e$ is the identity element in $S_1$.
Exactly the same statements hold for $S_2$ and $p_2(t)$.


To be precise in $S_2$ we have
\[ \lim_{n \to \infty} (-n, 0)(t, z)(n, 0) = (t, 0) \]
or
\[ \lim_{n \to \infty} (-n, 0)(0, z)(n, 0) = (0, 0) = e \text{ in } S_2. \]
We let $H_1$ be the subgroup of $S_1$ of elements $(0, x_2)$ and $H_2$ be the subgroup
of $S_2$ of elements $(0, z)$. Note that $S_i$ is generated by the subgroups $p_i(t)$ and
$H_i$ ($i = 1, 2$).


THEOREM 14. The pairs $(S_1, p_1(t))$ and $(S_2, p_2(t))$ exhibit the Mautner phenomenon.


Proof. Let $U$ be a unitary representation of $S_i$ such that
\[ U_{p_i(t)}\psi = \psi \qquad \text{all } t. \]
If we can show that
\[ U_{h_i}\psi = \psi \]
for all $h_i \in H_i$, we shall have proved that $\psi$ is invariant under the group $S_i$ generated
by $p_i(t)$ and $H_i$. So let $h_i \in H_i$. Then (changing our notation slightly by setting
$U_g = U(g)$):
\[ (U(h_i)\psi, \psi) = (U(p_i(n)) U(p_i(-n) h_i p_i(n)) U(p_i(-n))\psi, \psi) \]
\[ = (U(\phi_n(h_i)) U(p_i(-n))\psi,\ U(p_i(-n))\psi) \]
\[ = (U(\phi_n(h_i))\psi, \psi). \]
Taking the limit as $n$ goes to infinity gives, because $U$ is continuous,
\[ (U_{h_i}\psi, \psi) = (\psi, \psi). \]
Since $U_{h_i}$ is unitary, the length of $U_{h_i}\psi$ is that of $\psi$, but the above relation then
implies that $U_{h_i}\psi = \psi$. This all holds for $i = 1, 2$, which proves our theorem.
8. Proof of main theorem. Before presenting the main result of Part II,
Theorem 15, we shall need the following algebraic lemma.


LEMMA 15. Let $A$ be an automorphism of $N_3$ such that
\[ A = \exp \circ A^* \circ \log, \]
where the eigenvalues of $A^*$ are $e^\lambda, e^{-\lambda}, 1$. Let $q_i(s)$ be the one parameter groups defined
in Lemma 7. Then the groups
\[ S_1^* = A(t) \cdot q_1(s) \quad \text{and} \quad S_2^* = A(t) \cdot q_2(s) \]
are isomorphic to $S_1$, with $q_1(s)$ and $q_2(s)$ corresponding to $H_1$ and $A(t)$ corresponding
to $p_1(t)$.


Proof. Lemma 7 tells us that
\[ A(t)(q_1(s)) = q_1(e^{\lambda t} s), \]
\[ A(t)(q_2(s)) = q_2(e^{-\lambda t} s). \]
But then multiplication in $A(t) \cdot q_2(s)$ is given by $(t_1, s_1)(t_2, s_2) = (t_1 + t_2,\ e^{-\lambda t_2} s_1 + s_2)$
and so agrees with multiplication in $S_1$. Since the mapping
\[ A(t) \cdot q_1(s) \to A(-t) \cdot q_2(s) \]
is an isomorphism, our assertion is verified.

Let us return to our group $A(t) \cdot N_3$ and recall certain facts.

1. $A(t) \cdot N_3$ acts as a group of measure preserving transformations on
$(A(t) \cdot N_3/\Gamma, dx \times \mu)$ by right multiplication, where $A(t) \cdot N_3/\Gamma$ is the left coset
space.

2. If $H = L^2(A(t) \cdot N_3/\Gamma, dx \times \mu)$, the above action induces a unitary representation
$U: A(t) \cdot N_3 \to U(H)$, where $U(H)$ denotes the unitary operators on $H$.

3. If $\psi \in H$ and $U_g\psi = \psi$ for all $g \in A(t) \cdot N_3$, then $\psi$ = constant a.e. This
follows because $A(t) \cdot N_3$ acts transitively on $A(t) \cdot N_3/\Gamma$.


THEOREM 15. Let $A$ be an automorphism of $N_3$ which maps $L$ onto itself such
that $A^*$ on $T_e(N_3)$ is given by a matrix
\[ \begin{pmatrix} e^\lambda & 0 & 0 \\ 0 & e^{-\lambda} & 0 \\ 0 & 0 & 1 \end{pmatrix}, \qquad \lambda \ne 0. \]
Then $A_L$ is ergodic on $N_3/L$.


Proof. By Theorem 11 we see it suffices to prove that $A(t)$ acts ergodically
on $(A(t) \cdot N_3/\Gamma, dx \times \mu)$. It is this last assertion that we shall actually prove.
Let $H = L^2(A(t) \cdot N_3/\Gamma, dx \times \mu)$ and let $\psi \in H$ satisfy




\[ U_{A(t)}\psi = \psi \qquad \text{all } t. \]
But by Statement 2 in this section the above unitary representation of $A(t)$ can
be extended to the whole group $A(t) \cdot N_3$. Since $S_1^*$ and $S_2^*$ are subgroups of
$A(t) \cdot N_3$, we get representations of $S_1^*$ and $S_2^*$. But the pairs $(S_1^*, A(t))$ and
$(S_2^*, A(t))$ exhibit the Mautner phenomenon, so
\[ U_g\psi = \psi, \qquad g \in S_1^* \text{ or } S_2^*. \]
Then $U_g\psi = \psi$ for all $g$ in the group generated by $S_1^*$ and $S_2^*$. But the group
generated by $S_1^*$ and $S_2^*$ is $A(t) \cdot N_3$, so we have
\[ U_g\psi = \psi, \qquad \text{all } g \in A(t) \cdot N_3. \]
But since $A(t) \cdot N_3$ acts transitively on $A(t) \cdot N_3/\Gamma$, we have $\psi$ = constant a.e.
This proves $A(t)$ acts ergodically on $(A(t) \cdot N_3/\Gamma, dx \times \mu)$, and so our assertion
follows.

Part III. Affine motions on the torus 


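9. Torus automorphisms revisited. Before beginning the study of affine
motions on the torus, let us re-examine the ergodic properties of the automorphisms
of the torus using the same method of skew products and Mautner phenomena
that we used on $N_3/L$.

Let $A$ be an automorphism of the $n$-dimensional real vector space $V^n$. Then we
may formulate the Jordan canonical form theorem for matrices as follows: We
may write
\[ V^n = V_1 \oplus \cdots \oplus V_k, \]
where $AV_i = V_i$, $i = 1, \dots, k$, and let $A|V_i = A_i$. Then $A_i = D_i R_i N_i$, where all
matrices on the right hand side commute, and
\[ D_i = \begin{pmatrix} e^{\lambda_i} & & 0 \\ & \ddots & \\ 0 & & e^{\lambda_i} \end{pmatrix}, \]
\[ R_i = \begin{pmatrix} \begin{matrix} \cos\theta_i & \sin\theta_i \\ -\sin\theta_i & \cos\theta_i \end{matrix} & & 0 \\ & \ddots & \\ 0 & & \begin{matrix} \cos\theta_i & \sin\theta_i \\ -\sin\theta_i & \cos\theta_i \end{matrix} \end{pmatrix}, \]
\[ N_i = \begin{pmatrix} 1 & & * \\ & \ddots & \\ 0 & & 1 \end{pmatrix}, \]
where the $*$ denotes some matrix entries.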
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In general there will exist no 1-parameter group of matrices A(¢) such that 
A(1) =A and A(i)V;=V; all ¢. But this is merely a technical difficulty that can 
be overcome using complex vector spaces. Instead of introducing this extra 
complication, we shall merely discuss the case where A(t) exists such that 
A(1)=A and A(t)V;=V;. Thus we shall restrict ourselves to automorphisms A 
with the above property which we shall call property 1-P. Hence if A has prop- 
erty 1-P there exists a 1-parameter group A(#) such that A(4)V;=V;, all ¢. Let 
A(t) [V; =A ,(t), where 


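\[ A_i(t) = D_i(t) R_i(t) N_i(t), \]


and where all matrices in the right hand side commute for all values of $t$ and

\[
D_i(t) = \begin{pmatrix} e^{\lambda_i t} & & 0 \\ & \ddots & \\ 0 & & e^{\lambda_i t} \end{pmatrix}, \qquad
R_i(t) = \begin{pmatrix}
\begin{matrix} \cos\theta_i t & \sin\theta_i t \\ -\sin\theta_i t & \cos\theta_i t \end{matrix} & & 0 \\
 & \ddots & \\
0 & & \begin{matrix} \cos\theta_i t & \sin\theta_i t \\ -\sin\theta_i t & \cos\theta_i t \end{matrix}
\end{pmatrix}, \qquad
N_i(t) = \begin{pmatrix} 1 & & * \\ & \ddots & \\ 0 & & 1 \end{pmatrix}.
\]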


We now proceed exactly as in Section 6. 


Step 1. Form the semidirect product $A(t)\cdot V^n$, where $V^n \supset L$ and $V^n/L$ is a
torus and $A(1)$ maps $L$ onto itself.


Step 2. Verify that if $m$ denotes an integer, then $A(m)\cdot L = \Gamma$ is a discrete
subgroup of $A(t)\cdot V^n$ and $A(t)\cdot V^n/\Gamma$ is compact.


Step 3. $A(t)$ acting on $A(t)\cdot V^n/\Gamma$ is equivalent to the skew product of $A$
acting on $V^n/L$.


Step 4. $A$ acts ergodically on $V^n/L$ if and only if $A(t)$ acts ergodically on
$A(t)\cdot V^n/\Gamma$.

We are now in a position to begin a Mautner phenomenon type of argument.

In our indexing of the invariant spaces $V_i$, let us assume that for $i = 1, \dots, s$,
$s \le k$, we have that $\lambda_i \neq 0$ in $D_i$.


LEMMA 16. The subgroup $A(t)\cdot V_i$ has a subgroup $S^*$ containing $A(t)$, iso-
morphic to one of the groups $S_1$ or $S_2$, and in which $A(t)$ corresponds to $p_1(t)$ or $p_2(t)$
of Section 7.

Proof. If in $R_i(t)$, $\theta_i = 0$, then $A_i(t)$ has a one-dimensional invariant subspace,
call it $W_1 \subset V_i$. Then the group $A(t)\cdot W_1$ is isomorphic to $S_1$ exactly as in Lemma
15.

If in $R_i(t)$, $\theta_i \neq 0$, then $A_i(t)$ has a 2-dimensional invariant subspace, $W_2 \subset V_i$,
and the group $A(t)\cdot W_2$ may be seen to be isomorphic to $S_2$ by an argument
analogous to that in Lemma 15.

Thus the pair $(S^*, A(t))$ exhibits the Mautner phenomenon by Theorem 14.
Now let $H = L^2(A(t)\cdot V^n/\Gamma)$, and let $\psi \in H$ satisfy


\[ U_{A(t)}\psi = \psi \quad \text{for all } t. \]

Let $H_1 \subset H$ satisfy

\[ U_{A(t)}\psi = \psi \quad \text{for all } \psi \in H_1, \]


in other words $H_1$ is the subspace of $H$ where $U_{A(t)}$ acts trivially. By the Mautner
phenomenon, we may conclude that


\[ U_g\psi = \psi, \quad \text{all } g \in S^*, \ \psi \in H_1. \]


Now by considering $A(t)\cdot V_i/S^*$ and using induction on the dimension of $V_i$,
we see that $A(t)\cdot V_i$ acts trivially on the elements of $H_1$.

Another inductive argument can be used to extend easily the above to all
$g \in A(t)\cdot(V_1 \oplus \cdots \oplus V_s)$. Consider the group generated by $L$ and $V_1 \oplus \cdots \oplus V_s$,
and let $C$ be the closure of this group in $V^n$. Then we may apply Theorem 3.5 of
[3] to conclude the following.


THEOREM 17. The flow $A(t)$ on $A(t)\cdot V^n/\Gamma$ is ergodic if and only if the flow $A(t)$
on $A(t)\cdot V^n/C$ is ergodic.


Thus if $s = k$ we have proved our theorem, or even if $C = V^n$ we have proved
that $A$ is ergodic on $V^n/L$.

We need an algebraic lemma to relate the above result to the theorems in 
Part I. 


LEMMA 18. Let all notation be as above. Then $C = V^n$ if and only if $A$ has no
eigenvalues which are roots of unity.


Proof. If $A$ has roots of unity as eigenvalues, then $s < k$. Let $\alpha$ be chosen so
$A^\alpha$ has an eigenvalue equal to one. Then the range of $(A^\alpha - I)$ contains
$V_1 \oplus \cdots \oplus V_s$, and also contains a lattice. Hence $C$ is contained in the range of
$(A^\alpha - I)$.

If $C \neq V^n$, then $s < k$ and $V^n/C_0$, where $C_0$ is the identity component of $C$,
contains a lattice invariant under $A$ restricted to $V^n/C_0$. But all eigenvalues of
$A$ have absolute value 1, so the semisimple part of $A$, call it $\sigma$, has the property
that $\sigma^\beta$, for some integer $\beta$, preserves a lattice in $V^n/C_0$. Thus $\sigma$ generates a dis-
crete, hence finite, subgroup of a compact group. Hence $\sigma^\gamma = e$ for some $\gamma$ and $\sigma$
consists of roots of unity. This proves our assertion.

Thus we have a new proof of Theorem 3. Theorem 1 stays the same and is not
altered by our discussion.
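
The eigenvalue condition of Lemma 18 can be tested exactly for an integer matrix. The following is a minimal sketch and not part of the original argument. The function names are hypothetical, and the check rests on two standard facts assumed here: $\det(A^k - I) = 0$ exactly when some eigenvalue $\lambda$ of $A$ satisfies $\lambda^k = 1$, and a root of unity of order $k$ can be an eigenvalue of an $n \times n$ integer matrix only if $\varphi(k) \le n$, which forces $k \le 2n^2$.

```python
def integer_det(matrix):
    """Exact determinant of a square integer matrix (Bareiss elimination)."""
    m = [row[:] for row in matrix]
    n = len(m)
    sign, prev = 1, 1
    for k in range(n - 1):
        if m[k][k] == 0:
            # Look for a nonzero pivot further down column k.
            for i in range(k + 1, n):
                if m[i][k] != 0:
                    m[k], m[i] = m[i], m[k]
                    sign = -sign
                    break
            else:
                return 0  # whole column is zero, so the determinant vanishes
        for i in range(k + 1, n):
            for j in range(k + 1, n):
                # Division by the previous pivot is exact in Bareiss' scheme.
                m[i][j] = (m[i][j] * m[k][k] - m[i][k] * m[k][j]) // prev
        prev = m[k][k]
    return sign * m[n - 1][n - 1]


def mat_mul(a, b):
    """Product of two square integer matrices of the same size."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def has_root_of_unity_eigenvalue(a):
    """True if some eigenvalue of the integer matrix a is a root of unity.

    Assumption (not from the text): it suffices to test orders k <= 2*n*n.
    """
    n = len(a)
    power = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(2 * n * n):
        power = mat_mul(power, a)  # power is now a^k
        shifted = [[power[i][j] - int(i == j) for j in range(n)]
                   for i in range(n)]
        if integer_det(shifted) == 0:
            return True
    return False


if __name__ == "__main__":
    examples = {
        "hyperbolic [[2,1],[1,1]]": [[2, 1], [1, 1]],
        "rotation   [[0,-1],[1,0]]": [[0, -1], [1, 0]],
        "shear      [[1,1],[0,1]]": [[1, 1], [0, 1]],
    }
    for name, a in examples.items():
        print(name, "root-of-unity eigenvalue:", has_root_of_unity_eigenvalue(a))
```

Exact integer arithmetic is used instead of floating-point eigenvalues because deciding whether an eigenvalue is exactly a root of unity is sensitive to rounding.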


10. Main theorem. Again let $V^n/L$ be a torus. Let $A$ be an automorphism of
$V^n/L$ and let $\xi \in V^n/L$. Question: When is the transformation


\[ f\colon x \to A(x) + \xi \]


ergodic? This was answered in [5]. We present another proof.

We, of course, may consider the mapping of the torus onto itself given by
$f'(x) = (-y) \circ f \circ y(x)$, $y \in V^n/L$, where $y$ also denotes translation by $y$.
Clearly $f$ is ergodic if and only if $f'$ is ergodic.
Since


\[ f'(x) = A(x) + (A - I)y + \xi, \]


we may choose $y$ so that $\xi$ is not in the range of $(A - I)y$. Hence using the nota-
tion of Section 9 we see that $\xi$ is not in $V_i$ for any $i$ where $\theta_i$ or $\lambda_i$ are not zero.
But then we may apply Theorem 7 to see that $f'$ is ergodic on $V^n/L$ if and only
if it induces an ergodic motion on $V^n/C_0$. Let $f^*$ be the image of $f'$ on $V^n/C_0$. We
may reason as in Section 9 and conclude that the semisimple part of $f^*$ must act
trivially in $V^n/C_0$. But then


\[ A(t)\cdot V^n/C_0\Gamma \]


is a nil-manifold, and we may apply the results of [4] on nil-flows to conclude
the following result:


THEOREM 19. An affine transformation $f = A(x) + \xi$ on a torus is ergodic if and
only if
1. $A$ has no eigenvalues which are roots of unity.
2. $f$ induces an ergodic transformation on $V^n/(L + (A - I)V^n)$, where $(A - I)V^n$
denotes the range of $A - I$.


MATHEMATICS AND THINKING MATHEMATICALLY
M. L. CARTWRIGHT, Claremont Graduate School


This year I find myself in the Division of Applied Mathematics at Brown 
University. Not every University or College has a separate department of ap- 
plied mathematics, but for over thirty years I was classed as a pure mathema- 
tician in the University of Cambridge, England, and some applied mathemati- 
cians there prefer to call themselves theoretical physicists. Moreover, I once 
heard a geophysicist with a mainly mathematical training say that he used 
‘applied mathematics’ as a term of abuse, meaning stuff which was not good 
mathematics and not really relevant to any physical problem. All these factors 
have made me think about the borderline between mathematics and its applica- 
tions, not only to physical problems of a more or less traditional type, but also 
to statistical, economic, and industrial problems. 

It is well known that the origins of some of the most abstract pure mathe- 
matics can be traced through the theory of Fourier series to a problem about 
vibrating strings, or through the theory of irrational numbers to Greek geom- 
etry and Egyptian devices for measurement of right angles, but the pure 
mathematicians of the last 100 or 150 years have been pursuing the mathematics 
for its own sake without any thought of vibrating strings. On the other hand 
many major new developments in pure mathematics were initiated quite 
specifically for the purpose of using them in some application. For instance this 
is certainly true of Newton’s contributions to the calculus and of probability 
theory, and this still seems to be happening in operational research and control 
theory. In distinguishing pure mathematics from applied two questions seem to 
arise. Is the work truly abstract and separated from all applications? And is it 
any more mathematical if it is truly abstract and pursued strictly for its own 
sake? 

If we delve into the beginnings of mathematical thought in very young 
children or primitive peoples, there is plenty of evidence to show that the power 
of complete abstraction comes very slowly, and indeed to many people it prob-
ably only ever comes in a very restricted sense. A number of eminent people take
the view that thought begins with the idea of actions performed in the mind
only, that is to say operations. According to Piaget an ordinary child by the
time he is two can work out how he is going to do something before he does it,
provided that the situation is simple and is familiar to him, but in order to under-
stand abstract mathematical concepts such as 1, 2, 3, 4, ..., the child has to
move from perceptions arising from his environment and actions to the abstrac-
tions, a long and gradual process. Much work has been done by Piaget and
Inhelder on the child’s conception of space, and, for instance, its powers to
distinguish between different kinds of figures such as a circle, a square, and a 
circle with a little one either inside or outside. Their experiments have thrown 
much light on the development of numerical, spatial, and physical concepts of a 
very elementary kind among young children, but it seems doubtful to me 
whether the abilities tested are always truly mathematical. For young black- 
birds will gape at a piece of black cardboard consisting of one large circle and two 
small ones attached to it, but they only gape at the small circle whose size is a 
certain proportion of that of the large circle. This indicates that the ability to 
distinguish between certain shapes may have psychological foundations. 

H. and H. A. Frankfort in an essay on myth and reality point out that 
ancient man could reason and work out the causes of things, but worked on very 
different hypotheses from ours. The primitive mind asks ‘who’ when it looks for 
a cause, and cannot withdraw far from perceptual reality. When the river does 
not rise, the river has refused to rise, and so the river or the gods intend to convey 
something to the people. At the same time primitive man used symbols much 
as we do, but he can no more conceive them as signifying, yet separate from, the 
gods or powers than he can consider a relationship—such as resemblance—as 
connecting, and yet separate from, the objects compared. Hence there is a co- 
alescence of the symbol and what it signifies, as there is coalescence of two 
objects compared so that one may stand for the other. 

Frankfort then gives an example of this coalescence in which pottery bowls 
with the names of hostile tribes were solemnly smashed at a ritual by the Egyp- 
tians in the belief that real harm was done to the enemies by the destruction of 
their names. It may seem a far cry from this to modern mathematics, but 
Bochner has drawn a parallel between mathematics and myth, and replaced 
myth by mathematics in some of Frankfort’s sentences. I am not prepared to 
go as far as he does by replacing the word myth by mathematics in a sentence 
which then asserts that mathematics transcends reasoning in that it wants to 
bring about the truth it proclaims. However, in the ritual we have two funda- 
mental features of mathematics, symbols representing something and operations 
on those symbols representing operations on the thing itself. Symbols and nota- 
tion are part of the essential basis of mathematics, and I believe that the devel- 
opment and standardization of a good notation is an extremely important part 
of the development of mathematics. 

If we turn to the extreme other end of the scale, we run into another kind 
of difficulty in separating the mathematics from its applications. Some pure
mathematicians seem to do their mathematical thinking in terms of idealized
physical and spatial ideas. The late G. H. Hardy, who taught me, was very much 
against applied mathematics, but in a footnote to a joint paper with J. E. 
Littlewood published in a Swedish periodical he wrote that a certain problem 
is most easily grasped in terms of cricket averages. Norbert Wiener would trans- 
late a mathematical problem into the language of Brownian motion, and I be- 
lieve that his thinking was completely abstract although I do not know the 
theory, or remember what he said well enough to be quite sure. Hadamard has 
described his visualization of the proof that there is a prime greater than 11. To 
consider all prime numbers from 2 to 11, i.e., 2, 3, 5, 7, 11, he visualized a con-
fused mass. Forming the product 2×3×5×7×11 = N, since N is large, he
visualized a point remote from the mass. Increasing the product by 1 he saw
another point a little beyond the first. N+1, if not a prime, is divisible by a
prime greater than 11; Hadamard saw a place between the mass and the first 
point. This seems to me to be a sort of mathematical shorthand and would cer- 
tainly have to be translated back to numbers before it could be communicated 
to anyone else. 
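
The arithmetic behind Hadamard's picture can be translated back into numbers in a few lines. This is only an illustrative sketch; the function names are hypothetical and the trial-division routine is a stand-in chosen for brevity, not anything described in the lecture.

```python
from math import prod


def smallest_prime_factor(m: int) -> int:
    """Smallest prime factor of an integer m >= 2, by trial division."""
    d = 2
    while d * d <= m:
        if m % d == 0:
            return d
        d += 1
    return m  # no divisor up to sqrt(m), so m itself is prime


def prime_beyond(primes):
    """A prime not in `primes`: the smallest prime factor of (product + 1)."""
    return smallest_prime_factor(prod(primes) + 1)


if __name__ == "__main__":
    primes = [2, 3, 5, 7, 11]
    n = prod(primes)  # the "point remote from the mass"
    print("N =", n, " N + 1 =", n + 1)
    print("prime greater than 11:", prime_beyond(primes))
```

Dividing N+1 by any of the listed primes leaves remainder 1, so whatever prime the function returns must lie beyond the list.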

As I said earlier, I have until now always been classed as a pure mathemati- 
cian, but Professor J. E. Littlewood and I did a lot of work on the theory of 
ordinary differential equations arising from problems of radio engineering. 
Littlewood is also a very pure mathematician in many ways, but he worked on 
antiaircraft gun fire in the First World War, and he translated our problems, 
which were suggested by radio valves and oscillations, capacitance and induc-
tance, etc., into dynamical problems and called all the solutions of our equations
‘trajectories’ as if they were the paths of missiles shot from a gun. In the radio 
problems there are oscillations with negative damping, and so we had periodic 
trajectories going up and down over and over again, and I am sure that the
abstraction was complete although there was often a certain woolliness until 
the argument was complete, just as in Hadamard’s visualization. Between these 
two extremes there are some users of complicated mathematics, physicists and 
engineers in particular, who are thinking all, or nearly all, the time in terms of 
the physics of the problem. Engineers have consulted me about a number of 
different types of problem, radio, control theory, oscillations of stretched wires; 
they usually come with some equations and very little explanation. I have to 
ask a lot of questions before they tell me everything relevant to the mathemati- 
cal problem. It seems difficult for them to think in abstract mathematical terms, 
the symbols to them seem to mean the engineering concepts, currents and circuit 
constants such as impedance and inductance. This is important in two ways.
The engineers have mental reservations and can check at every stage because 
they visualize how the physical system works. On the other hand they find it 
difficult to apply the mathematical processes used in one field to any other 
physical problem, even if they are just as relevant there. Some years ago at a 
conference for engineers I was asked to speak on Liapunov’s method for stability 
problems. I described the basic principles as simply as I could, and after I spoke
Professor Parks lectured on applications of the method. Many in the audience
commented that the order of our lectures should have been reversed, and that 
they would have understood my lecture much better if they had understood 
that I was talking about the phase plane. It is possible that this was partly a 
question of notation and terminology, but I believe that they could do advanced 
mathematics best by thinking of it in terms of their particular engineering prob- 
lems. The Liapunov method was developed mainly in connection with control 
engineering and by now has adopted much of its terminology, but the math-
ematics arising there needs to be abstracted and put in a form which makes it
available in connection with other applications. Problems of ordinary differen- 
tial equations have arisen in connection with astronomy, ballistics, radio engi- 
neering, control theory, mechanical oscillations of machinery; each application 
has special features, and the theory of it was often developed in a correct logical 
form quite a long way before it was fitted into the general theory of ordinary 
differential equations as pure mathematics. The individual who formulated the 
equation and asked the question is, in the sense of my title, thinking mathemat- 
ically, but he is not doing mathematics until he operates on his symbols. Please 
note that I do not say ‘asked for a solution of the equation’ because, although 
he may say that, he really wants to know something about the solutions in 
general. Is there a periodic solution? Is it stable? Will it remain stable if I change 
a certain parameter? Will the period be longer or shorter? He may find the 
methods which he needs in the literature and do the work himself. He may find 
a mathematician to help him. Although I myself have helped to develop the 
general theory and settle certain theoretical problems, I do not think that I have 
ever produced a result useful for any specific practical problem when it was 
needed. For soon after Littlewood and I began work on these problems, it was 
realized that the variations in individual thermionic valves were so great that
precise mathematical results were not worth the trouble, and satisfactory ex- 
perimental determinations could be more easily obtained. In recent times the 
person who formulates the mathematical statement of a physical or other real 
life problem usually does not do anything very original in the mathematical 
handling of it, although some interesting purely mathematical work on matrices 
appears in journals concerned with computing or applications to economics, 
detached from other pure mathematics. 

To sum up so far I believe that the dividing line between strictly abstract 
thinking in mathematics and thinking in terms of the real world is by no means 
clearly defined and some of the major developments in mathematics such as the 
Calculus were thought out more or less in terms of the real world. Further ab- 
straction does not necessarily make the mathematics any better. For the 
Babylonian schoolmasters constructed sets of most complicated artificial for- 
mulae, perhaps 200 on one tablet, for their pupils to simplify. Their mathematics 
was sufficiently abstract for them to be indifferent whether they added the num-
ber of men to the number of days. In present circumstances this seems abstrac-
tion at its worst, but perhaps then it was a step forward. The Babylonians must
have developed the laws of arithmetic a long way to set these complicated
exercises, but mainly for practical purposes whether it was accounting or 
astronomy. 

Now let us turn to those who do mathematics for its own sake. I should like 
to begin with the Hindu who in about 1200 B.C. wrote, “As crests on the heads 
of peacocks, as the gems on the hoods of snakes, so is ganita, mathematics, at 
the top of the sciences known as the Vedanga.” Ganita is literally the science of 
calculation and in the early days it consisted of finger arithmetic, mental arith- 
metic, and higher arithmetic in general. At first it included astronomy, but 
geometry belonged elsewhere. At one stage higher mathematics was called ‘dust 
work’ because it was done in sand spread on the board or on the ground. We owe 
our so-called Arabic numerals to the Hindus, and they advanced a long way in 
algebra very early. 

Most people consider that the Greeks were the first to do mathematics for 
its own sake and to realize the need for proof. The word ‘mathema’ meant origi- 
nally a subject of instruction, but very early it was restricted to mathematical 
subjects among which Pythagoras included geometry, theory of numbers, 
sphaeric (or spherical trigonometry used for astronomy), and music. They 
classified numbers not only as odd and even, but as even-even, $2^n$; even-odd,
$2(2n+1)$; odd-even, $2^{n+1}(2n+1)$, and also proved that there are an infinity of
primes. I doubt whether they could calculate as well as the Babylonians, but 
probably that did not attract them, and also they lacked the incentives provided 
by the government of a far flung empire. I feel that I have to remind myself of 
the difficulties due to the absence of convenient symbols. Sir Thomas Heath 
writing of the arithmetic of Nicomachus said ‘If the verbiage is eliminated, the 
mathematical content can be stated in quite a small compass,’ but Heath used 
modern notation and Arabic numerals. In the Wasps of Aristophanes one of the 
characters tells his father to do an easy sum ‘not with pebbles but with fingers,’ 
and Herodotus says that, in reckoning with pebbles, Greeks move left to right, 
Egyptians right to left, which implies vertical columns facing the reckoner. 

The Greeks also developed a theory of geometry which remained more 
important than any other for nearly 2,000 years, and was the first deliberate 
development of a logical system in mathematics. In the third century A.D. an 
unknown writer jokingly used words of Homer intended for something else to 
describe mathematics: 

Small at her birth, but rising every hour. 

She stalks on earth and shakes the world around. 

For, says Anatolius, Bishop of Laodicea, who quoted it, mathematics begins
with a point and a line and forthwith it takes in the heaven itself and all things 
within its compass. If this was the Greek viewpoint at such a late date, is it 
possible that their geometry was not truly abstract and that the symbols of 
point and line were still partly coalesced with the abstract point and line? 

The position of geometry and more generally spatial concepts in mathemat-
ics is not completely clear to me. In recent times all types of geometry have been
given an analytical basis and freed from the logical difficulties such as those
which used to worry schoolmasters teaching about congruent triangles by the 
method of superposition. I therefore ask myself whether geometry and spatial 
concepts are really part of the basis of mathematics or a field of application 
similar to mechanics, both terrestrial and celestial, or to games of chance. The 
reason for the traditional special position of geometry may be that in geometry 
the symbols are the objects themselves; the abstract point, line, and triangle are 
represented by a point, line, and triangle; what is more, so long as the geometry 
is plane geometry they can be drawn on a flat surface by pen or pencil on paper 
or in sand on the ground. When Greek geometry was being developed there was 
no good notation for dealing with numbers, and even in the 15th Century the 
solution of a cubic equation was described in geometrical terms and illustrated 
by a figure for lack of a good algebraic notation. In mechanics a comparable real 
life representation of motion could not be used to explain the theory; written 
symbols or geometrical figures were needed for communication. But if we ask 
whether the contributions of spatial concepts to modern mathematics are 
greater than those of other real life problems it is difficult to answer. Spatial 
thinking has led to the highly abstract theory of irrational numbers of Cantor 
and Dedekind, and permeates mathematical thought in almost all fields; the 
physical sciences have given rise to the calculus (not without the help of geome- 
try), and statistics and probability have their basis in multitudinous practical 
problems. 

Pfeiffer explains the situation well in relation to probability. Some of the 
salient points in his account are as follows: The history of probability theory 
(as is true of most theories) is marked both by brilliant intuition and discovery 
and by confusion and controversy. Until certain patterns had emerged to form 
the basis of a clear-cut theoretical model, investigators could not formulate 
problems with precision, and reason about them with mathematical assurance. 

From what some people say it sounds to me as if quantum theory had not 
yet reached this stage, but it is certainly beyond my competence to form a valid 
judgment. 

Pfeiffer continues by saying that although long experience was needed to 
produce a satisfactory theory, we need not retrace and relive the fumblings 
which delayed the discovery of an appropriate mathematical model. That is a 
mathematical system whose concepts and relationships correspond to the ap- 
propriate concepts and relationships of the real world. Once the model has been 
discovered, studied, and refined, it becomes possible for an ordinary mind to 
grasp, in a reasonably short time, a pattern which took decades of effort and the 
insight of genius to develop in the first place. I note that Pfeiffer asserts that 
the most successful model of probability theory known at present is character- 
ized by considerable mathematical abstractness. 

J. Willard Gibbs wrote ‘One of the principal objects of theoretical research 
in any department of knowledge is to find the point of view from which the 
subject appears in its greatest simplicity’ and Bushaw says that one of the dis-
tinctive characteristics of modern mathematics is its way of taking old math-
ematical ideas apart like watches, studying the parts separately, and putting
these parts together again in new and interesting combinations and studying 
these complications in turn. I believe that this process has contributed enor- 
mously to this simplification in mathematics itself, and so made it more readily 
available for applications. Mandelbrojt referring to the quotation from Willard 
Gibbs says ‘Integration in function spaces provided such a point of view over 
and over again in widely scattered areas of knowledge and it gave us not only a 
new way of looking at problems but actually a new way of thinking about them.’ 
Now one might call Fréchet the father of abstract spaces, and in the front of
his book he puts a quotation from Hadamard’s survey of functional analysis
given in 1911. ‘The functional continuum does not present any simple concept
to our imagination. Geometrical intuition tells us nothing a priori about it. We
are forced to remedy this ignorance and we can do it only analytically, by creat- 
ing a chapter of the theory of sets for handling the functional continuum.’ Else- 
where Hadamard wrote that the calculus of variations was nothing but the first 
chapter of functional analysis, and of his own work on the calculus of variations, 
hyperbolic partial differential equations, and certain other topics he said that 
he owed the greater part to his contacts with the physicist Duhem, through 
Duhem’s book on hydrodynamics, elasticity, and acoustics and many conversa- 
tions when they were both at Bordeaux. So we have a record here of the complete 
cycle from a physical basis through the calculus of variations to functional 
analysis and abstract spaces, and thence to a multitude of applications through 
the process of analyzing geometrical ideas and putting them together again in a 
most abstract new way to create function spaces. 

A further variation on this pattern has become evident of recent years and 
that is the use of an auxiliary model consisting of various graphical, mechanical, 
and other aids to visualizing, remembering, and even discovering things about 
the mathematical model. The visual images of Hadamard, Hardy’s cricket 
averages, and Littlewood’s trajectories might be considered as auxiliary models, 
but of more universal significance are the analogue machines with electronic 
devices which simulate what happens in, for instance, fluid mechanics, or rather 
what corresponds in the mathematical model. We now have 

(A) The real world of actual phenomena, known to us by various ways of 
experiencing these phenomena. 

(B) The abstract world of the mathematical model which uses symbols to 
state relationships and facts with great precision and economy. 

(C) The auxiliary model. 

The transition from A to B is the formulation of real world phenomena in 
mathematical terms; the transition B to A is the interpretation of the deduction 
by pure mathematics from that formulation. Both these I consider to be think- 
ing mathematically, but only the deductions inside B are mathematics. We may 
also think mathematically by moving from B to C which is a secondary inter-
pretation, and then either back to B to confirm what C has suggested or from C
direct to A. 

As Pfeiffer points out, the value of both the mathematical model and the 
auxiliary model depends on how successfully the appropriate features of the 
model may be related to the ‘real-life’ situation. The models cannot be used to 
prove anything about the real world, although a study of them may help us to
discover important facts about the real world. A model is not true or false; it 
fits or it does not fit. It is unsatisfactory if either (1) the solutions of the model 
problems have unrealistic interpretations, for instance, arbitrarily large quan- 
tities or arbitrarily fine differences, or (2) it is incomplete or inconsistent so that 
the mathematics produces contradictions. Many models fit amazingly well. 
Karl Pearson wrote ‘The mathematician, carried along on his flood of symbols, 
dealing apparently with purely formal truths, may still reach results of endless 
importance for our description of the physical universe.’ 

Until perhaps 100 years ago many scientists and mathematicians knew a 
bit of everything, and the mathematical formulation, as I said of Newton in 
particular, was done by someone who was a good enough mathematician to 
develop the mathematics to a considerable extent. This is particularly true of 
Sir Isaac Newton, but in these days of specialization the scientist or economist, 
or worker in close contact with the real world situation must do stage A → B.
Sir Cyril Hinshelwood, former President of the Royal Society, said ‘Scientists 
need to be taught mathematics as a language they can actually speak. It is of 
great importance for the scientist to be able to learn the art of formulating prob- 
lems in mathematical terms which of course is a quite difficult job. You have to 
think very accurately and carefully about a problem before you can do it. You 
have to have practice in speaking the language of mathematics. It does not 
matter being an expert in differential equations. You can go to the expert for 
help in solving an equation. But you cannot expect the mathematician to do the 
translation into mathematics. There should be an early and rather intensive 
cultivation of the power of thinking about real things and the application of 
mathematical symbolism to physical ideas.’ He went on to draw a parallel be-
tween learning simple French as a child and learning to express physical ideas
in mathematics when the level of physics and mathematics reached are both 
elementary, so that the child becomes accustomed to the process by easy stages. 
Although he advocates, as I do, that the scientist should do the mathematical 
formulation, his words seem to imply an incomplete abstraction. In his mind 
the mathematical symbols were still representing their physical counterparts, 
not that this matters for a scientist who has access to an expert mathematician, 
but it is clear from Mandelbrojt’s remarks on function spaces and Hadamard’s 
remarks about the functional continuum that without complete abstraction on 
the part of some mathematicians we should lack some of the most expressive 
parts of the mathematical language used by scientists. 


* * *

Since delivering the above lecture I came across the following passage by
A. Robinson in Some thoughts on the history of mathematics, Compositio Math., 
20 (1968) 188-193. 

‘Euclid’s geometry was supposed to deal with real objects, whether in the 
physical world or in some ideal world. The definitions which preface several 
books in the Elements are supposed to communicate what object the author is 
talking about even though, like the famous definition of the point and the line, 
they may not be required in the sequel. The fundamental importance of the 
advent of non-Euclidean geometry is that by contradicting the axiom of parallels 
it denied the uniqueness of geometrical concepts and hence, their reality. By the 
end of the nineteenth century, the interpretation of the basic concepts of geome- 
try had become irrelevant. This was the more important since geometry had 
been regarded for a long time as the ultimate foundation of all mathematics. 
However, it is likely that the independent development of the foundations of the 
number system which was sparked by the intricacies of analysis would have 
deprived geometry of its predominant position anyhow.’ 

Although it confirms my views on Euclidean geometry, it does not seem to 
recognize the geometrical origin of the theory of irrational numbers. 

I also noticed that A. Aaboe in Episodes from the Early History of Mathemat-
ics, writes ‘Even the oft repeated statement that the Egyptians knew the 3, 4, 5
right angle has no basis in available texts, but was invented about 80 years ago.’


The above lecture was given as the Samuel Newton Taylor Lecture at Goucher College, 
January 30, 1969. 
SHARPENED FORMS OF THE PLANE
ISOPERIMETRIC INEQUALITY 


DONALD C. BENSON, University of California, Davis 


1. Introduction. By the isoperimetric inequality we mean here the following:
Let $\Gamma$ be a simple closed rectifiable curve in $E^2$ with perimeter $L$ and area $A$.
Then

(1.1)    $L^2 - 4\pi A \ge 0,$

and equality holds only if $\Gamma$ is a circle.
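As a numerical illustration of (1.1), the following sketch evaluates the deficit $L^2 - 4\pi A$ for regular polygons inscribed in the unit circle. The function names and the use of the shoelace formula for the area are choices made here for illustration; they are not taken from the paper.

```python
import math

def perimeter_and_area(points):
    """Perimeter and absolute shoelace area of a simple closed polygon."""
    n = len(points)
    perimeter = 0.0
    twice_area = 0.0
    for k in range(n):
        x0, y0 = points[k]
        x1, y1 = points[(k + 1) % n]
        perimeter += math.hypot(x1 - x0, y1 - y0)
        twice_area += x0 * y1 - x1 * y0
    return perimeter, abs(twice_area) / 2.0

def isoperimetric_deficit(points):
    """Return L^2 - 4*pi*A, which (1.1) asserts is nonnegative."""
    L, A = perimeter_and_area(points)
    return L * L - 4.0 * math.pi * A

def regular_polygon(n, radius=1.0):
    """Vertices of a regular n-gon inscribed in a circle of the given radius."""
    return [(radius * math.cos(2 * math.pi * k / n),
             radius * math.sin(2 * math.pi * k / n)) for k in range(n)]

square_deficit = isoperimetric_deficit(regular_polygon(4))
many_sided_deficit = isoperimetric_deficit(regular_polygon(1000))
```

The deficit of the square is $32 - 8\pi$, and it shrinks toward zero as the polygon approaches the circle, in line with the equality case.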

By introducing parameters of the curve other than L and A, it is possible to 
obtain sharpened forms of (1.1) in which the right hand side is replaced by a 
nonnegative quantity which is not identically zero. See [1, 80-83, 111-113] and 
[8, p. 591] for examples of this sort. (In [1, p. 81] certain formulas are incorrect; 
see [2, p. 140].) 

In Section 2, a sharpened form of (1.1) is found which yields in special cases 
two different inequalities of T. Kubota [3]. (The methods used are quite differ- 
ent from those of Kubota.) 

In Section 3, it is assumed that the curve $\Gamma$ is contained in an annulus with
outer and inner radii $r_1$ and $r_2$, respectively. It is further assumed that $\Gamma$ winds
once around the inner circle and that it contains points both on the inner and
outer circles. A sharpening of (1.1) is given in which $r_1$ and $r_2$ occur as
parameters.

In considerations of this sort it is usual to use the theory of convex sets. 
This will not be done here; the results are all demonstrated directly for rectifi- 
able curves. The methods are analytical rather than geometrical. In each case 
one expresses the geometrical quantities by certain integrals. The results are 
obtained by using Cauchy’s inequality and by transforming some of the inte- 
grals by a suitable substitution. 

The author has obtained inequalities by similar methods in [4] and [5]. 


2. A generalization of two inequalities of T. Kubota. Let $\Gamma$ be a simple closed
rectifiable curve in the x-y plane. Let $w$ be the width of $\Gamma$ in some arbitrary
direction. Choose a coordinate system so that $\Gamma$ lies in the strip $|y| \le w/2$ and
so that $\Gamma$ contains points on the lines $y = w/2$ and $y = -w/2$.

We parametrize $\Gamma$ with the positive orientation by means of arc length $s$.
It is known that $x$ and $y$ on $\Gamma$ are absolutely continuous functions of $s$.
(See [6, p. 17].) Putting $x'$ and $y'$ for $dx/ds$ and $dy/ds$, and observing that $x'$
and $y'$ exist for almost all $s$ on $\Gamma$ and satisfy $x'^2 + y'^2 = 1$, we have the following
expression for the total arc length $L$ of $\Gamma$:

(2.1)    $L = \int_\Gamma ds = \int_\Gamma (x'^2 + y'^2)^{1/2}\, ds.$


From Cauchy’s inequality we have

(2.2)    $(x'^2 + y'^2)^{1/2} = [(2y/w_0)^2 + (1 - (2y/w_0)^2)]^{1/2}\,(x'^2 + y'^2)^{1/2}$
         $\ge (2|y|/w_0)\,|x'| + (1 - (2y/w_0)^2)^{1/2}\,|y'|,$

where $w_0$ is a parameter which must satisfy $w \le w_0$ so that the square root in
(2.2) is real on $\Gamma$. We shall discuss special choices for $w_0$ later. Integrating (2.2)
we have

(2.3)    $L \ge \frac{2}{w_0} \int_\Gamma |y x'|\, ds + \int_\Gamma (1 - (2y/w_0)^2)^{1/2}\, |y'|\, ds.$

Equality holds in (2.3) if and only if the first order differential equation

(2.4)    $(w_0^2 - (2y)^2)^{1/2}\, |x'| = 2\,|y y'|$

is satisfied almost everywhere. A closed curve $\Gamma$ is a solution of (2.4) if and only if
it can be formed by fitting together arcs of circles of radius $w_0/2$ centered on
the x-axis and segments of the lines $y = w_0/2$ and $y = -w_0/2$. (E.g., see Figure 1.)


FIG. 1


For the area $A$ enclosed by $\Gamma$ we have

(2.5)    $A = \left| \int_\Gamma y x'\, ds \right| \le \int_\Gamma |y x'|\, ds,$

with equality if and only if $y x'$ has the same algebraic sign at all points at which
$x'$ is defined. We may choose arc length so that $y(0) = -w/2$ and for $s = s_1$ we
have $y(s_1) = w/2$. We treat the second term on the right of (2.3) as follows, using
a standard theorem on integration by substitution [7, p. 221]:

(2.6)    $\int_\Gamma (1 - (2y/w_0)^2)^{1/2}\, |y'|\, ds \ge \int_0^{s_1} (1 - (2y/w_0)^2)^{1/2}\, y'\, ds - \int_{s_1}^{L} (1 - (2y/w_0)^2)^{1/2}\, y'\, ds$
         $= 2 \int_{-w/2}^{w/2} (1 - (2y/w_0)^2)^{1/2}\, dy = \frac{w}{w_0}(w_0^2 - w^2)^{1/2} + w_0 \arcsin\frac{w}{w_0}.$
Equality holds in (2.6) if and only if $y'$ is nonnegative on $[0, s_1]$ and nonpositive
on $[s_1, L]$. Putting together (2.6), (2.5), and (2.3) we have

(2.7)    $L \ge \frac{2A}{w_0} + \frac{w}{w_0}(w_0^2 - w^2)^{1/2} + w_0 \arcsin\frac{w}{w_0},$

for any $w_0 \ge w$. Inequality (2.7) is the main result of this section.
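The right hand side of (2.7) is easy to evaluate numerically. The sketch below does so and checks it against a racetrack, for which equality is expected when $w_0 = w$. The helper names and the sample dimensions are illustrative assumptions, not part of the paper.

```python
import math

def length_lower_bound(area, width, w0):
    """Right hand side of (2.7); requires w0 >= width > 0."""
    if w0 < width:
        raise ValueError("w0 must be at least the width w")
    return (2.0 * area / w0
            + (width / w0) * math.sqrt(w0 * w0 - width * width)
            + w0 * math.asin(width / w0))

def racetrack(width, straight):
    """Length and area of a racetrack: two semicircles of diameter `width`
    joined by two straight segments of length `straight`."""
    length = math.pi * width + 2.0 * straight
    area = math.pi * width ** 2 / 4.0 + width * straight
    return length, area

L, A = racetrack(width=2.0, straight=3.0)
bound_at_w = length_lower_bound(A, 2.0, 2.0)     # the choice w0 = w
bound_larger = length_lower_bound(A, 2.0, 2.5)   # some w0 > w
```

For this racetrack the bound with $w_0 = w$ reproduces $L$, while a larger $w_0$ gives a valid but weaker estimate.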

Now let us look more carefully at the conditions under which equality holds
in (2.7). We may assume, without loss of generality, that $\Gamma$ is parametrized by
arc length $s$ ($0 \le s \le L$) with the usual positive orientation, and that we have, as
above, $y(0) = y(L) = -w/2$, $y(s_1) = w/2$. Then the conditions mentioned following
(2.3), (2.5), and (2.6) give the following necessary and sufficient conditions
for equality in (2.7):

(a) The curve $\Gamma$ consists of arcs of circles with radius $w_0/2$ centered on the
x-axis and possibly segments of the lines $y = w_0/2$ and $y = -w_0/2$. (Note that
the line segments can occur only if $w = w_0$.)

(b) On $\Gamma$, $x$ is an increasing function of $s$ wherever $y$ is positive, and decreasing
wherever $y$ is negative. (The positive orientation of $\Gamma$ is used here.)

(c) On $\Gamma$, $y$ is nondecreasing for $s$ in $[0, s_1]$ and nonincreasing for $s$ in
$[s_1, L]$.

The curves $\Gamma$ which satisfy (a), (b), and (c) fall into two different types,
depending on whether equality holds in (2.7) with a choice of $w_0$ such that
$w_0 > w$ or such that $w_0 = w$. If equality holds with $w_0 > w$, then (a), (b), and (c)
imply that $\Gamma$ is a “lens” of diameter $w$, i.e., the boundary of the intersection of
two circular discs with centers on the x-axis and diameter $w_0$. If equality holds
with $w_0 = w$, then (a), (b), and (c) imply that $\Gamma$ is a “racetrack,” i.e., the boundary
of the convex hull of two circular discs situated as above; in this case note that
$w$ is the minimum width of $\Gamma$. (Of course, we intend that a circle of diameter $w$ is
a special case both of the racetrack and the lens.)

Suppose the width $w$ in an arbitrary direction and the area $A$ of a closed
curve $\Gamma$ are given. How shall we choose $w_0$ in order to give the best estimate
of $L$ in (2.7)?

If $A \ge \pi w^2/4$, then there is a racetrack having $A$ as its area and $w$ as its
minimum width. Putting $w = w_0$, equality holds in (2.7) for this racetrack. This
shows that the right hand side of (2.7) is maximum for this choice of $w_0$, and
(2.7) reads

(2.8)    $L \ge 2A/w + \pi w/2.$


REMARK. If it is known that $w$ is actually the diameter $d$ of $\Gamma$, then the
foregoing case occurs only if $\Gamma$ is a circle of diameter $d$ because the inequality of
Bieberbach ([1] p. 76) asserts

         $A \le \pi d^2/4$

with equality only in case $\Gamma$ is a circle of diameter $d$.

Inequality (2.8) was observed by Kubota [3]. The isoperimetric inequality
(1.1) follows from it because (2.8) may be written in the form

(2.9)    $L^2 - 4\pi A \ge \left( \frac{2A}{w} - \frac{\pi w}{2} \right)^2.$


If $A < \pi w^2/4$, then (2.8) holds, but a sharper result is possible. In this case
there is a lens with area $A$ and diameter $w$. The right hand side of (2.7) will be
maximized if we put for $w_0$ the diameter of the circular discs whose intersection
gives this lens. Putting $w/w_0 = \sin\phi$, we see that to satisfy this geometrical
condition we must choose $w_0$ so that the transcendental equation

(2.10)    $2A/w^2 = \phi \csc^2\phi - \cot\phi$

is satisfied. One can show that if $A < \pi w^2/4$, then (2.10) has exactly one solution.
Thus for $A < \pi w^2/4$ (2.7) gives us

(2.11)    $L \ge 2w\phi^* \csc\phi^*$

where $\phi = \phi^*$ is the solution of (2.10).
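Since (2.10) is transcendental, $\phi^*$ has to be found numerically. A minimal sketch using bisection follows; it assumes that the right hand side of (2.10) is increasing on $(0, \pi/2)$, which can be checked by differentiation, and all function names are hypothetical.

```python
import math

def lens_ratio(phi):
    """Right hand side of (2.10): phi*csc(phi)^2 - cot(phi)."""
    s = math.sin(phi)
    return phi / (s * s) - math.cos(phi) / s

def solve_phi_star(area, width, tol=1e-14):
    """Bisection for the root of 2A/w^2 = lens_ratio(phi) on (0, pi/2)."""
    target = 2.0 * area / width ** 2
    if not 0.0 < target < math.pi / 2.0:
        raise ValueError("need 0 < A < pi*w^2/4")
    lo, hi = 0.0, math.pi / 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)          # mid is never 0, so sin(mid) > 0
        if lens_ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def lens_length_bound(area, width):
    """Right hand side of (2.11)."""
    phi = solve_phi_star(area, width)
    return 2.0 * width * phi / math.sin(phi)
```

Feeding in the area of an actual lens should return its own perimeter, and the result should exceed the estimate (2.8) for the same $A$ and $w$.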

We have shown that (2.8) and (2.11) give sharp estimates for the length of
$\Gamma$ if the width $w$ in an arbitrary direction and the area $A$ of a closed curve $\Gamma$ are
given. We may also consider the question of determining the best possible upper
bound for $A$ if $w$ and $L$ are fixed. Inequality (2.7) may be written

(2.12)    $2A \le w_0 L - w(w_0^2 - w^2)^{1/2} - w_0^2 \arcsin(w/w_0).$


For what choice of $w_0$ is the right hand side of (2.12) the best possible upper
bound for the area $A$? There are two cases to consider. Either we have $L \ge \pi w$
or $2w \le L < \pi w$. (We always have $2w \le L$.) In the first case ($L \ge \pi w$) there exists
a racetrack with the length $L$ and minimum width $w$ which gives equality in
(2.12) with $w = w_0$. Thus (2.8) gives the best upper bound for $A$ if $L$ and $w$ are fixed
and satisfy $L \ge \pi w$. However, the case $2w \le L < \pi w$ gives something different.
As before, put $\sin\phi = w/w_0$. Here we find that there is a lens with length $L$ and
diameter $w$ which gives equality in (2.12) provided $\phi$ satisfies the transcendental
equation

(2.13)    $2\phi w = L \sin\phi,$

and, with $\phi^{**}$ equal to the unique solution of (2.13), (2.12) reads

(2.14)    $A \le \frac{L}{8\phi^{**}} (L - 2w \cos\phi^{**}).$

Inequality (2.14) is due to Kubota in the same paper [3] which contains
inequality (2.8).
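The bound (2.14) can be evaluated in the same spirit. The sketch below solves (2.13) by bisection, assuming $2w < L < \pi w$ and using the fact that $\phi/\sin\phi$ is increasing on $(0, \pi/2)$; the function names are illustrative only.

```python
import math

def solve_phi_double_star(length, width, tol=1e-14):
    """Bisection for 2*phi*w = L*sin(phi) on (0, pi/2), assuming 2w < L < pi*w."""
    ratio = length / (2.0 * width)      # equals phi / sin(phi) at the root
    if not 1.0 < ratio < math.pi / 2.0:
        raise ValueError("need 2w < L < pi*w")
    lo, hi = 0.0, math.pi / 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)           # mid is never 0
        if mid / math.sin(mid) < ratio:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def area_upper_bound(length, width):
    """Right hand side of (2.14)."""
    phi = solve_phi_double_star(length, width)
    return length / (8.0 * phi) * (length - 2.0 * width * math.cos(phi))
```

For a lens with half-angle $\phi_0$ and diameter $w$, whose length is $2w\phi_0/\sin\phi_0$, the bound should return the lens area itself and should be smaller than the upper bound for $A$ obtained by rearranging (2.8).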

REMARK. If the area, the diameter $d$, and the minimum width $w^*$ of $\Gamma$ are
known, one may consider the problem of maximizing the right hand side of
(2.7) as a function of $w$ and $w_0$ subject to the constraints $w \le w_0$ and
$w^* \le w \le d$. It does not seem of interest, however, to do so, because the lower bound
for $L$ which results is not sharp, i.e., in general there will not exist a curve $\Gamma$
with prescribed $A$, $d$, and $w^*$ which gives equality in (2.7). To give such a sharp
lower bound for $L$ appears to be an interesting problem.


3. An inequality involving $L$, $A$, $r_1$, and $r_2$. Again let $\Gamma$ denote a simple closed
rectifiable curve in $E^2$. Let the origin for a polar coordinate system be a point $O$
inside the curve.

Let us choose arc length $s$ for parameter of $\Gamma$ in such a way that $s$ gives the
positive orientation to $\Gamma$. Then since $x$ and $y$ on $\Gamma$ are absolutely continuous
functions of $s$, it follows that the polar coordinates, $r$ and $\theta$, are absolutely
continuous functions of $s$. (See [6, p. 17].) The derivatives $dr/ds$ and $d\theta/ds$ are
defined for almost all $s$ on $\Gamma$.

Let $r_1$ and $r_2$ denote, respectively, $\sup\{r \mid (r, \theta) \in \Gamma\}$ and $\inf\{r \mid (r, \theta) \in \Gamma\}$.
Let $E$ and $F$ denote positive constants such that the relations

(3.1)    $F r_1 - E r_1^{-1} = 1,$
         $F r_2 - E r_2^{-1} = -1$

hold unless $r_1 = r_2$, in which case $\Gamma$ is a circle with center at $O$. In fact, it is not
difficult to compute explicitly from (3.1) the following:

(3.2)    $r_1 = (2F)^{-1}(1 + (1 + 4EF)^{1/2}),$
         $r_2 = (2F)^{-1}(-1 + (1 + 4EF)^{1/2}),$

(3.3)    $F = (r_1 - r_2)^{-1},$
         $E = r_1 r_2 (r_1 - r_2)^{-1}.$

From Cauchy’s inequality we have, putting $r'$ and $\theta'$ for $dr/ds$ and $d\theta/ds$,
respectively,

(3.4)    $(r'^2 + r^2\theta'^2)^{1/2}(1 + 4EF)^{1/2} \ge (Fr + Er^{-1})\, r|\theta'| + (1 - (Fr - Er^{-1})^2)^{1/2}\, |r'|.$

Equality holds in (3.4) if and only if the differential equation

(3.5)    $(1 - (Fr - Er^{-1})^2)^{1/2}\, r|\theta'| = (Fr + Er^{-1})\, |r'|$

holds a.e. Evidently one solution to (3.5) is

(3.6)    $\sin(\theta + c) = Fr - Er^{-1},$

which is the equation of a circle with $\sup r = r_1$ and $\inf r = r_2$. Other solutions are
the circles $r = r_1$ and $r = r_2$. Let us put $L$ for the length of $\Gamma$ and $A$ for the area
enclosed by $\Gamma$. Noticing $r'^2 + r^2\theta'^2 = 1$ a.e., and using a standard theorem on
integration by substitution we have

(3.7)    $(1 + 4EF)^{1/2} L \ge \int_\Gamma (Fr + Er^{-1})\, r|\theta'|\, ds + \int_\Gamma (1 - (Fr - Er^{-1})^2)^{1/2}\, |r'|\, ds$
         $\ge 2FA + 2\pi E + \int_\Gamma (1 - (Fr - Er^{-1})^2)^{1/2}\, |r'|\, ds.$




Equality holds in the first inequality of (3.7) if and only if the differential
equation (3.5) holds a.e. Equality holds in the second inequality of (3.7) if and
only if $\theta' \ge 0$ a.e. Now, transforming the integral on the right hand side of
(3.7) by means of the substitution $u = Fr - Er^{-1}$, we have

(3.8)    $\int_\Gamma (1 - (Fr - Er^{-1})^2)^{1/2}\, |r'|\, ds$
         $\ge \int_{r=r_2}^{r=r_1} (1 - (Fr - Er^{-1})^2)^{1/2}\, r'\, ds - \int_{r=r_1}^{r=r_2} (1 - (Fr - Er^{-1})^2)^{1/2}\, r'\, ds$
         $= F^{-1} \int_{-1}^{1} (1 + u(u^2 + 4EF)^{-1/2})(1 - u^2)^{1/2}\, du$
         $= F^{-1} \int_{-1}^{1} (1 - u^2)^{1/2}\, du = \pi/(2F),$

using (3.1) and noting that $u(u^2 + 4EF)^{-1/2}(1 - u^2)^{1/2}$ is an odd function. Equality
holds in (3.8) if and only if $\Gamma$ consists of two disjoint connected arcs $\Gamma_1$ and
$\Gamma_2$ such that $r$ is a nondecreasing function of $s$ on $\Gamma_1$ and nonincreasing on $\Gamma_2$.
Putting (3.7) and (3.8) together we have the inequality

(3.9)    $L \ge (2FA + 2\pi E + \pi/(2F))(1 + 4EF)^{-1/2}.$


The necessary and sufficient conditions for equality in (3.9) are just the union 
of the conditions for equality in (3.7) and (3.8). One sees that equality holds in 
(3.9) only for the circles (3.6). 
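The equality case can be checked numerically. In the sketch below, $E$ and $F$ are obtained by solving the two linear relations (3.1) for given $r_1 > r_2 > 0$, and the right hand side of (3.9) is then evaluated for a circle whose center is displaced from the origin. The function names and the sample figures are assumptions made for illustration.

```python
import math

def annulus_constants(r1, r2):
    """E and F solving (3.1): F*r1 - E/r1 = 1 and F*r2 - E/r2 = -1."""
    if not r1 > r2 > 0.0:
        raise ValueError("need r1 > r2 > 0")
    F = 1.0 / (r1 - r2)
    E = r1 * r2 / (r1 - r2)
    return E, F

def annulus_length_bound(area, r1, r2):
    """Right hand side of (3.9)."""
    E, F = annulus_constants(r1, r2)
    numerator = 2.0 * F * area + 2.0 * math.pi * E + math.pi / (2.0 * F)
    return numerator / math.sqrt(1.0 + 4.0 * E * F)

# Circle of radius 2 whose center lies at distance 0.5 from the origin,
# so that r1 = 2.5 and r2 = 1.5.
circle_bound = annulus_length_bound(math.pi * 2.0 ** 2, 2.5, 1.5)

# Square of side 2 centered at the origin: r1 = sqrt(2), r2 = 1, A = 4, L = 8.
square_bound = annulus_length_bound(4.0, math.sqrt(2.0), 1.0)
```

For the displaced circle the bound equals the circumference $4\pi$; for the square it lies strictly between the isoperimetric estimate $(4\pi A)^{1/2}$ and the true perimeter 8.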

Inequality (3.9) is the goal of this section; it is a sharpened form of the
isoperimetric inequality (1.1). This can be seen by rewriting (3.9) in the form

(3.10)    $L^2 - 4\pi A \ge \frac{(2FA - 2\pi E - \pi/(2F))^2}{1 + 4EF}.$
ULAM’S CONJECTURE AND GRAPH RECONSTRUCTIONS
P. V. O'NEIL, College of William and Mary 


1. Introduction. Henri Poincaré once meditated upon how it can happen 
that some people do not understand mathematics. Perhaps one reason is that 
most worthwhile questions nowadays are too difficult to explain. Anyone in a 
position to understand the problem can usually follow the solution. In this 
connection, Ulam’s conjecture is a pleasant example of a mathematical rarity— 
a nontrivial, easily stated problem that anyone can take a crack at. At least in 
its graph-theoretic formulation, Ulam’s conjecture is simpler to explain than, 
say, baseball or Monopoly. Also, reminiscent of seventeenth and eighteenth 
century mathematics, the clever amateur probably stands as good a chance as 
anyone of settling it. 

In this paper, we discuss Ulam’s conjecture as it relates to graph theory, 
together with some results and questions in the area of graph reconstructions 
to which it naturally leads. Theorems and proofs will be stated very informally, 
and, with the exception of the existence theorem of Section 5, none are new. 


2. Ulam’s version. In his famous problem book [10, p. 29], Ulam asked the
following question:

Suppose $A$ and $B$ are sets with $n$ elements each ($n \ge 3$). A metric $\rho$ is given
on $A$ with the property that $\rho(x, y)$ is either 1 or 2 whenever $x$ and $y$ are in $A$
and $x \ne y$. A similar metric is given on $B$. Now suppose that the $n-1$ element
subsets of $A$ and $B$ can be labelled, $A_1, \ldots, A_n$ and $B_1, \ldots, B_n$, in such a way
that each $A_i$ is isometric to $B_i$. Does this force $A$ to be isometric to $B$?

Ulam’s conjecture is that it does. 

This certainly seems a plausible guess, and this author has even been told 
that it is obvious, although not by anyone who has attempted to write down a 
proof. We shall not be concerned directly with the statement just given, but 
shall instead turn immediately to a graph theoretic version. The advantages to 
be gained are concreteness and a certain visual appeal which the term isometry 
lacks by itself. These may be more than just fanciful, as most of the progress in 
this area seems to have been made by adopting the graph theorist’s point of 
view. 


3. A graph theoretic formulation. Surprisingly, the barest essentials of graph 
theory are sufficient to restate the problem. We shall briefly review these below. 
The interested reader can pursue the subject in [1, 3, 4, 8, 9]. 

A graph may be thought of as a finite, nonempty set of points (vertices) 
and lines (branches) connecting distinct points. It is easy to show that any graph 
can be drawn in 3-space so that branches intersect only at vertices (for a proof,
see [3, p. 8]). For obvious reasons, graphs are in practice usually drawn in the 
plane. Personal taste is allowed, but disregarded, in the drawing of graphs. 
Thus, the graphs of Figure 1 are considered the same, because a bit of stretching 
and bending (no tearing) will make them identical. Such graphs are said to be 
isomorphic. More carefully, G and K are isomorphic if there is a 1-1 function f 
between their vertices such that a branch between x and y is in G exactly when 
the branch between f(x) and f(y) is in K.


FIG. 1


Finally, if x is a vertex of G, then G—x denotes the graph obtained by delet- 
ing x and all branches incident with x from G. 

We now have enough to represent Ulam’s conjecture graphically. Given the
elements $a_1, \ldots, a_n$ of $A$, mark $n$ points $x_1, \ldots, x_n$ as vertices of a graph $G_A$.
Interpret $\rho(a_i, a_i) = 0$ to mean there is no branch from $x_i$ to $x_i$. If $i \ne j$, draw a
branch between $x_i$ and $x_j$ exactly when $\rho(a_i, a_j) = 1$. Similarly, draw a graph $G_B$
to represent the metric space $(B, \sigma)$. Ulam’s conjecture now reads:

If $G_A - x_i$ is isomorphic to $G_B - y_i$ for each $i$, then $G_A$ is isomorphic to $G_B$.

A slightly different point of view is provided by the notion of a graph
reconstruction. Suppose we start with $n$ graphs, $H_1, \ldots, H_n$, each with $n-1$ vertices.
A graph $K$ is called a reconstruction of the $H_i$’s if $K$ has $n$ vertices, $t_1, \ldots, t_n$,
such that $K - t_i$ is essentially (isomorphic to) $H_i$. In the above discussion, $G_A$
is a reconstruction of the graphs $G_A - x_i$, as is $G_B$ of the $G_B - y_i$. If we regard
$G_A - x_i$ as essentially the same graph as $G_B - y_i$ (through the isomorphism), and
call it, say, $H_i$, then we have the following version of the problem.

If $K$ and $K'$ are reconstructions of $H_1, \ldots, H_n$, then $K$ and $K'$ are isomorphic.

Thus, the problem becomes one of essential uniqueness of graph reconstruc- 
tions. This is Ulam’s conjecture in its most concrete form. 
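For small graphs the definitions above can be tested by brute force. The sketch below stores a graph as a list of vertices together with a set of two-element frozensets, forms the graphs $G - x$, and checks whether a given graph is a reconstruction of a given list of graphs. The representation and all function names are assumptions made for illustration; the isomorphism test tries every vertex bijection and is only practical for a handful of vertices.

```python
from itertools import permutations

def graph(vertices, pairs):
    """A graph as (list of vertices, set of two-element frozensets)."""
    return (list(vertices), {frozenset(p) for p in pairs})

def delete_vertex(g, x):
    """G - x: drop x and every branch incident with x."""
    vertices, edges = g
    return ([v for v in vertices if v != x], {e for e in edges if x not in e})

def isomorphic(g, h):
    """Brute-force search for a 1-1 map of vertices that preserves branches."""
    (gv, ge), (hv, he) = g, h
    if len(gv) != len(hv) or len(ge) != len(he):
        return False
    for image in permutations(hv):
        f = dict(zip(gv, image))
        if all(frozenset(f[v] for v in e) in he for e in ge):
            return True
    return False

def is_reconstruction(k, cards):
    """True if the vertices t_i of K can be matched with the given graphs
    H_i so that K - t_i is isomorphic to H_i."""
    kv, _ = k
    if len(kv) != len(cards):
        return False
    unused = [delete_vertex(k, t) for t in kv]
    for card in cards:
        for idx, sub in enumerate(unused):
            if isomorphic(sub, card):
                del unused[idx]      # each K - t_i may be used only once
                break
        else:
            return False
    return True

path = graph(range(4), [(0, 1), (1, 2), (2, 3)])
star = graph(range(4), [(0, 1), (0, 2), (0, 3)])
path_cards = [delete_vertex(path, v) for v in path[0]]
```

Greedy matching suffices here because isomorphism is an equivalence relation, so only the multiset of isomorphism classes matters.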


4. Some results on graph reconstructions. There are two quite different
ways of looking at graph reconstructions. First, given a reconstruction $G$ of
$G - x_1, \ldots, G - x_n$, are there any essentially different reconstructions? Of equal
interest is a second point of view: given $n$ graphs $H_1, \ldots, H_n$, each with $n-1$
vertices, does a reconstruction exist?

The reader can convince himself that the graphs of Figure 2 have no
reconstruction (certain vertices are labelled for later reference). Thus, the existence
given in Ulam’s conjecture is by no means a trivial assumption. We shall see in
Section 5 exactly how nontrivial it is. In this section, we shall discuss some
results which have been achieved concerning uniqueness and construction of
reconstructions, assuming existence.

The question of uniqueness has been settled for “small” graphs. A theorem
due to Kelly [7] says that Ulam’s conjecture is true if $n$ is less than 7. This has
been extended to $n = 7$ by Harary and Palmer [5]. Unfortunately, the arguments
used (consideration of all cases) do not suggest a method of proof for
arbitrary $n$.

Some of the best results to date depend upon connectivity conditions. $G$ is
said to be connected if any two distinct vertices of $G$ can be linked by a path
made up of branches of $G$. In Figure 2, $H_1$ is connected while $H_5$ is not. Here
$H_5$ has four components (maximal connected subgraphs), and $H_4$, two components.
A tree is a connected graph with no circuits ($H_1$ and $H_2$ are trees, while
$H_3$, $H_4$ and $H_5$ are not).


FIG. 2 (the graphs $H_1$ through $H_5$; certain vertices of $H_1$, $H_2$ and $H_3$ are labelled $a_i$, $b_i$ and $c_i$)
In the paper referred to above, Kelly was able to show that any tree is
uniquely reconstructible from its $n-1$ vertex subgraphs. Thus, Ulam’s conjecture
is true for trees. The reader might try translating the assumption that $G$ is
a tree back into the original language of the problem. Harary and Palmer [6]
improved upon Kelly’s theorem by showing that any tree can in fact be uniquely
reconstructed from just the subgraphs obtained by removing the endpoints
(vertices with just one incident branch). A more recent improvement, which we
shall not discuss here, can be found in [2].


The following result is quite remarkable. Suppose the graphs $G_1, \ldots, G_n$,
each with $n-1$ vertices, have a reconstruction $G$. If at most one $G_i$ is connected,
then $G$ is unique.

FIG. 3 (a connected graph $G$ and a tree $T$ formed by $n-1$ of its branches)

FIG. 4 (the graphs $G_1$ through $G_9$)

The hypothesis on the number of connected G; appears surprising at first. 
In fact, it is equivalent to saying that any reconstruction which exists must be 
disconnected. For suppose G is connected. It is easy to see that some set of n—1 
branches of G forms a tree T (Figure 3). Further, since n>2, T has at least two 
end points, say x and y. Then, at least G—x and G—y are connected. It is 
straightforward to show that, conversely, if G is disconnected, then all but pos- 
sibly one G; are disconnected. 

The disconnectedness of any reconstruction can be exploited to actually 
produce one, component by component. For simplicity, the constructions will 
be given for particular graphs. The reader might try extending them to the 
general case. 

First consider $G_1$ through $G_9$ as shown in Figure 4. Here, all $G_i$ are
disconnected. The trick is to observe that the smallest number (greater than 1) of
components in any $G_i$ is 3, and to try to produce a suitable $G$ with this number
of components. Of those G; with three components, choose one, say Go, with a 
smallest component. For convenience, label its components $C_1$, $C_2$ and $C_3$, with
$C_3$ the smallest (see Figure 5). Keep $C_1$ and $C_2$ as candidates for components of
$G$.

FIG. 5 (steps 1 through 4 of the construction, showing the components $C_1$, $C_2$, $C_3$, the vertices $x$ and $y$, the graph $H$, and the resulting $G$)

There remains to produce one more. Take $C_1$ or $C_2$, say $C_2$, and remove a
vertex which does not disconnect $C_2$, say $x$ (but not $y$), forming a graph $H$. Now
identify some $G_i$ having three components, two being $C_1$ and $H$. A good choice
is Gs. The third component of Gs; can now be taken as the final component of G. 

Note that $G$ as constructed has no isolates (vertices with no incident branches).
This could have been predicted from the given $G_i$: $G$ will have $k$ isolates exactly
when $k$ of the $G_i$ have $k-1$ isolates and each of the remaining $n-k$ has at least
(but possibly more than) $k$ isolates. In this event, the construction just given
fails. However, when $G$ has isolates, it is particularly easy to reconstruct. For
example, in Figure 6, we have $k = 3$, and we can simply pick some $G_i$ (say $G_1$)
having two isolates and adjoin another one.
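One direction of the isolate count can be confirmed mechanically. The sketch below takes a graph $G$ with $k$ isolates and checks that exactly $k$ of the graphs $G - x$ have $k-1$ isolates while every other one has at least $k$. The example graph and the function names are invented for illustration; the code does not attempt the converse prediction from the $G_i$ alone.

```python
def isolates(vertices, edges):
    """Vertices with no incident branch."""
    touched = set().union(*edges)
    return [v for v in vertices if v not in touched]

def isolate_counts_of_cards(vertices, edges):
    """Number of isolates in G - x, for each vertex x in turn."""
    counts = []
    for x in vertices:
        rest = [v for v in vertices if v != x]
        kept = {e for e in edges if x not in e}
        counts.append(len(isolates(rest, kept)))
    return counts

def isolate_rule_holds(vertices, edges):
    """Check: k of the G - x have k-1 isolates, all others at least k."""
    k = len(isolates(vertices, edges))
    counts = isolate_counts_of_cards(vertices, edges)
    with_fewer = sum(1 for c in counts if c == k - 1)
    return with_fewer == k and all(c >= k for c in counts if c != k - 1)

# A triangle, a single branch, and three isolates (k = 3, n = 8).
vertices = list(range(8))
edges = {frozenset(p) for p in [(0, 1), (1, 2), (0, 2), (3, 4)]}
counts = isolate_counts_of_cards(vertices, edges)
```

Deleting an isolate lowers the count by one, while deleting any other vertex can only create new isolates, which is why the check succeeds.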


FIG. 6 (the graphs $G_i$ and the reconstruction $G$)
To sum up, if a reconstruction $G$ of the graphs $G_i$ exists, we can: (1) predict
from the $G_i$ when $G$ must be disconnected, and (2) in this event, construct $G$
uniquely.

These results produce another immediately upon taking complements. The
complement of $G$, denoted $G^c$, is formed by erasing all branches of $G$ and drawing
in any which were not originally there. It is easy to see that $G$ is a reconstruction
of $G_1, \ldots, G_n$ exactly when $G^c$ is a reconstruction of $G_1^c, \ldots, G_n^c$. In Figure 5,
note that $G^c$ is connected, so this is an honest extension to some graphs not
already treated.

The constructions we have just described are special cases of the general ones 
discussed by Harary [4], who also has additional details and references on graph 
reconstructions. 
5. A more general reconstruction problem. We have already seen (Figure 2)
that $n$ graphs may very well have no reconstruction. To see how bad the situation
can be, note that even $H_3$ and $H_5$ by themselves have none; that is, there is
no five vertex graph $G$ such that $G - x$ is $H_3$ and $G - y$ is $H_5$. In this section, we
focus attention on the general existence problem: given $G_1, \ldots, G_k$, each with
$n-1$ vertices ($k \le n$), does there exist an $n$ vertex $G$ with vertices $x_1, \ldots, x_k$
such that $G - x_i$ is essentially $G_i$? Ideally, we would like necessary and sufficient
conditions on the $G_i$’s which will tell us when to expect a reconstruction, and
then a method for finding it (or them).

To get some feeling for what is happening, we begin with the simplest
nontrivial case, $k = 2$. A first impression is that $G_1$ and $G_2$ have to be “almost the
same” for there to be a reconstruction. Specifically, if $G - x_1 = G_1$ and $G - x_2 = G_2$,
then

    $(G - x_1) - x_2 = G_1 - x_2$
and
    $(G - x_2) - x_1 = G_2 - x_1.$

But it is easy to see that $(G - x_1) - x_2$ and $(G - x_2) - x_1$ are identical. Thus, $G_1$
and $G_2$ are the same “to within one vertex.” More carefully: if a reconstruction
of $G_1$ and $G_2$ exists, then there must be vertices $x$ of $G_1$ and $y$ of $G_2$ such that
$G_1 - x$ is isomorphic to $G_2 - y$.

This is a very reasonable necessary condition in that it is both easily stated
and applied (try it on $H_3$ and $H_5$ of Figure 2). Happily, it is also sufficient. For
suppose $G_1 - x = G_2 - y$. We can produce a suitable $G$ by starting with a copy of
$G_1 - x$ and drawing in two new vertices and appropriate branches to each. In
Figure 2, we have $H_1 - a_1 = H_3 - c_2$, and two possible constructions are indicated
in Figure 7, (a) and (b), where $G - u = H_1$ and $G - v = H_3$.


FIG. 7 (panels (a), (b) and (c))
Incidentally, this shows that a reconstruction is never unique when $k = 2$,
since we are free to put in or leave out the branch between the two adjoined
vertices ($u$ and $v$ of Figure 7 (a) and (b)). It also suggests a method for producing
all possible reconstructions of two graphs. Simply list all pairs of vertices $a$ and
$b$ such that $G_1 - a = G_2 - b$, and repeat the constructions for each pair. For
example, referring again to Figure 2, note that $H_1 - a_3 = H_3 - c_3$. A reconstruction
$K$ begun with a copy of $H_1 - a_3$ is given in Figure 7 (c), where deleting one of the
adjoined vertices gives $H_1$ and deleting the other gives $H_3$.
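The method just described for two graphs can be sketched as follows. Graphs are stored as a list of vertices with a set of two-element frozensets; for every pair $a$, $b$ and every isomorphism of $G_1 - a$ onto $G_2 - b$, one new vertex is adjoined to $G_1$ to play the role of $b$, with and without the free branch to $a$. The representation, the function names and the label "u" for the new vertex (assumed not to occur in $G_1$) are illustrative assumptions, and the search is brute force.

```python
from itertools import permutations

def minus(g, x):
    """G - x: drop x and every branch incident with x."""
    vertices, edges = g
    return ([v for v in vertices if v != x], {e for e in edges if x not in e})

def isomorphisms(g, h):
    """Yield every isomorphism of g onto h as a dict (brute force)."""
    (gv, ge), (hv, he) = g, h
    if len(gv) != len(hv) or len(ge) != len(he):
        return
    for image in permutations(hv):
        f = dict(zip(gv, image))
        if all(frozenset(f[p] for p in e) in he for e in ge):
            yield f

def reconstructions_of_pair(g1, g2, new_vertex="u"):
    """Yield pairs (a, G) with G - new_vertex equal to g1 and G - a
    isomorphic to g2."""
    v1, e1 = g1
    v2, e2 = g2
    for a in v1:
        for b in v2:
            for f in isomorphisms(minus(g1, a), minus(g2, b)):
                # new_vertex plays the role of b: join it to p when
                # b and f(p) are joined in g2.
                extra = {frozenset((new_vertex, p)) for p in f
                         if frozenset((b, f[p])) in e2}
                for joined in (False, True):   # branch {new_vertex, a} is free
                    edges = set(e1) | extra
                    if joined:
                        edges.add(frozenset((new_vertex, a)))
                    yield a, (list(v1) + [new_vertex], edges)

path3 = (["a", "b", "c"], {frozenset("ab"), frozenset("bc")})
triangle = ([1, 2, 3], {frozenset((1, 2)), frozenset((2, 3)), frozenset((1, 3))})
empty3 = ([7, 8, 9], set())
found = list(reconstructions_of_pair(path3, triangle))
none_found = list(reconstructions_of_pair(triangle, empty3))
```

The same graph may be produced several times, once for each isomorphism; removing duplicates up to isomorphism is left out of this sketch.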

Having found a nice solution for $k = 2$, we are led naturally to attempt a
similar theorem for the general case. Half of the above reasoning carries through
immediately. If $G - x_i = G_i$, then

    $(G - x_i) - x_j = G_i - x_j = (G - x_j) - x_i = G_j - x_i,$

so that, in pairs, $G_i$ and $G_j$ are identical to within a vertex. This isn’t surprising,
since any reconstruction of $k$ graphs is a reconstruction of any two of them. The
bubble bursts, however, with $H_1$, $H_2$ and $H_3$ of Figure 2. In pairs, $H_1 - a_1$ and
$H_2 - b_1$; $H_2 - b_2$ and $H_3 - c_1$; and $H_3 - c_3$ and $H_1 - a_3$, are isomorphic. There is not
even any cheating, since no vertex has been used twice. But there is no
reconstruction of $H_1$, $H_2$ and $H_3$.

It turns out that, to insure existence, we need not only isomorphisms between
$G_i - x$ and $G_j - y$, but particular kinds of isomorphisms. This is best explained
in terms of vertex orderings. If $x_i^j$ is an ordering of the vertices of $G_j$, for
$1 \le i \le n-1$ and $1 \le j \le k$, then we can define maps $f_{(t,r)}$ from $G_t - x_r^t$ to
$G_{1+r} - x_t^{1+r}$ by:

    $x_i^t \to x_i^{1+r}$    if $1 \le i \le t-1$ or $1+r \le i \le n-1$,
and
    $x_i^t \to x_{i+1}^{1+r}$    if $t \le i < r$.

Here, values of $t$ are specified by $1 \le t \le \min(3, k-1)$, and, for each $t$, values of
$r$ by $t \le r \le k-1$. Now, in general, these maps may have no particular properties
of interest. In the event that they are all isomorphisms, we call the orderings
compatible. A general existence theorem can now be stated.


THEOREM. The graphs $G_1, \ldots, G_k$ have a reconstruction if and only if their
vertices can be ordered compatibly.


Most of the details of the proof are uninteresting, with the exception of the
constructive part. When the vertex orderings are compatible, a reconstruction
can be produced as follows. Mark $n$ points, say $v_1, \ldots, v_n$, as vertices. Draw a
branch between $v_i$ and $v_j$ for $2 \le i < j \le n$ exactly when $x_{i-1}^1$ and $x_{j-1}^1$ are connected
by a branch in $G_1$. For $j \ge 3$, connect $v_1$ with $v_j$ when $x_1^2$ and $x_{j-1}^2$ are connected
in $G_2$. Finally, if $k > 2$, connect $v_1$ and $v_2$ if $x_1^3$ and $x_2^3$ are connected in $G_3$.
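The compatibility test and the construction can be sketched together. Here each ordered graph $G_j$ is stored as a set of index pairs: the pair $\{i, i'\}$ is present when the $i$th and $i'$th vertices of the ordering are joined. The index conventions for the maps (from $G_t$ with its $r$th vertex removed to $G_{1+r}$ with its $t$th vertex removed, fixing indices below $t$ and above $r$ and raising the others by one) follow one reading of the definition and should be treated as an assumption, as should all function names.

```python
from itertools import combinations

def ordered_card(edges, t):
    """G - v_t for a graph on vertices 1..n, with the induced ordering:
    v_i becomes index i if i < t and index i - 1 if i > t."""
    shift = lambda v: v if v < t else v - 1
    return {frozenset((shift(a), shift(b)))
            for a, b in map(sorted, edges) if t not in (a, b)}

def orderings_compatible(cards, n):
    """cards[j-1] holds the branches of G_j as index pairs from 1..n-1."""
    k = len(cards)
    for t in range(1, min(3, k - 1) + 1):
        for r in range(t, k):
            f = lambda i: i + 1 if t <= i < r else i
            domain = [i for i in range(1, n) if i != r]
            for i, j in combinations(domain, 2):
                in_t = frozenset((i, j)) in cards[t - 1]
                in_next = frozenset((f(i), f(j))) in cards[r]   # G_{1+r}
                if in_t != in_next:
                    return False
    return True

def reconstruct(cards, n):
    """Branches of G on vertices 1..n, built from G_1, G_2 and G_3."""
    edges = set()
    for i, j in combinations(range(2, n + 1), 2):
        if frozenset((i - 1, j - 1)) in cards[0]:
            edges.add(frozenset((i, j)))
    for j in range(3, n + 1):
        if frozenset((1, j - 1)) in cards[1]:
            edges.add(frozenset((1, j)))
    if len(cards) > 2 and frozenset((1, 2)) in cards[2]:
        edges.add(frozenset((1, 2)))
    return edges

G = {frozenset(p) for p in [(1, 2), (2, 3), (3, 4), (4, 5), (1, 5), (1, 3)]}
cards = [ordered_card(G, t) for t in range(1, 6)]
spoiled = [cards[0], set()] + cards[2:]     # replace G_2 by an empty graph
```

Starting from a known graph and taking the induced orderings gives compatible data by design, so the construction should return the original branches.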

Two observations are in order. First, $G$ is constructed from $G_1$, $G_2$ and $G_3$.
The compatibility conditions simply insure that the other $G_i$ are recoverable
from $G$. Second, the conditions are more theoretical than practical, as there is
no clear cut way of determining whether or not there are compatible orderings.
It happens that, when $k = 2$, the vertices of $G_1$ and $G_2$ can always be ordered so
that the isomorphism $G_1 - x \cong G_2 - y$ is of the required form. For $k > 2$, however,
complications mount rapidly. Unfortunately, at least for the present, there does
not seem to be any way to avoid this in the general case.

To bring the discussion full cycle, it is easy to verify that the following is an
equivalent statement of Ulam’s conjecture.

If $x_i^j$ and $y_i^j$, where $1 \le i \le n-1$, are compatible vertex orderings of $G_j$, for
$1 \le j \le n$, then, for each $j$, the map $x_i^j \to y_i^j$ is an isomorphism of $G_j$ onto itself.

This formulation clearly lacks the simple appeal of the others. It does serve 
to pinpoint the real difficulty built into the conjecture, as any proof will in effect 
show that compatible orderings are essentially unique. 


6. Conclusion. Our concluding remarks will be confined to a brief statement 
of the current extent of ignorance about graph reconstructions. 

Virtually nothing is known, for example, about when uniqueness starts
happening. For $k = 2$, reconstructions are never unique. For $k = 3$, uniqueness
may occur. For example, three complete graphs of $n-1$ vertices each have
exactly one reconstruction, namely the complete graph of $n$ vertices (complete
means that every possible branch is drawn in). When else can three graphs
uniquely determine a reconstruction? When do $k$ graphs uniquely determine a
reconstruction? Is it possible to characterize a reasonable class of $n$ vertex
graphs which are uniquely reconstructible from certain subgraphs (as Kelly did
for trees and all the $n-1$ vertex subgraphs, and Harary and Palmer did for
trees and their maximal subtrees)?

Such questions are interesting in their own right, and even partial solutions 
would certainly yield some insight into the general problem of existence and 
uniqueness of graph reconstructions. 
AN APPLICATION OF QUASIGROUPS TO GEOMETRY
DAVID MERRIELL, Vassar College 


There exist in recent literature several applications of quasigroups to geome- 
try. The purpose of this article is to show how identities in a particular idem- 
potent medial quasigroup may be used to prove some theorems in elementary 
geometry. 

A quasigroup is a binary system $(Q, \cdot)$ which is closed under its operation and
in which for any $a$ and $b$ in $Q$, the equations $ax = b$ and $ya = b$ have solutions in
$Q$. A binary system is idempotent if $a^2 = a$ for every $a$ in $Q$. It is medial if


(1) (ab) (cd) = (ac) (bd) 


for every a, b,c,d in Q. Different authors have called medial quasigroups abelian, 
entropic, symmetric, and alternation quasigroups. The algebraic properties of 
such systems have been studied intensively. For idempotent medial quasigroups, 
the following identities hold: 


(2) a(bc) = (ab)(ac)  left-distributive law; 
(3) (ab)c = (ac)(bc) right-distributive law; 
(4) a(ba) = (ab)a flexible law. 


If $(F, +, \cdot)$ is a field and $r$ is any element of $F$, then the system $(F, \Delta)$ in
which $a \,\Delta\, b$ is defined to be $(1 - r)a + rb$ is an idempotent medial quasigroup.
Within these quasigroups, excluding the cases where $r = 0, 1$, the left and right
cancellation laws hold:

(5)    $a \,\Delta\, b = a \,\Delta\, c$ implies $b = c$, and $a \,\Delta\, c = b \,\Delta\, c$ implies $a = b$.

If “$\Delta$” and “$\circ$” are two operations corresponding to elements $r$ and $s$ in $F$, then

(6)    $(a \,\Delta\, b) \circ (c \,\Delta\, d) = (a \circ c) \,\Delta\, (b \circ d).$

Also, for any $k$ in $F$

(7)    $k(a \,\Delta\, b) = (ka) \,\Delta\, (kb),$

and in the special case where $s = 1/2$, one obtains the identities

(8)    $(a \,\Delta\, b) + (c \,\Delta\, d) = (a + c) \,\Delta\, (b + d);$
(9)    $(a \,\Delta\, b) + (b \,\Delta\, a) = a + b;$
(10)   $a + (b \,\Delta\, c) = (a + b) \,\Delta\, (a + c).$
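These identities can be spot-checked with exact rational arithmetic. The sketch below takes the field of rationals, builds the operation $(1 - r)a + rb$ for two sample parameters, and tests idempotence together with identities (1) through (4) and (6) through (10) on a small set of values. The sample parameters and names are arbitrary choices for illustration, and a finite check is of course not a proof.

```python
from fractions import Fraction
from itertools import product

def make_op(r):
    """The operation a, b -> (1 - r) a + r b attached to the field element r."""
    return lambda a, b: (1 - r) * a + r * b

r, s = Fraction(2, 7), Fraction(1, 2)
tri = make_op(r)     # the operation attached to r
circ = make_op(s)    # the operation attached to s = 1/2
sample = [Fraction(n, d) for n, d in [(0, 1), (1, 1), (-3, 2), (5, 3)]]

def identities_hold():
    for a, b, c, d in product(sample, repeat=4):
        checks = [
            tri(a, a) == a,                                             # idempotent
            tri(tri(a, b), tri(c, d)) == tri(tri(a, c), tri(b, d)),     # (1)
            tri(a, tri(b, c)) == tri(tri(a, b), tri(a, c)),             # (2)
            tri(tri(a, b), c) == tri(tri(a, c), tri(b, c)),             # (3)
            tri(a, tri(b, a)) == tri(tri(a, b), a),                     # (4)
            circ(tri(a, b), tri(c, d)) == tri(circ(a, c), circ(b, d)),  # (6)
            d * tri(a, b) == tri(d * a, d * b),                         # (7), k = d
            tri(a, b) + tri(c, d) == tri(a + c, b + d),                 # (8)
            tri(a, b) + tri(b, a) == a + b,                             # (9)
            a + tri(b, c) == tri(a + b, a + c),                         # (10)
        ]
        if not all(checks):
            return False
    return True
```

Using `Fraction` avoids rounding, so each comparison is an exact equality in the field.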


Now let us consider an example of a medial quasigroup, given by Sholander
[2], in which $Q$ is the set of points in the Euclidean plane, $\alpha$ is a fixed triangle,
and $ab$ is the third vertex of the triangle $\beta$ with vertices $(a, b, ab)$ which is
directly similar to $\alpha$. This means that $\beta$ is similar to $\alpha$ and has the same orientation.
It seems natural to require that $a^2 = a$ in this model. Thus the quasigroup is
idempotent.


Let the points $(x, y)$ of the plane be represented by complex numbers $x + iy$.
Then if the fixed triangle $\alpha$ has vertices represented by the complex numbers
$(0, 1, r)$, the vertices of $\beta$ are $(a, b, a \,\Delta\, b)$ where $a \,\Delta\, b = (1 - r)a + rb$. Therefore all
of the above identities apply. Corresponding to each identity there is a geometric
property, in most cases not immediately obvious, of directly similar
triangles. We shall use the symbols $a$ and $b$ for the vertices as well as for the
complex numbers which represent them. We shall also use capital letters
$A, B, C, \ldots$ to represent the vertices corresponding to $a, b, c, \ldots$

In [3], I. M. Yaglom proved the following theorem: Given an arbitrary convex
quadrilateral $ABCD$, if equilateral triangles are constructed exterior to the
quadrilateral on sides $AB$ and $CD$ and interior to it on $BC$ and $DA$, then the free
vertices of the equilateral triangles form a parallelogram. Our identity (8) may
be used to prove the following generalization: Given an arbitrary quadrilateral
$ABCD$, if triangles directly similar to a fixed triangle are constructed on the
directed sides $AB$, $CB$, $CD$, and $AD$, then the free vertices of the triangles form
a parallelogram. The free vertices are $a \triangle b$, $c \triangle b$, $c \triangle d$, and $a \triangle d$. The diagonals
of the quadrilateral which they form have midpoints $[(a \triangle b) + (c \triangle d)]/2$ and
$[(c \triangle b) + (a \triangle d)]/2$, which are both equal to $[(a + c) \triangle (b + d)]/2$. Hence the
diagonals bisect each other and the quadrilateral is a parallelogram.
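
The generalization can be tried out numerically with complex numbers. This is only an illustrative sketch: the function names `tri`, `free_vertices`, and `is_parallelogram`, the sample quadrilateral, and the floating-point tolerance are assumptions made here, not part of the article.

```python
def tri(a, b, r):
    """Third vertex of the triangle on the directed side ab similar to (0, 1, r)."""
    return (1 - r) * a + r * b


def free_vertices(a, b, c, d, r):
    """Free vertices on the directed sides AB, CB, CD, AD, in that order."""
    return tri(a, b, r), tri(c, b, r), tri(c, d, r), tri(a, d, r)


def is_parallelogram(p, q, s, t, tol=1e-9):
    """PQST is a parallelogram exactly when diagonals PS and QT share a midpoint."""
    return abs((p + s) - (q + t)) < tol


# An arbitrary (hypothetical) quadrilateral ABCD given as complex numbers.
quad = (0 + 0j, 4 + 1j, 5 + 3j, -1 + 6j)
equilateral_r = (1 + 1j * 3 ** 0.5) / 2   # the equilateral case of the fixed triangle

print(is_parallelogram(*free_vertices(*quad, equilateral_r)))
print(is_parallelogram(*free_vertices(*quad, 0.3 + 0.8j)))
```

The midpoint test mirrors the argument in the text: the sums $(a \triangle b) + (c \triangle d)$ and $(c \triangle b) + (a \triangle d)$ agree for every choice of $r$.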

The quasigroup identities can also be used to generalize results of Garfunkel
and Garfunkel [1]. Following their lead, we define three directed segments
$A_1B_1$, $A_2B_2$, $A_3B_3$ to be r-symmetric if and only if, when the triangles $A_iB_iC_i$
are constructed directly similar to $(0, 1, r)$ where $r$ is any complex number not
on the real line, then the triangle $C_1C_2C_3$ is also directly similar to $(0, 1, r)$.
The authors call the three segments “equi-Pappian” in case the fixed triangle
is equilateral, that is, if $r = (1 + i\sqrt{3})/2$.

In this more general context, the first three theorems of [1] become: 


THEOREM 1. If $ABC$ and $A'B'C'$ are two triangles directly similar to $(0, 1, r)$,
then the joins $AA'$, $BB'$, $CC'$ of corresponding vertices are r-symmetric.


COROLLARY. Three directed line segments $AA'$, $AB'$, $AC'$ are r-symmetric if
and only if $A'B'C'$ is directly similar to $(0, 1, r)$.

THEOREM 2. Quadrilateral $ABCD$ is a parallelogram if and only if $AD$, $BC$,
and $AC$ are r-symmetric.


THEOREM 3. If the three sides AB, CD, and BC of a directed quadrilateral are 
r-symmetric, then BD, AC, and AD are r-symmetric. 


In the proofs which follow, $x \triangle y = (1-r)x + ry$. The triangle $XYZ$ is directly
similar to $(0, 1, r)$ if and only if $x \triangle y = z$.

Proof of Theorem 1. Using (1), $(a \triangle a') \triangle (b \triangle b') = (a \triangle b) \triangle (a' \triangle b') = c \triangle c'$.

Proof of Corollary. Let $(a \triangle a') \triangle (a \triangle b') = a \triangle c'$. Since $(a \triangle a') \triangle (a \triangle b')
= (a \triangle a) \triangle (a' \triangle b') = a \triangle (a' \triangle b')$, we use (5) to obtain $a' \triangle b' = c'$. Hence $A'B'C'$
is directly similar to $(0, 1, r)$. The converse is a direct consequence of the
theorem.

Proof of Theorem 2. If $AD$, $BC$, and $AC$ are r-symmetric, then $(a \triangle d) \triangle (b \triangle c)
= a \triangle c$. Hence $(1-r)^2 a + (1-r)r(d + b) + r^2 c = (1-r)a + rc$ so that $r(1-r)(b + d)
= r(1-r)(a + c)$. Since $r \neq 0, 1$, $a + c = b + d$. This implies that $ABCD$ is a
parallelogram. The argument is reversible. Note that the condition $(a \triangle d) \triangle (b \triangle c)
= a \triangle c$ is necessary and sufficient for $ABCD$ to be a parallelogram.

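Proof of Theorem 3. It is given that $(a \triangle b) \triangle (c \triangle d) = b \triangle c$. By (1), $(b \triangle d)
\triangle (a \triangle c) = (b \triangle a) \triangle (d \triangle c)$. Using first (9) and then (8), together with the fact
that $-(x \triangle y) = (-x) \triangle (-y)$,


$(b \triangle a) \triangle (d \triangle c) = [a + b - (a \triangle b)] \triangle [c + d - (c \triangle d)]$
$= [(a + b) \triangle (c + d)] - [(a \triangle b) \triangle (c \triangle d)]$
$= [(a \triangle c) + (b \triangle d)] - (b \triangle c) = a \triangle d$.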


Therefore BD, AC, and AD are r-symmetric. 

Finally, we use this method to prove Theorem 4 of [1]: If equilateral tri- 
angles are constructed outwardly on the alternate sides AB, CD, EF of a directed 
hexagon so that their free vertices P, Q, R form an oppositely directed equi- 
lateral triangle, then the free vertices of the equilateral triangles constructed 
on the other alternate sides BC, DE, FA also form an oppositely directed 
equilateral triangle. 

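Here the vertices $P$, $Q$, $R$ are $a \triangle b$, $c \triangle d$, and $e \triangle f$ and we are given that
$(c \triangle d) \triangle (a \triangle b) = e \triangle f$. A necessary and sufficient condition for the triangle
$(a, b, ab)$ to be equilateral is that $(a \triangle b) \triangle a = b$. By (4), $(a \triangle b) \triangle a = a \triangle (b \triangle a)$.
Therefore,


$(c \triangle a) \triangle (d \triangle b) = e \triangle f$,
$c \triangle a = (d \triangle b) \triangle (e \triangle f) = (d \triangle e) \triangle (b \triangle f)$,
$(b \triangle f) \triangle (c \triangle a) = d \triangle e$,
$(b \triangle c) \triangle (f \triangle a) = d \triangle e$,
$(d \triangle e) \triangle (b \triangle c) = f \triangle a$.

Since $b \triangle c$, $d \triangle e$, and $f \triangle a$ are the free vertices of the second set of triangles,
this completes the proof.

AN ANALOGUE OF PTOLEMY’S THEOREM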
IN SPHERICAL GEOMETRY 


J. E. VALENTINE, Utah State University 


1. Introduction. In a note [3] in this journal, L. M. Blumenthal and B. E.
Gillam exhibited, among other things, the geometrical significance of the
cofactors of the determinant


$\Delta(p_1, p_2, \ldots, p_{n+2}) = |\cos(p_ip_j/\rho)|$,


where $i, j = 1, 2, \ldots, n+2$, and $p_1, p_2, \ldots, p_{n+2}$ are $n+2$ points of the
$n$-dimensional spherical surface $S_{n,\rho}$ (the “surface” of a sphere of radius $\rho$ in
euclidean $(n+1)$-dimensional space, with shorter arc metric, where $p_ip_j = p_jp_i$
denotes the distance of the points $p_i$, $p_j$).

Haantjes [4] gave a proof of the spherical analogue of the ptolemaic inequality,
and in [5] developed techniques which give a new proof of Ptolemy’s Theorem
and its converse in the euclidean plane. It is further stated [5] that these
techniques give proofs of the analogue of Ptolemy's Theorem and its converse in the
spherical and hyperbolic planes. However, his analogue of the converse of
Ptolemy’s Theorem in the hyperbolic plane is false; e.g., the determinant
$|\sinh^2(p_ip_j/2)| = 0$, where $i, j = 1, 2, 3, 4$, for four points on a horocycle in the
hyperbolic plane (see [6]), and consequently the result for the spherical plane
is not immediate.

The purpose of this note is to give the geometrical significance of the
determinant

$\psi(p_1, p_2, p_3, p_4) = |\sin^2(p_ip_j/2\rho)|$,

where $i, j = 1, 2, 3, 4$, and $p_1, p_2, p_3, p_4$ are four points of the spherical plane of
radius $\rho$. The determinant $\psi(p_1, p_2, p_3, p_4)$ is a principal minor of a determinant
obtained from $\Delta(p_1, p_2, p_3, p_4)$. The spherical analogue of Ptolemy’s Theorem
and its converse is thus obtained. Though the techniques and proofs are quite
different from Haantjes’s [5], the theorem and its converse in the spherical
plane are the same. Moreover, the techniques used here give an immediate
generalization.

The analogue of Ptolemy’s Theorem and its converse is more apparent if it
is remembered that the determinant

\[
D(p_1, p_2, p_3, p_4) =
\begin{vmatrix}
0 & 1 & 1 & 1 & 1 \\
1 & 0 & p_1p_2^2 & p_1p_3^2 & p_1p_4^2 \\
1 & p_1p_2^2 & 0 & p_2p_3^2 & p_2p_4^2 \\
1 & p_1p_3^2 & p_2p_3^2 & 0 & p_3p_4^2 \\
1 & p_1p_4^2 & p_2p_4^2 & p_3p_4^2 & 0
\end{vmatrix}
\]

vanishes for each quadruple of points $p_1, p_2, p_3, p_4$ of the euclidean plane
(see [1], pp. 153, 154).


It is noted that Ptolemy’s Theorem and its converse for the euclidean plane
may be stated as follows:


THEOREM (Ptolemy). Four points $p_1, p_2, p_3, p_4$ of the euclidean plane lie on a
circle or line if and only if the determinant


$C(p_1, p_2, p_3, p_4) = |p_ip_j^2|$, where $i, j = 1, 2, 3, 4$, vanishes.
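
The determinant form of Ptolemy's Theorem is easy to evaluate for concrete points. The sketch below is an illustration only: it assumes points with integer coordinates so that all arithmetic is exact, and the names `det`, `squared_distance`, and `ptolemy_det` are introduced here for convenience.

```python
def det(m):
    """Determinant by cofactor expansion along the first row (fine for 4 x 4)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for col, entry in enumerate(m[0]):
        minor = [row[:col] + row[col + 1:] for row in m[1:]]
        total += (-1) ** col * entry * det(minor)
    return total


def squared_distance(p, q):
    """Square of the euclidean distance between two planar points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def ptolemy_det(points):
    """The determinant C(p1, ..., p4) = |p_i p_j^2| for four planar points."""
    return det([[squared_distance(p, q) for q in points] for p in points])


print(ptolemy_det([(0, 0), (1, 0), (1, 1), (0, 1)]))   # vertices of a square
print(ptolemy_det([(0, 0), (1, 0), (2, 0), (3, 0)]))   # four collinear points
print(ptolemy_det([(0, 0), (1, 0), (0, 1), (2, 2)]))   # not on a circle or line
```

The first two quadruples lie on a circle and on a line respectively, so the determinant vanishes for them; the third quadruple lies on neither, and its determinant is nonzero.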


2. Preliminary Remarks. It is well known [2, pp. 162-163] that the
determinants


$\Delta(p_1, p_2, p_3, p_4) = |\cos(p_ip_j/\rho)|$, where $i, j = 1, 2, 3, 4$,


and


$\Delta(p_1, p_2, p_3, p_4, p_5) = |\cos(p_ip_j/\rho)|$, where $i, j = 1, 2, 3, 4, 5$,


vanish for each quadruple and for each quintuple of points of the 2-dimensional
spherical surface of radius $\rho$. Moreover, $\Delta(p_1, p_2, p_3) = |\cos(p_ip_j/\rho)| \geq 0$, where
$i, j = 1, 2, 3$, and vanishes if and only if the three points are on a great circle.


3. Analogue of the Ptolemaic Inequality. A metric space is said to satisfy
the ptolemaic inequality provided for each quadruple $p, q, r, s$ of its points, the
three products $pq \cdot rs$, $pr \cdot qs$, $ps \cdot qr$ of “opposite” distances satisfy the triangle
inequality. It is known [2, p. 80, exercise 5] that the spherical plane is not
ptolemaic. The spherical plane, however, does satisfy an analogous inequality.


THEOREM 3.1. If $p_1, p_2, p_3, p_4$ are four points, then the determinant
$\psi(p_1, p_2, p_3, p_4) = |\sin^2(p_ip_j/2\rho)| \leq 0$,
where $i, j = 1, 2, 3, 4$, and $\psi(p_1, p_2, p_3, p_4) = 0$ if $p_1, p_2, p_3, p_4$ lie on a great circle.


Proof. Since $\Delta(p_1, p_2, p_3, p_4) = 0$, it follows that the determinant obtained
from $\Delta(p_1, p_2, p_3, p_4)$, by bordering it with a first row and a first column with
common element $-1$, and the rest of the elements in the first row all zeroes, and
the rest of the elements in the first column all ones, also vanishes.

The result of subtracting the first column of this bordered determinant from
the second, third, fourth, and fifth columns, respectively, and making use of the
fact that $1 - \cos x = 2\sin^2(x/2)$ is


(1) $\begin{vmatrix} -1 & 1 \\ 1 & -2\sin^2(p_ip_j/2\rho) \end{vmatrix} = 0$, where $i, j = 1, 2, 3, 4$.


If three of the four points do not lie on a great circle, assume the labeling
so that $\Delta(p_2, p_3, p_4) \neq 0$. Let $[1, 2]$ denote the cofactor of the element in the
first row and second column; an expansion theorem for determinants yields:


$-2^4\,\psi(p_1, p_2, p_3, p_4)\,\Delta(p_2, p_3, p_4) - [1, 2]^2 = 0$.


Since $\Delta(p_2, p_3, p_4) > 0$, it follows that $\psi(p_1, p_2, p_3, p_4) \leq 0$.

If each triple of the four points lies on a great circle, then the rank of
$\Delta(p_1, p_2, p_3, p_4)$ is two, and consequently the rank of the determinant in (1) is
three. Thus, in this case, $\psi(p_1, p_2, p_3, p_4) = 0$.


COROLLARY. If $p, q, r, s$ is a quadruple of points, then the three products of the
sines of half the “opposite” distances satisfy the triangle inequality.


Proof. By Theorem 3.1, $\psi(p, q, r, s) \leq 0$. However, expansion of $\psi(p, q, r, s)$
and routine but tedious computations yield $\psi(p, q, r, s) = A \cdot B \cdot C \cdot D$, where


$A = -[\sin\lambda \sin\delta + \sin\epsilon \sin\theta + \sin\alpha \sin\beta]$
$B = [\sin\lambda \sin\delta + \sin\epsilon \sin\theta - \sin\alpha \sin\beta]$
$C = [\sin\lambda \sin\delta - \sin\epsilon \sin\theta + \sin\alpha \sin\beta]$
$D = [-\sin\lambda \sin\delta + \sin\epsilon \sin\theta + \sin\alpha \sin\beta]$


and $\lambda = pq/2\rho$, $\delta = rs/2\rho$, $\epsilon = pr/2\rho$, $\theta = qs/2\rho$, $\alpha = ps/2\rho$, and $\beta = qr/2\rho$.

Since $\psi(p, q, r, s) \leq 0$ and $A \leq 0$, it follows that $B$, $C$, $D$ cannot all be
negative. If exactly one of $B$, $C$, $D$ were negative, then $\psi(p, q, r, s) \geq 0$, and hence
$\psi(p, q, r, s) = 0$. Thus one of the other two factors $B$, $C$, $D$ would be equal to
zero. Addition of the negative factor and the zero factor would give a negative
sum, contradicting the fact that the sum of any two of $B$, $C$, $D$ is nonnegative
(for example, $B + C = 2 \sin\lambda \sin\delta \geq 0$). In exactly the
same manner, it follows that no two of the factors $B$, $C$, $D$ can be negative.
Therefore, $B$, $C$, $D$ are all nonnegative.


4. Analogue of Ptolemy’s Theorem. Theorem 3.1 shows that $\psi(p, q, r, s) = 0$
if $p, q, r, s$ are points on a great circle. We now establish the following:


THEOREM 4.1. If $p, q, r, s$ are four points on a circle, then $\psi(p, q, r, s) = 0$.


Proof. If $p, q, r, s$ are points on a circle, $C(c : \rho_c)$, with center $c$ and radius $\rho_c$,
assume first that $p, q, r, s$ are on a semicircle. It follows that one of the angles
with vertex $c$ is equal to the sum of the other three. Assume the labeling so that


$\angle c{:}p, s = \angle c{:}p, q + \angle c{:}q, r + \angle c{:}r, s$,

where $\angle c{:}x, y$ denotes the angle made by the tangents to the great circles
through $c, x$ and $c, y$, respectively, at their common point $c$. By the use of simple
trigonometric identities, it is seen that


(2) $[\sin(\tfrac12 \angle c{:}p, s) \cdot \sin(\tfrac12 \angle c{:}q, r)] + [\sin(\tfrac12 \angle c{:}p, q) \cdot \sin(\tfrac12 \angle c{:}r, s)] = [\sin(\tfrac12 \angle c{:}p, r) \cdot \sin(\tfrac12 \angle c{:}q, s)]$.


Multiplication of equation (2) by $\sin^2(\rho_c/\rho)$ and substitution of $\sin(xy/2\rho)$ for
$\sin(\tfrac12 \angle c{:}x, y) \cdot \sin(\rho_c/\rho)$, where $x, y = p, q, r, s$, lead to the equation


$[\sin(ps/2\rho) \cdot \sin(qr/2\rho)] + [\sin(pq/2\rho) \cdot \sin(rs/2\rho)] = [\sin(pr/2\rho) \cdot \sin(qs/2\rho)]$.


It follows from the factorization of $\psi(p, q, r, s)$ as given in the proof of the
corollary to Theorem 3.1 that $\psi(p, q, r, s) = 0$.

The only case left is when three of the points lie on a semicircle of $C(c : \rho_c)$
and the other point is on the circle, but not on the same semicircle. This case
is treated very much like the above case, which completes the proof.


5. Analogue of the Converse of Ptolemy’s Theorem. In order to establish
the analogue of the converse of Ptolemy's Theorem, we first establish the
following:


THEOREM 5.1. If $p_1, p_2, p_3, p_4, p_5$ are five points, then the determinant
$\psi(p_1, p_2, p_3, p_4, p_5) = |\sin^2(p_ip_j/2\rho)|$, where $i, j = 1, 2, 3, 4, 5$, vanishes.


Proof. Since $\Delta(p_1, p_2, p_3, p_4, p_5) = 0$, it follows that the determinant $A$,
obtained from $\Delta(p_1, p_2, p_3, p_4, p_5)$ by bordering it with a first column and a first
row with common element $-1$, and the rest of the elements of the first column
all ones, and the rest of the elements in the first row all zeroes, also vanishes.
If the quintuple contains a triple that does not lie on a great circle, then the
leading principal minor of order four of $A$ is nonzero, while the determinants
obtained from this principal minor by adjoining one row and one column or two
rows and two columns vanish. It follows that the rank of $A$ is four. If each triple
lies on a great circle, then the rank of $A$ is three. The determinant $B$, obtained
from $A$ by subtracting the first column from the second, third, fourth, fifth, and
sixth columns, respectively, and then substituting $-2\sin^2(x/2\rho)$ for $\cos(x/\rho) - 1$
also has rank three or four. Hence,


$|\sin^2(p_ip_j/2\rho)| = 0$, where $i, j = 1, 2, 3, 4, 5$.


THEOREM 5.2. If $p, q, r, s$ are four points with $\psi(p, q, r, s) = 0$ and if $q, r, s$
are on a great circle, then $p$ is also on that great circle.


Proof. Since a point has exactly one diametral point, no generality is lost if
it is assumed that $q$ and $s$ are not diametral points. Then $q$, $s$ determine a unique
great circle. If $p$ and any one of $q, r, s$ are diametral points, then $p$ lies on the
great circle through $q, r, s$ and the proof is complete. Suppose then, that $p$ is
not the diametral point of $q$, $r$, or $s$.

If the first row of the determinant (1) in the proof of Theorem 3.1 is
interchanged with the second, third, fourth, and fifth rows; and if $[4, 5]$ denotes the
cofactor of the element in the fourth row and fifth column of the determinant
so obtained, then it follows that $[4, 5] = 0$.

A fairly straightforward expansion of the determinant $[4, 5]$ utilizing the
spherical law of cosines as well as the spherical law of sines (see [1], pp. 183-185
for this type expansion), yields $\sin(\angle p{:}q, s) = 0$. Consequently, $p, q, s$ lie
on a great circle. Since $q, r, s$ lie on a great circle and $q$, $s$ determine a unique
great circle, it follows that $p, q, r, s$ lie on a great circle.

THEOREM 5.3. If $p, q, r, s$ are four points with $\psi(p, q, r, s) = 0$ and if $q, r, s$ lie
on a circle, then $p$ is also on that circle.


Proof. If $q, r, s$ lie on a great circle and $\psi(p, q, r, s) = 0$, then by Theorem 5.2,
$p$ also lies on that great circle. Suppose then that $q, r, s$ lie on a circle $C$ that is
not a great circle, $\psi(p, q, r, s) = 0$, and $p$ is not on $C$. It follows that one of the
great circles $C(p, q)$, $C(p, r)$, or $C(p, s)$, say $C(p, q)$, has another point $t$ common
to $C$. Then $\psi(q, r, s, t, p) = 0$ by Theorem 5.1, $\psi(q, r, s, p) = 0$ by hypothesis, and
$\psi(q, r, s, t) = 0$ by Theorem 4.1. Hence, the rank of $\psi(q, r, s, t, p)$ is three.
Consequently, $\psi(r, p, t, q) = 0$, and $p, t, q$ are on a great circle. Hence, by Theorem 5.2,
$q, r, t$ are on a great circle, contrary to the fact that $q, r, t$ lie on a circle which is
not a great circle. Therefore, $p$ lies on the circle of $q, r, s$.

Since each three points of the spherical plane lie on a circle, the following 
spherical analogue of Ptolemy’s Theorem and its converse has now been 
established. 


THEOREM 5.4. Four points $p_1, p_2, p_3, p_4$ of the spherical plane lie on a circle if
and only if the determinant


$\psi(p_1, p_2, p_3, p_4) = |\sin^2(p_ip_j/2\rho)| = 0$, where $i, j = 1, 2, 3, 4$.
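
Theorems 3.1 and 5.4 can be illustrated with a floating-point computation. The sketch below makes several assumptions that are not in the article: points are given by colatitude and longitude, `rho` defaults to 1, the function is called `psi` for the determinant $|\sin^2(p_ip_j/2\rho)|$, and a small tolerance stands in for exact vanishing.

```python
import math


def det(m):
    """Determinant by Gaussian elimination with partial pivoting."""
    m = [row[:] for row in m]
    n, result = len(m), 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if m[pivot][col] == 0.0:
            return 0.0
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result          # a row swap flips the sign
        result *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return result


def direction(colat, lon):
    """Unit vector for a point of the sphere given by colatitude and longitude."""
    return (math.sin(colat) * math.cos(lon),
            math.sin(colat) * math.sin(lon),
            math.cos(colat))


def psi(points, rho=1.0):
    """The determinant |sin^2(p_i p_j / 2 rho)| with shorter-arc distances."""
    def dist(u, v):
        dot = max(-1.0, min(1.0, sum(a * b for a, b in zip(u, v))))
        return rho * math.acos(dot)
    return det([[math.sin(dist(u, v) / (2 * rho)) ** 2 for v in points]
                for u in points])


on_circle = [direction(0.7, lon) for lon in (0.2, 1.0, 2.9, 4.4)]
off_circle = on_circle[:3] + [direction(1.2, 4.4)]
print(abs(psi(on_circle)) < 1e-9, psi(off_circle) < 0)
```

Four points of equal colatitude lie on a (small) circle, so `psi` should vanish up to rounding; moving one point off that circle should make the determinant strictly negative.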


COROLLARY. If $p_1, p_2, \ldots, p_n$ are $n$ pairwise distinct points of the spherical
plane ($n \geq 4$), then a necessary and sufficient condition that $p_1, p_2, \ldots, p_n$ lie on
a circle is that the determinant $\psi(p_1, p_2, \ldots, p_n) = |\sin^2(p_ip_j/2\rho)|$, where
$i, j = 1, 2, \ldots, n$, has rank three.


Proof. Suppose $p_1, p_2, \ldots, p_n$ are pairwise distinct points of the spherical
plane which lie on a circle. Then the leading principal minor of order three of
$\psi(p_1, p_2, \ldots, p_n)$ is nonzero, while the determinants obtained from this
principal minor by adjoining one row and one column or two rows and two columns
vanish by Theorem 4.1 and Theorem 5.1, respectively. Therefore, the rank of
$\psi(p_1, p_2, \ldots, p_n)$ is three.

Conversely, if the rank of $\psi(p_1, p_2, \ldots, p_n)$ is three, then every principal
minor of $\psi(p_1, p_2, \ldots, p_n)$ of order four is zero. It follows from Theorem 5.3
that each quadruple of the points, and hence the $n$ points, lie on a circle.

GROUPS WHICH ARE THE UNION OF THREE SUBGROUPS
M. BRUCKHEIMER, The Open University, England 
A. C. BRYAN and A. MUIR, The City University, London, England 
In this paper we consider groups which can be written as the (set-theoretic)
union of three subgroups. For completeness, we first dispose of the union of two
subgroups; although this result is well known, we use similar methods of proof
subsequently. (Throughout this paper we use union in the set union sense and
not as usually used in group theory.)


LEMMA 1. Except for trivial cases, a group cannot be the union of two subgroups. 


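Proof. Let $G$ be a group, and suppose there exist two subgroups $A$ and $B$
such that


$A \cup B = G$.

Clearly $A = G$ or $B = G$ (i.e., $B \subset A$ or $A \subset B$) represent possible cases: such
trivial cases, where one subgroup is a subset of another subgroup, will not be
considered in the future. Suppose, therefore, that we have the situation of Fig. 1,


$\bar{A} = A \setminus B$, $\bar{B} = B \setminus A$, $K = A \cap B$


FIG. 1


where $\bar{A} \neq \emptyset$ and $\bar{B} \neq \emptyset$. Let $\bar{a} \in \bar{A}$ and $\bar{b} \in \bar{B}$. Then

$\bar{a}\bar{b} \in A \Rightarrow \bar{b} \in A$,
$\bar{a}\bar{b} \in B \Rightarrow \bar{a} \in B$,


both of which contradict the hypotheses. Hence $\bar{a}\bar{b} \notin A \cup B = G$, which
contradicts the closure of $G$.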


We now turn to the three group case, and suppose that $A$, $B$, and $C$ are three
subgroups of $G$ such that


$A \cup B \cup C = G$.

If $A \subset B$, etc., we are effectively dealing with two subgroups of $G$, and this is
not possible by Lemma 1. Hence we have the configuration of Fig. 2, where
$\bar{A}$, $\bar{B}$ and $\bar{C}$ must be nonempty sets.

FIG. 2


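LEMMA 2. $L = M = N = \emptyset$.

Proof. Consider $l \in L$ and $\bar{c} \in \bar{C}$. Then

$l\bar{c} \in C \Rightarrow l \in C$,
$l\bar{c} \in B \Rightarrow \bar{c} \in B$,
$l\bar{c} \in A \Rightarrow \bar{c} \in A$,


all of which contradict the hypotheses. Since $\bar{C} \neq \emptyset$, we must have $L = \emptyset$.
Similarly $M = N = \emptyset$.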


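LEMMA 3. Each of $\bar{A}$, $\bar{B}$, and $\bar{C}$ contains its inverses.


Proof. The inverse $\bar{a}^{-1}$ of any element $\bar{a} \in \bar{A}$ either belongs to $\bar{A}$ or to $K$.
Suppose $\bar{a}^{-1} = k \in K$. Then, from $\bar{a}k = e \in K$, it follows that $\bar{a} \in K$. Hence
$\bar{a}^{-1} \in \bar{A}$.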


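LEMMA 4. If $\bar{a} \in \bar{A}$ and $\bar{b} \in \bar{B}$, then $\bar{a}\bar{b} \in \bar{C}$.
The proof follows by contradiction as before.

LEMMA 5. If $\bar{a}, \bar{a}_1 \in \bar{A}$, then $\bar{a}\bar{a}_1 \in K$.


Proof. The element $\bar{a}\bar{a}_1$ belongs either to $\bar{A}$ or to $K$. Suppose that it belongs
to $\bar{A}$, and consider $\bar{b}\bar{a}\bar{a}_1$, for some $\bar{b} \in \bar{B}$. Then by Lemma 4, used twice,


$(\bar{b}\bar{a})\bar{a}_1 \in \bar{B}$.

By hypothesis and Lemma 4 again, $\bar{b}(\bar{a}\bar{a}_1) \in \bar{C}$. Hence $\bar{a}\bar{a}_1 \in K$.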
LEMMA 6. $K$ is an invariant subgroup of $G$.

Proof. Clearly $K$ is a group. Further if $\bar{a} \in \bar{A}$ and $k \in K$, then

$\bar{a}^{-1}k\bar{a} \in K$.

For $k\bar{a}$ belongs to $\bar{A}$ or $K$; since $k\bar{a} \in K \Rightarrow \bar{a} \in K$, we must have $k\bar{a} \in \bar{A}$. Hence by
Lemmas 3 and 5, the result follows.

Clearly the left cosets $\bar{a}K$ of $K$ are subsets of $\bar{A}$. In fact

LEMMA 7. $\bar{a}K = \bar{A}$.


Proof. Suppose there is an element $\bar{a}_1 \in \bar{A}$, but $\bar{a}_1 \notin \bar{a}K$. Then by Lemmas 3
and 5, $\bar{a}^{-1}\bar{a}_1 = k \in K$. Hence $\bar{a}_1 = \bar{a}k$, for some $k \in K$.

It follows from this result and Lemma 4 that the factor group $G/K$ is the
Klein Four Group, $V$.


THEOREM 1. A group $G$ is the (nontrivial) union of three subgroups if and only
if it is homomorphic to the Klein Four Group.


Proof. One half of this theorem has been proved; it remains to show that if
$G$ is homomorphic to $V$ then it is the union of three subgroups. Let $K$ be the
kernel of the homomorphism and $\bar{A}$, $\bar{B}$, $\bar{C}$ the cosets of $K$. Consider the
structure $A$, where

$A = \bar{A} \cup K$.

(i) $A$ is closed, since

$\bar{A}\bar{A} = K \Rightarrow \bar{a}\bar{a}_1 \in K$,
$\bar{A}K = \bar{A} \Rightarrow \bar{a}k \in \bar{A}$,

for all $\bar{a}, \bar{a}_1 \in \bar{A}$ and $k \in K$.

(ii) $\bar{a} \in \bar{A} \Rightarrow \bar{a}^{-1} \in \bar{A}$, since each element in $V$ is its own inverse.
This completes the proof.

It follows from Theorem 1 that if $G$ is finite, it must be of order $4m$. Clearly
there are many groups of order $4m$ which are not the union of three subgroups.
Examples are the cyclic group $C_{4m}$ of order $4m$ and the alternating group $A_n$ for
$n \geq 4$. Also any locally cyclic group is not the union of three subgroups. This
follows from Lemma 5 by considering, say, the subgroup generated by an $\bar{a}$ and
a $\bar{b}$. This subgroup cannot be cyclic.
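
For small groups the statement of Theorem 1 can be checked by brute force. The following sketch is restricted, as an assumption made here for brevity, to finite abelian groups written as products of cyclic groups; the names `make_group`, `generated`, and `is_union_of_three` are illustrative only.

```python
from itertools import combinations, product


def make_group(moduli):
    """Elements and operation of the abelian group C_m1 x C_m2 x ... (additive)."""
    elems = list(product(*(range(m) for m in moduli)))

    def op(x, y):
        return tuple((a + b) % m for a, b, m in zip(x, y, moduli))

    return elems, op


def generated(gens, op, identity):
    """Subgroup generated by gens; in a finite group closure under op suffices."""
    sub, frontier = {identity}, [identity]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = op(x, g)
            if y not in sub:
                sub.add(y)
                frontier.append(y)
    return frozenset(sub)


def is_union_of_three(moduli):
    """True if the group is the set-theoretic union of three proper subgroups."""
    elems, op = make_group(moduli)
    identity = tuple(0 for _ in moduli)
    # A subgroup of a product of k cyclic groups needs at most k generators.
    subgroups = {generated(gens, op, identity)
                 for k in range(1, len(moduli) + 1)
                 for gens in combinations(elems, k)}
    proper = [s for s in subgroups if len(s) < len(elems)]
    full = frozenset(elems)
    return any(a | b | c == full for a, b, c in combinations(proper, 3))


for moduli in [(2, 2), (4, 2), (2, 2, 2), (8,), (3, 3)]:
    print(moduli, is_union_of_three(moduli))
```

The groups $V$, $C_4 \times C_2$, and $C_2 \times C_2 \times C_2$ have $V$ as a factor group and are reported as unions of three subgroups, while the cyclic group $C_8$ and the group $C_3 \times C_3$ of order 9 are not.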

On the other hand there are many examples of groups of order 4m which are 
the union of three subgroups. We shall consider a few examples before returning 
to more general considerations. We shall refer to the decomposition of a group 
into (three) subgroups and call a group which allows such a decomposition a 
3-group. 

Example 1. $V$ itself obviously admits such a decomposition.

$V \to \{C_2, C_2, C_2\}$.


Example 2. There are five groups of order 8. Disregarding $C_8$, we are left with
$C_4 \times C_2$, $C_2 \times C_2 \times C_2$, $D_4$, and $Q$ (the quaternion group). Each admits a
decomposition as follows:


$C_4 \times C_2 \to \{C_4, C_4, V\}$,
$C_2 \times C_2 \times C_2 \to \{V, V, V\}$,
$D_4 \to \{C_4, V, V\}$,
$Q \to \{C_4, C_4, C_4\}$.

It will be noticed that there are only four different decompositions possible for
a group of order 8, and this displays an example of each. (We consider two
decompositions $\{X, Y, Z\}$ and $\{X_1, Y_1, Z_1\}$ as the same if there exist
isomorphisms $X \to X_1$, $Y \to Y_1$, $Z \to Z_1$.)

Example 3. The dihedral group $D_{2m}$ of order $4m$ defined by


$a^{2m} = b^2 = (ab)^2 = e$,

admits a decomposition for all $m$. The subgroup $K$, where

$K = \{e, a^2, \ldots, a^{2(m-1)}\}$

is invariant, and the factor group $D_{2m}/K$ is $V$. The decomposition is

$D_{2m} \to \{C_{2m}, D_m, D_m\}$.

Example 4. The dicyclic group of order $4m$, defined by

$a^{2m} = e$, $a^m = (ab)^2 = b^2$,

admits a decomposition for all even $m$. The subgroup $K$, where

$K = \{e, a^2, \ldots, a^{2(m-1)}\}$


is invariant, and the factor group modulo $K$ is $V$. The subgroups of the
decomposition are $C_{2m}$ and two dicyclic groups, each of order $2m$.

We can immediately dispose of two questions: 

(i) Can a 3-group have different decompositions? 

(ii) Can two different 3-groups have the same decomposition? 
Both questions, as is to be expected, are answered in the affirmative by the 
following examples. 

Example 5. $C_2 \times D_4$ has two distinct decompositions:


$C_2 \times D_4 \to \{C_4 \times C_2, D_4, D_4\}$,
and
$C_2 \times D_4 \to \{C_4 \times C_2, C_2 \times C_2 \times C_2, C_2 \times C_2 \times C_2\}$.

Example 6. The group of order 16 defined by the generating relations

$a^2 = b^2 = c^2 = e$, $abc = bca = cab$,


has a decomposition $\{C_4 \times C_2, D_4, D_4\}$; from Example 5, this is also a
decomposition of $C_2 \times D_4$.

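If a 3-group is non-Abelian, then the subgroups of a decomposition can be
(locally) Abelian ($\mathfrak{A}$) or non-Abelian ($\mathfrak{N}$). This gives four possible types of
decompositions, namely, $\{\mathfrak{A}, \mathfrak{A}, \mathfrak{A}\}$, $\{\mathfrak{A}, \mathfrak{A}, \mathfrak{N}\}$, $\{\mathfrak{A}, \mathfrak{N}, \mathfrak{N}\}$, and $\{\mathfrak{N}, \mathfrak{N}, \mathfrak{N}\}$.
The decomposition $\{\mathfrak{A}, \mathfrak{A}, \mathfrak{N}\}$, however, cannot exist as is shown by


LEMMA 8. If $G \to \{A, B, C\}$ and $A$ and $B$ are Abelian, then $C$ is also Abelian.

Proof. Since $K \subset A$, $K$ is Abelian. If $\bar{c}, \bar{c}_1 \in \bar{C}$ then we can always write $\bar{c} = \bar{a}\bar{b}$
and $\bar{c}_1 = \bar{a}\bar{b}_1 = \bar{a}\bar{b}k$, for some $\bar{a} \in \bar{A}$; $\bar{b}, \bar{b}_1 \in \bar{B}$ and $k \in K$. It follows that $\bar{C}$ is
commutative. Similarly the elements of $\bar{C}$ and $K$ commute.

The remaining three types of decompositions all exist as the following
example shows.

Example 7.
$Q \to \{C_4, C_4, C_4\}$ is of form $G \to \{\mathfrak{A}, \mathfrak{A}, \mathfrak{A}\}$,
$D_6 \to \{C_6, S_3, S_3\}$ is of form $G \to \{\mathfrak{A}, \mathfrak{N}, \mathfrak{N}\}$,
$S_3 \times V \to \{D_6, D_6, D_6\}$ is of form $G \to \{\mathfrak{N}, \mathfrak{N}, \mathfrak{N}\}$.

We summarize these results in the following theorem:


THEOREM 2. Each decomposition of a 3-group is one of the forms $\{\mathfrak{A}, \mathfrak{A}, \mathfrak{A}\}$,
$\{\mathfrak{A}, \mathfrak{N}, \mathfrak{N}\}$ or $\{\mathfrak{N}, \mathfrak{N}, \mathfrak{N}\}$.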


If a 3-group G is Abelian then its center is G. For non-Abelian 3-groups we 
have the following two results: 


THEOREM 3. A non-Abelian 3-group $G$ has an Abelian decomposition, i.e.,
$G \to \{\mathfrak{A}, \mathfrak{A}, \mathfrak{A}\}$, if and only if the center of $G$ is $K$.


Proof. Let $G \to \{A, B, C\}$ and let $Z$ be the centre of $G$.

Firstly, assume that $A$, $B$, and $C$ are Abelian. Then $Z \supseteq K$. Suppose $\bar{a}_1 \in \bar{A} \cap Z$.
For each $\bar{c} \in \bar{C}$ there exists a $\bar{b} \in \bar{B}$ such that $\bar{c} = \bar{a}_1\bar{b}$; whence $\bar{c}b = b\bar{c}$, for all $b \in B$.
Since $Z \supseteq K$, it follows that the elements of $B$ and $C$ commute.

Also, for each $\bar{a} \in \bar{A}$ there exists some element $k \in K$ such that $\bar{a} = \bar{a}_1k$, so
that $\bar{a}b = b\bar{a}$, for all $b \in B$. Thus the elements of $A$ and $B$ commute. Similarly,
we can show that the elements of $A$ and $C$ commute.

It follows that $G$ is Abelian, which is a contradiction. Hence $Z = K$.

Conversely, suppose $Z = K$. For any fixed element $\bar{a}_1 \in \bar{A}$, there exist
elements $k \in K$ such that $\bar{a} = \bar{a}_1k$, for all $\bar{a} \in \bar{A}$. It follows that $A$ is Abelian.
Similarly $B$ and $C$ are Abelian.


THEOREM 4. If $G$ admits a decomposition $\{A, B, C\}$ of type $\{\mathfrak{A}, \mathfrak{N}, \mathfrak{N}\}$,
then the center $Z$ of $G$ is contained in $A$.


Proof. Suppose $\bar{b}_1 \in \bar{B} \cap Z$. For every $\bar{b} \in \bar{B}$ there exists a $k \in K$ such that
$\bar{b} = \bar{b}_1k$.

It follows that the elements of $\bar{B}$ and $K$ commute, and that $\bar{B}$ itself is
commutative. Therefore $B$ is Abelian, which is a contradiction. Similarly $\bar{C} \cap Z = \emptyset$.

Example 8. For $D_6 \to \{C_6, S_3, S_3\}$, the center $Z$ is such that $Z \cap \bar{A} \neq \emptyset$.

From Theorems 3 and 4 we have that if $G$ has a decomposition $\{A, B, C\}$
of type $\{\mathfrak{A}, \mathfrak{N}, \mathfrak{N}\}$, then the center $Z$ is either properly contained in $K$ or, at
most, it also contains elements of $\bar{A}$. Are there other possibilities when the
decomposition is of type $\{\mathfrak{N}, \mathfrak{N}, \mathfrak{N}\}$? It is clear that $Z$ cannot contain elements of
$\bar{A}$ and $\bar{B}$ and not $\bar{C}$, since $Z \cap A$ and $Z \cap B$ are groups and a group cannot be
expressed as the nontrivial union of two groups (Lemma 1). But there is the
possibility that $Z$ contains elements of $\bar{A}$, $\bar{B}$ and $\bar{C}$; in this case $Z$ is itself
necessarily a 3-group with decomposition $Z \to \{Z \cap A, Z \cap B, Z \cap C\}$. ($Z$ may, of
course, be a 3-group in other cases.) The existence of such a decomposition
requires that $K$ be non-Abelian. For, if $K$ were Abelian we could prove that, for
instance, $A$ was Abelian. The method of proof is similar to that used in
Theorem 4.

Example 9. For $S_3 \times V \to \{D_6, D_6, D_6\}$, none of $Z \cap \bar{A}$, $Z \cap \bar{B}$, $Z \cap \bar{C}$ is empty.

Let $I(A)$ be the set of inner automorphisms of $G$ defined by elements of $A$.
Then $I(A)$ is a subgroup of $I(G)$, the group of inner automorphisms of $G$; we
have, trivially, $I(G) = I(A) \cup I(B) \cup I(C)$. Therefore, we have


THEOREM 5. The group of inner automorphisms of a 3-group $G$ is either itself
a 3-group or degenerate, in the sense that it is one of $I(A)$, $I(B)$, or $I(C)$.


We note that degeneracy (as defined in Theorem 5) does not necessarily
preclude $I(G)$ from being a 3-group, as is shown by the next example. However,
as we are concerned with the structure of $I(G)$ relative to $G \to \{A, B, C\}$, this
definition is appropriate.


Example 10.
$Q \to \{C_4, C_4, C_4\}$; $I(G) = V$ (nondegenerate)
$D_6 \to \{C_6, S_3, S_3\}$; $I(G) = I(S_3) = S_3$ (degenerate)
$C_2 \times D_4 \to \{C_4 \times C_2, D_4, D_4\}$; $I(G) = I(D_4) = V$ (degenerate)
$C_2 \times D_4 \to \{C_4 \times C_2, C_2 \times C_2 \times C_2, C_2 \times C_2 \times C_2\}$; $I(G) = V$ (nondegenerate)


For an Abelian decomposition of a non-Abelian 3-group $G$, it follows from
Theorems 1 and 3 that $I(G) = G/Z = V$. Hence, we have


THEOREM 6. A non-Abelian 3-group has an Abelian decomposition if and only
if the group of inner automorphisms is the Klein Four Group.


Finally, we consider the relation between the 3-group structure and degen- 
eracy of I(G). This is given by 


THEOREM 7. The group of inner automorphisms of a 3-group is degenerate if
and only if the center contains elements other than elements from $K$.


Proof. Suppose $\bar{a}_1 \in \bar{A} \cap Z$. For every $\bar{b} \in \bar{B}$ there is an element $\bar{c} \in \bar{C}$ such
that $\bar{b} = \bar{c}\bar{a}_1$. If we denote by $i(g)$ the inner automorphism of $G$ defined by $g$,
then we have that $i(\bar{b}) = i(\bar{c})$. Therefore $I(\bar{B}) = I(\bar{C})$, and $I(B) = I(\bar{B}) \cup I(K)
= I(C)$. Hence $I(G)$ is degenerate.

In fact, since every element $\bar{a} \in \bar{A}$ is of the form $\bar{a} = \bar{a}_1k$, for some $k \in K$, we
have $i(\bar{a}) = i(k)$. Therefore $I(A) = I(K)$ and $I(G) = I(B)$.

Conversely, $I(B) \subset I(C)$ implies either


(i) $i(\bar{b}) = i(k) \Rightarrow \bar{b} = kz \Rightarrow Z \cap \bar{B} \neq \emptyset$,
or
(ii) $i(\bar{b}) = i(\bar{c}) \Rightarrow \bar{b} = \bar{c}z \Rightarrow Z \cap \bar{A} \neq \emptyset$.

CORRECTION TO “FUNCTIONAL ANALYSIS PROOFS OF SOME THEOREMS
IN FUNCTION THEORY”


L. A. RUBEL AND B. A. TAYLOR (vol. 76 (1969) 483-489)


There is a gap, pointed out to the authors by Benjamin Lepson, in the
argument supporting the Remark on page 486, since $h$ may have unwanted zeros.
To fill this gap, simply take $f = (g/h)k$, where $g$ and $h$ are as before, but where
$k$ has a high order zero at each unwanted zero of $h$, and where $k$ takes the value
1, to a high order, at each wanted zero of $h$.

OPTIMALLY SEPARATED CONTRACTIONS
D. J. NEWMAN, Yeshiva University, and L. RAYMON, Temple University


We introduce the class of (normalized) contractions, $K$, as the set of
functions, $f(x)$, defined on $[0, 1]$ and satisfying


$f(0) = 0$, $|f(x) - f(y)| \leq |x - y|$ for all $x, y$ in $[0, 1]$.


Under the usual sup-norm metric,


$\|f(x)\| = \max_{0 \leq x \leq 1} |f(x)|$,


the class K forms a compact set of functions and one which plays a central role 
in approximation theory. Our purpose in this note is to determine how far apart 
n elements of K can be from one another. 


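THEOREM. I. Given that $f_1, f_2, \ldots, f_n \in K$ we can choose $i \neq j$ such that
$\|f_i - f_j\| \leq 2/\langle \log_2 n \rangle$.
II. There exist $n$ functions $f_1, f_2, \ldots, f_n \in K$ such that for all $i, j$ ($i \neq j$) we have


$\|f_i - f_j\| \geq 2/\langle \log_2 n \rangle$.


(Here $\langle x \rangle$ denotes the smallest integer which isn’t smaller than $x$; e.g.,
$\langle 5 \rangle = 5$, $\langle \pi \rangle = 4$, $\langle -\sqrt{2} \rangle = -1$.)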


Proof. Clearly we need to prove I only for $n = 2^k + 1$ and II only for $n = 2^k$.
The basic consideration in either case is the collection of functions $\phi_1, \phi_2, \ldots,
\phi_{2^k}$, namely those continuous functions which vanish on the negative axis and
whose slope is constantly $+1$ or $-1$ in each of the intervals $((\nu - 1)/k, \nu/k)$,
where $\nu$ is any positive integer not exceeding $k$.

Proof of II. For our $2^k$ functions $f_i$ we choose the functions $\phi_i$ themselves.
Clearly these belong to $K$. If $i \neq j$, moreover, then there is a first interval
$((\nu - 1)/k, \nu/k)$ wherein the slopes of $\phi_i$ and $\phi_j$ are different (otherwise $\phi_i$ would
identically equal $\phi_j$, contradicting the fact that $i \neq j$). We may suppose that $\phi_i$
has slope $+1$, $\phi_j$ slope $-1$ in this interval; then


$\phi_i((\nu - 1)/k) = \phi_j((\nu - 1)/k)$,
$\phi_i(\nu/k) = \phi_i((\nu - 1)/k) + 1/k$,
$\phi_j(\nu/k) = \phi_j((\nu - 1)/k) - 1/k$.


Hence $\phi_i(\nu/k) - \phi_j(\nu/k) = 2/k$, and we conclude that $\|\phi_i - \phi_j\| \geq 2/k$ as required.
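
The separation claimed in part II can be confirmed for small $k$ by enumeration. The sketch below is an illustration under assumptions made here: each function is represented only by its values at the breakpoints $\nu/k$ (which suffices because the difference of two such piecewise linear functions attains its maximum modulus at a breakpoint), and the names `breakpoint_values` and `min_pairwise_distance` are hypothetical.

```python
from fractions import Fraction
from itertools import combinations, product


def breakpoint_values(signs):
    """Values at 0, 1/k, ..., 1 of the function whose slope on the
    nu-th interval ((nu - 1)/k, nu/k) is signs[nu - 1] (each +1 or -1)."""
    k = len(signs)
    values, total = [Fraction(0)], 0
    for slope in signs:
        total += slope
        values.append(Fraction(total, k))
    return values


def min_pairwise_distance(k):
    """Smallest sup-norm distance between two distinct functions of the family."""
    family = [breakpoint_values(s) for s in product((1, -1), repeat=k)]
    return min(max(abs(u - v) for u, v in zip(f, g))
               for f, g in combinations(family, 2))


for k in (1, 2, 3, 4):
    print(k, min_pairwise_distance(k))
```

For each $k$ tried, the smallest distance between two of the $2^k$ functions comes out as exactly $2/k$, in agreement with the argument above.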
For I we need the following fact: 


LEMMA. If $f \in K$ then
$\|f(x) - k(k+1)^{-1}\phi_m(((k+1)x - 1)/k)\| \leq 1/(k+1)$


for at least one of our functions $\phi_m$.


Proof. $f(x)$ is given. Define a sequence of $\pm 1$’s inductively by setting,
for $\nu = 1, 2, \ldots, k$,


$\epsilon_\nu = \operatorname{sg}[(k+1) f((\nu+1)/(k+1)) - (\epsilon_1 + \epsilon_2 + \cdots + \epsilon_{\nu-1})]$


(the empty sum being interpreted as 0) where we define $\operatorname{sg} x = +1$ for $x \geq 0$ and
$-1$ for $x < 0$.

Choose $\phi_m$ so as to have the slopes $\epsilon_\nu$ throughout $[(\nu - 1)/k, \nu/k]$. We claim
that this choice satisfies our requirements. We simply proceed by induction to
prove that, for $\nu = 0, 1, 2, \ldots, k$,


$|f(x) - k(k+1)^{-1}\phi_m(((k+1)x - 1)/k)| \leq 1/(k+1)$


throughout $[\nu/(k+1), (\nu+1)/(k+1)]$. This is clearly true for $\nu = 0$ so we assume
it to hold below $\nu \leq k$ and proceed to the case of $\nu$ itself.
By definition of $\phi_m$, we have, throughout $[\nu/(k+1), (\nu+1)/(k+1)]$,


$k(k+1)^{-1}\phi_m(((k+1)x - 1)/k) = (\epsilon_1 + \epsilon_2 + \cdots + \epsilon_{\nu-1})/(k+1) + (x - \nu/(k+1))\epsilon_\nu$.


Hence the required inequality can be written as $-1 \leq g(x) \leq 1$ throughout
$[\nu/(k+1), (\nu+1)/(k+1)]$, where

$g(x) = [(k+1)f(x) - (\epsilon_1 + \epsilon_2 + \cdots + \epsilon_{\nu-1})]/\epsilon_\nu - ((k+1)x - \nu)$.


Now note that the function $g(x)$ is nonincreasing. It is therefore sufficient
to show that


$g(\nu/(k+1)) \leq 1$, $g((\nu+1)/(k+1)) \geq -1$.

The first inequality follows by the inductive hypothesis since

$g(\nu/(k+1)) = [(k+1)f(\nu/(k+1)) - (\epsilon_1 + \cdots + \epsilon_{\nu-1})]/\epsilon_\nu$.

As for the second inequality we have


$g((\nu+1)/(k+1)) = [(k+1)f((\nu+1)/(k+1)) - (\epsilon_1 + \cdots + \epsilon_{\nu-1})]/\epsilon_\nu - 1$


and this is $\geq -1$, since, by the definition of $\epsilon_\nu$,

$[(k+1)f((\nu+1)/(k+1)) - (\epsilon_1 + \cdots + \epsilon_{\nu-1})]/\epsilon_\nu \geq 0$.
We can now give the 


Proof of I. Let $f_1, f_2, \ldots, f_{2^k+1}$ be given and for each of these $f$’s determine
the corresponding $\phi_m$ given by the lemma. For at least two of these, say $f_i$ and
$f_j$, we obtain the same $\phi_m$. Thus


$\|f_i(x) - k(k+1)^{-1}\phi_m([(k+1)x - 1]/k)\| \leq 1/(k+1)$ and
$\|f_j(x) - k(k+1)^{-1}\phi_m([(k+1)x - 1]/k)\| \leq 1/(k+1)$ so that
$\|f_i(x) - f_j(x)\| \leq 2/(k+1)$.


Partially supported by NSF GP 7504. 


CONCERNING THE ITERATED $\phi$ FUNCTION
P. A. CATLIN, Carnegie-Mellon University


Define a set of functions $\phi^n$ mapping integers greater than 1 into the natural
numbers, such that $\phi^0(a) = a$ and $\phi^{n+1}(a) = \phi(\phi^n(a))$, where $\phi$ is Euler's function.
Furthermore, define the class $C(a)$ of $a$ to be the integer $n$ such that $\phi^n(a) = 2$,
and let $M$ be the set of natural numbers with the least value of any member in
their respective classes. Harold Shapiro made the conjecture in the last paragraph
of his article [2] that each element of $M$ was prime, and W. H. Mills
[1] exhibited several composite members of $M$. The purpose of this note is to
extend their results concerning the factorization of elements of $M$.
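
The definitions above are easy to experiment with. The following is a minimal sketch, not part of the original note: the function names `totient`, `shapiro_class`, and `least_members` are illustrative choices, and the totient is computed by plain trial division, which is adequate only for small arguments.

```python
def totient(n):
    """Euler's function phi(n), computed by trial division."""
    result = n
    m = n
    p = 2
    while p * p <= m:
        if m % p == 0:
            while m % p == 0:
                m //= p
            result -= result // p
        p += 1
    if m > 1:
        result -= result // m
    return result


def shapiro_class(a):
    """Return C(a): the integer n with phi^n(a) = 2, for a >= 2."""
    if a < 2:
        raise ValueError("the class is defined only for integers greater than 1")
    n = 0
    while a != 2:
        a = totient(a)
        n += 1
    return n


def least_members(limit):
    """Map each class n to its least member, scanning 2..limit in increasing order."""
    least = {}
    for a in range(2, limit + 1):
        least.setdefault(shapiro_class(a), a)
    return least


if __name__ == "__main__":
    members = least_members(50)
    # Classes 0 through 4 have least members 2, 3, 5, 11, 17.
    print([members[n] for n in range(5)])
```

Because the scan runs in increasing order, every class whose least member does not exceed `limit` is recorded correctly; classes with larger least members simply do not appear in the result.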

H. Shapiro gave a theorem (Theorem 15, [2]) which stated essentially that
if $S$ is the set of numbers in class $n$ that are less than $2^{n+1}$, then the factors of
an element of $S$ are in $S$. This is also true for odd elements of $M$.


THEOREM 1. If $m$ is an odd element of $M$, the factors of $m$ are in $M$.


Proof. It is sufficient to show that an arbitrary factor is in $M$. If $m \in M$ is
prime, then the theorem is obvious. Otherwise, if $m \in M$ is odd, then so are its
factors. Let $m = ab$. By Theorem 1 of [2], $C(m) = C(a) + C(b)$.

If $b \notin M$ then there must be a lower number $c \in M$ in the same class as $b$.
However, by Theorem 1 of [2], we see that $C(m) = C(a) + C(b) = C(ac)$ where
$ac < m$, contradicting the definition of $M$. Thus, $b \in M$, proving the theorem.

As a consequence of Theorem 7 of [2], we can make a similar statement for
even members of $M$: in this case, the only prime factor of an element of $M$ is
2, which is in $M$. However, 2 is the only known even member of $M$.

The following theorem shows that the existence of finitely many primes in
$M$ is equivalent to an apparently weaker condition:


THEOREM 2. There are finitely many primes in $M$ if and only if there are
finitely many odd numbers in $M$.


Proof. If there are infinitely many primes in $M$, then obviously there are
infinitely many odd numbers. Conversely, if an odd prime $p \in M$ then $p^\alpha \in M$
for at most a finite number of $\alpha$. This follows since $p^\alpha \in M$ implies

\[ 2^{\alpha C(p)} = 2^{C(p^\alpha)} < p^\alpha < 2^{\alpha C(p)+1} \]

or

\[ 2^{C(p)} < p < 2^{C(p)+1/\alpha}, \]

which is false for $\alpha$ sufficiently large. Thus, if $M$ contains only finitely many odd
primes then it contains only finitely many odd prime powers; and by Theorem 1,
only finitely many odd integers.

As a corollary, Theorem 2 implies that $S$ contains infinitely many primes if
and only if it contains infinitely many odd numbers. This follows because for
any $m \in M$ there are finitely many $s \in S$ such that $m < s < 2^{C(m)+1}$, so if $M$ contains
finitely many primes, so does $S$; the converse follows from $M \subset S$.

The questions of whether or not $M$ and $S$ contain infinitely many odd integers
remain open. In all likelihood the smallest integer of each class is odd.

A NOTE ON COMMUTING AUTOMORPHISMS OF RINGS


JIANG LUH, North Carolina State University at Raleigh


Let $R$ be a ring. An automorphism $T$ of $R$ is called a commuting automorphism
of $R$ if $x^T x = x x^T$ for each $x$ in $R$. If, in addition, $T$ is not the identity
automorphism of $R$, then $T$ is called a nontrivial commuting automorphism. In [1],
Divinsky has proved that if a simple Artinian ring possesses a nontrivial commuting
automorphism, then it is a field.

The purpose of this note is to generalize the Divinsky result for arbitrary 
prime rings. The proof here is simpler and shorter than that given by Divinsky 
for the simple Artinian case. 


THEOREM. Let $R$ be a prime ring. If $R$ possesses a nontrivial commuting
automorphism, then $R$ is a commutative integral domain.


As an immediate consequence of the theorem, we have 


COROLLARY. Let $R$ be a primitive ring. If $R$ possesses a nontrivial commuting
automorphism, then $R$ is a field.

To prove the theorem, we shall require the following lemma due to Divinsky
[1]. For easy reference, we exhibit his proof here.


LEMMA 1. If $T$ is a commuting automorphism of a ring $R$, then for each $x, y \in R$,
$(x - x^T)[x, y] = 0$, where $[x, y] = xy - yx$.

Proof. Polarizing $[x, x^T] = 0$ gives $[x, y^T] = [x^T, y]$ and hence $[x, (xy)^T]
= [x^T, xy]$. But $[x, (xy)^T] = x^T[x, y^T]$ and $[x^T, xy] = x[x^T, y] = x[x, y^T]$. Thus,
$(x - x^T)[x, y^T] = 0$. Replacing $y^T$ by $y$ since $T$ is an automorphism, we establish
the lemma.


COROLLARY 1. If $T$ is a commuting automorphism of a ring $R$, then for every
$x, y \in R$, $(x - x^T)R[x, y] = 0$.


Proof. We note $z[x, y] = [x, zy] - [x, z]y$ for $z \in R$ and use Lemma 1.


COROLLARY 2. Let $R$ be a prime ring and $T$ a commuting automorphism of $R$.
If $x \in R$, $x \ne x^T$, then $x \in C$, the center of $R$.


Proof. By Corollary 1, $(x - x^T)R[x, y] = 0$ for all $y \in R$. Since $x - x^T \ne 0$, the
primeness of $R$ implies $[x, y] = 0$ for all $y \in R$.


Proof of the theorem. Since $T$ is nontrivial, there exists $x \in R$ such that $x \ne x^T$
and by Corollary 2, $x \in C$. Suppose there exists $y \in R$, $y \notin C$; then $x + y \notin C$, and
by Corollary 2, $y^T = y$ and $(x + y)^T = x + y$; so $x^T = x$, a contradiction. This
completes the proof.


This work was done while the author was at Wright-Patterson Air Force Base under the
contract NO. F 33615-67-C-1758 of Ohio State University.

EIGENVALUES OF A MATRIX OF RANK k
LOUIS BRAND, University of Houston

In an $n \times n$ matrix $A$ of rank $k$ all determinants of order $> k$ are zero; hence
its characteristic equation is

\[ \lambda^n - a_1\lambda^{n-1} + a_2\lambda^{n-2} - \cdots + (-1)^k a_k\lambda^{n-k} = 0, \]

where $a_r$ is the sum of all $r$-rowed principal minors of $A$. Thus $A$ has the eigenvalue
0 at least of multiplicity $n - k$; and its $k$ remaining eigenvalues are roots of

\[ \lambda^k - a_1\lambda^{k-1} + a_2\lambda^{k-2} - \cdots + (-1)^k a_k = 0, \]

some of which may also be 0.
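
As an illustrative check of the statement above (a sketch, not part of the original note), the sums $a_r$ of $r$-rowed principal minors can be computed exactly for a small integer matrix. The helper names `det` and `principal_minor_sums` and the sample matrix are assumptions made for the example; the determinant uses cofactor expansion, which is practical only for small matrices.

```python
from itertools import combinations


def det(m):
    """Determinant by cofactor expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def principal_minor_sums(a):
    """Return [a_1, ..., a_n], where a_r is the sum of all r-rowed principal minors."""
    n = len(a)
    sums = []
    for r in range(1, n + 1):
        s = 0
        for idx in combinations(range(n), r):
            s += det([[a[i][j] for j in idx] for i in idx])
        sums.append(s)
    return sums


# A 3 x 3 matrix of rank 2: the third row is the first row minus the second.
A = [[1, 2, 3],
     [0, 1, 4],
     [1, 1, -1]]

coefficients = principal_minor_sums(A)  # a_1 = 1, a_2 = -8, a_3 = 0
```

Here $n = 3$ and $k = 2$, so $a_3 = 0$ and the characteristic equation reduces to $\lambda(\lambda^2 - \lambda - 8) = 0$: one zero eigenvalue and two roots of the reduced quadratic.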


Since $a_1$ is the trace of $A$, the result in [1] follows from this result for $k = 1$.

RESEARCH PROBLEMS


EDITED BY VICTOR KLEE 


In this Department the Monthly presents easily stated research problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied 
by relevant references (if any are known to the author) and by a brief description of known
partial results. Material should be sent to Victor Klee, Department of Mathematics, University 
of Washington, Seattle, WA 98105. 


WHAT IS THE MAXIMUM LENGTH OF A d-DIMENSIONAL SNAKE? 
VICTOR KLEE, University of Washington


Let $I(d)$ denote the graph formed by the vertices and edges of a d-dimensional
cube. The vertices are the $2^d$ d-tuples of 0's and 1's, and two vertices are
joined by an edge if and only if they differ in exactly one coordinate. As the
term is used here, a d-dimensional snake is a simple circuit $C$ formed from vertices
and edges of $I(d)$ such that any edge of $I(d)$ joining two vertices of $C$ is in
fact an edge of $C$. The figures below show two circuits in $I(3)$, the first being of
length 8 and the second of length 6. The second is a snake but the first is not,
as the defining condition for snakes is violated by each of the four edges omitted
from the first circuit. In fact, no 3-dimensional snake has more than 6 vertices.


[Figure: two circuits in $I(3)$, labeled "nonsnake" (length 8) and "snake" (length 6).]


The title problem asks for an evaluation (or, lacking that, for a good estimate
of asymptotic behavior as $d \to \infty$) of $S(d)$, the maximum length of a d-dimensional
snake. $S(d)$ is known exactly for only a few values of $d$; specifically,
$S(2) = 4$, $S(3) = 6$, $S(4) = 8$, $S(5) = 14$, and $S(6) = 26$. Beyond that, it is known
that

\[ (*) \qquad \frac{7}{4(d-1)} \le \frac{S(d)}{2^d} \le \frac{1}{2} - \frac{1 - 12/2^d}{7d(d-1)^2 + 2} \qquad \text{for all } d \ge 6. \]

The lower bound is due to Danzer and Klee [4], the upper bound to Douglas 
[7]. The value of S(5) was found by Even [8] and follows from a more general 
result of Singleton [15] and Douglas [6] stated below. The value of S(6) was 
found by Davies [5] as the result of a computer search. 
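
For very small $d$ the value of $S(d)$ can be confirmed by exhaustive search. The following is a minimal sketch under stated assumptions, not the method used in any of the cited papers: the function name `max_snake_length` is illustrative, vertices are encoded as integers whose bits are the coordinates, and the search grows chord-free paths from the vertex 0 (no loss of generality, since the cube's symmetries carry any vertex to any other). It is practical only for small $d$.

```python
def max_snake_length(d):
    """Length of the longest snake (chord-free simple circuit) in the d-cube graph."""
    n = 1 << d
    # Two vertices are adjacent when their bit patterns differ in exactly one coordinate.
    neighbors = [[v ^ (1 << i) for i in range(d)] for v in range(n)]
    on_path = [False] * n
    on_path[0] = True
    path = [0]
    best = 0

    def extend():
        nonlocal best
        last = path[-1]
        for w in neighbors[last]:
            if on_path[w]:
                continue
            # Path vertices adjacent to w, other than the vertex we arrive from.
            touching = [u for u in neighbors[w] if on_path[u] and u != last]
            if not touching:
                # w keeps the path chord-free, so continue growing it.
                on_path[w] = True
                path.append(w)
                extend()
                path.pop()
                on_path[w] = False
            elif touching == [0] and len(path) >= 3:
                # w touches only the start vertex: adding it closes a snake.
                best = max(best, len(path) + 1)

    extend()
    return best


if __name__ == "__main__":
    for dim in (2, 3, 4):
        print(dim, max_snake_length(dim))
```

The search reproduces the values $S(2) = 4$, $S(3) = 6$, and $S(4) = 8$ quoted above; the running time grows quickly with $d$, which is consistent with $S(6)$ having required a dedicated computer search.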

Snakes were first studied by Kautz [10], who called them unit-distance error-checking
codes because they can be used to add an error-checking feature to
certain analog-to-digital conversion systems. Zuravlev [17] noted a relationship
between the values of $S(\cdot)$ and the efficiency of certain algorithms for simplifying
disjunctive normal forms in Boolean algebra. Black [2] suggested a use for
snakes in the design of electronic combination locks. Estimates for $S(d)$ that
were superseded by (*) were given by Abbott [1], Danzer and Klee [4], Glagolev
[9], Kautz [10], Larman [13], Ramanujacharyulu and Menon [14],
Singleton [15], and Vasil'ev [16]. An expository account was given by Klee [12].

In addition to the work on snakes, some attention has been devoted to more
restricted classes of circuits that have better error-checking properties. A
d-dimensional circuit code of spread $s$ (also called SIB$_s$ code or circuit code of
minimum distance $s$) is a simple circuit $C$ in $I(d)$ such that any two vertices of $C$
differing in exactly $r$ coordinates, with $r < s$, can be joined by a path formed from
$r$ edges of $C$. Every circuit is of spread 1, and the circuits of spread 2 are precisely
the snakes. The snake in the figure above is also of spread 3, but the figure
below depicts a 4-dimensional snake of length 8 that is not of spread 3, as is seen
by considering the vertices $x_0$ and $y_0$. In fact, no 4-dimensional circuit of spread
3 has more than 6 vertices.


[$I(4)$ is represented by two copies of $I(3)$, and the eight edges of $I(4)$ not
shown join a vertex in one copy to the corresponding vertex in the other copy.
The snake in question is formed from the edges $x_0x_1$ and $y_0y_1$ in addition to the
six heavy edges.]

Let $C(d, s)$ denote the maximum length of circuits in $I(d)$ of spread $s$. Then
$C(d, 1) = 2^d$, $C(d, 2) = S(d)$, and for $s > 2$ the existing knowledge about $C(d, s)$ is
even more meager than for $s = 2$. The following values are among those established
by Singleton [15] and Douglas [6]: $C(d, s) = 2d$ for $d < \lfloor 3s/2 \rfloor + 2$;
$C(\lfloor 3s/2 \rfloor + 2, s) = 4s + 6$ for even $s$; $C(\lfloor 3s/2 \rfloor + 2, s) = 4s + 4$ for odd $s$;
$C(\lfloor 3s/2 \rfloor + 3, s) = 4s + 8$ for odd $s \ge 9$. For general $d$ and $s$, lower bounds on
$C(d, s)$ have been given by Singleton [15] and Klee [11], upper bounds by
Chien, Freiman, and Tang [3] and Douglas [7].


Preparation of this paper was supported in part by the Office of Naval Research and the Boeing
Scientific Research Laboratories.

ON THE WRONSKIAN TEST FOR INDEPENDENCE
R. A. HORN, The Johns Hopkins University

Let $n \ge 1$ be an integer and let $\{u_k(x)\}_{k=1}^{n}$ be $n$ functions each of which is
$n-1$ times differentiable on some interval $I = (a, b) \subset \mathbb{R}$. In courses on elementary
differential equations the Wronskian matrix

\[ K(u_1, \ldots, u_n; x) = \begin{bmatrix} u_1(x) & \cdots & u_n(x) \\ u_1^{(1)}(x) & \cdots & u_n^{(1)}(x) \\ \vdots & & \vdots \\ u_1^{(n-1)}(x) & \cdots & u_n^{(n-1)}(x) \end{bmatrix}, \qquad x \in I, \]

and the Wronskian $W(u_1, \ldots, u_n; x) = \det K(u_1, \ldots, u_n; x)$ are introduced in
order to test whether a set of solutions of a normalized homogeneous linear
differential equation of order $n$ is a linearly independent set on $I$. Using the
differential equation, one shows that the Wronskian of a set of solutions either
vanishes identically or never vanishes on $I$. Thus, a set of functions $\{u_k(x)\}_{k=1}^{n}$
is linearly dependent on $I$ if and only if $W(u_1, \ldots, u_n; x) = 0$ for all $x \in I$,
provided that there is some one nth order normalized homogeneous linear differential
equation on $I$ of which each $u_k(x)$ is a solution, $1 \le k \le n$.

Quite naturally, the alert student will want to know if the latter condition 
can be dropped so that the identical vanishing of the Wronskian can be used as 
a test for linear dependence of a general set of (sufficiently smooth) functions. 
Some textbooks anticipate this question by providing an example to show that 
it cannot be dropped; a typical example is $u_1(x) = x^2$, $u_2(x) = x|x|$ on $I = (-1, 1)$.
The student, however, frequently feels that the example is “unnatural” and that
for the usual smooth functions one meets in calculus books the identical vanishing
of the Wronskian on $I$ should be a sure test for dependence on $I$. This feeling
is justified by the following easy result.
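
The typical counterexample, taken here to be the pair $x^2$ and $x|x|$ on $(-1, 1)$, can be checked by direct computation. The sketch below is illustrative only: the derivatives are written out by hand (the derivative of $x|x|$ is $2|x|$), exact rational arithmetic is used at a few sample points, and all function and variable names are assumptions of the example.

```python
from fractions import Fraction


def u1(x):
    return x * x


def du1(x):
    return 2 * x


def u2(x):
    return x * abs(x)


def du2(x):
    # x|x| is differentiable everywhere, with derivative 2|x|.
    return 2 * abs(x)


def wronskian(x):
    """W(u1, u2; x) = u1 * u2' - u1' * u2."""
    return u1(x) * du2(x) - du1(x) * u2(x)


# The Wronskian vanishes at every sample point of (-1, 1).
samples = [Fraction(k, 10) for k in range(-9, 10)]
wronskian_values = [wronskian(x) for x in samples]

# Independence: c1*u1 + c2*u2 = 0 at x = 1/2 and x = -1/2 forces c1 = c2 = 0,
# because the 2 x 2 matrix of sampled values has nonzero determinant.
p, q = Fraction(1, 2), Fraction(-1, 2)
sample_det = u1(p) * u2(q) - u1(q) * u2(p)
```

The Wronskian is zero throughout the interval, yet `sample_det` is nonzero, so no nontrivial linear combination of the two functions vanishes on all of $(-1, 1)$.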


THEOREM. Let $I = (a, b) \subset \mathbb{R}$ and let $\{u_k(x)\}_{k=1}^{n}$ be a set of $n$ analytic functions
on $I$, $n \ge 1$. These functions are linearly dependent on $I$ if and only if
$W(u_1, \ldots, u_n; x) = 0$ for all $x \in I$.


Proof. The “only if” statement is well known and does not require analyticity.
Analyticity is essential for the converse, however, as the above example
shows. Since the theorem is trivial for $n = 1$, we may assume $n \ge 2$ and by selection
of a suitable subset of functions and reordering we may also assume that
$W(u_1, \ldots, u_{n-1}; x)$ is not identically zero on $I$. Thus, there is some nonempty
interval $I' = (c, d) \subset I$ such that $W(u_1, \ldots, u_{n-1}; x) \ne 0$ for all $x \in I'$, and hence

\[ \frac{W(u_1, \ldots, u_{n-1}, u; x)}{W(u_1, \ldots, u_{n-1}; x)} = u^{(n-1)}(x) + P_1(x)u^{(n-2)}(x) + \cdots + P_{n-1}(x)u(x) = 0 \]

is a normalized homogeneous linear differential equation of order $n-1$ with
continuous coefficients on $I'$. Since there can be no more than $n-1$ linearly
independent solutions on $I'$ and since each of the $n$ functions $u_1(x), \ldots, u_n(x)$
is a solution, we conclude that these functions must be dependent on $I'$. Thus,
there exist constants $c_1, \ldots, c_n$, not all zero, such that
$f(x) = c_1u_1(x) + \cdots + c_nu_n(x) = 0$ for
all $x \in I'$. But $f(x)$ is analytic on $I$ and vanishes identically on a nonempty open
set $I' \subset I$, so $f(x) = 0$ for all $x \in I$. Thus, the set $\{u_k(x)\}_{k=1}^{n}$ is dependent on $I$.

If $W(u_1, \ldots, u_n; x) \equiv 0$ on $I$ and if $W(u_1, \ldots, u_{n-1}; x) \ne 0$ for all $x \in I$, we
see from the proof that analyticity is not required in order to conclude that
the functions are dependent on $I$. This generalizes the familiar fact that if
$W(u_1, u_2; x) \equiv 0$ on $I$ and if $u_1(x) \ne 0$ for all $x \in I$, then $u_1$ and $u_2$ are dependent.

For the validity of the theorem it would, of course, be sufficient to assume
only that $W(x) = 0$ for all $x$ in some nonempty open subset of $I$, or that $W(x) = 0$
for all $x$ in some infinite subset of $I$ with an accumulation point interior to $I$, or
to make any other assumption about $W(x)$ which forces it to vanish everywhere
on $I$. The same proof also shows that the Wronskian of a finite set of functions
analytic in some region of the complex plane will vanish identically if and only
if the functions are dependent in the region.

A STRONGER VERSION OF A METRIZATION THEOREM OF K. MORITA
J. H. WESTON, J. R. SHILLETO, AND S. A. RANKIN, University of Saskatchewan, Regina
In 1955, the following metrization theorem was proven by K. Morita [2].


THEOREM (Morita). A $T_1$ space $X$ is metrizable if and only if there exists a
sequence $\{\mathcal{F}_n \mid n \text{ a positive integer}\}$ of closed, locally finite coverings of $X$ such that
for any open set $U$ and $x \in U$ there is a positive integer $n$ with
$\bigcup\{F \in \mathcal{F}_n \mid x \in F\} \subset U$.


The proof which appears in [1] suggests that the local finiteness condition
may be weakened. In fact, we make the following observation.


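DEFINITION. A collection $\mathcal{A}$ of subsets of a topological space is called closure
preserving if and only if for each subcollection $\mathcal{B}$ of $\mathcal{A}$,
$(\bigcup\{B \mid B \in \mathcal{B}\})^- = \bigcup\{B^- \mid B \in \mathcal{B}\}$.


DEFINITION. For two collections of sets $\mathcal{A}$ and $\mathcal{B}$, let
$\mathcal{A} \wedge \mathcal{B} = \{A \cap B \mid A \in \mathcal{A}, B \in \mathcal{B}\}$.


We obtain $\mathcal{A}_1 \wedge \mathcal{A}_2 \wedge \cdots \wedge \mathcal{A}_n$ by extending the definition inductively to collections
$\mathcal{A}_1, \mathcal{A}_2, \ldots, \mathcal{A}_n$, for an arbitrary integer $n > 1$.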


THEOREM. If $\mathcal{A}$ and $\mathcal{B}$ are closed, closure preserving collections of subsets of a
topological space, then $\mathcal{A} \wedge \mathcal{B}$ is closed and closure preserving.


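Proof. Let $\mathcal{A} = \{A_i \mid i \in I\}$, $\mathcal{B} = \{B_j \mid j \in J\}$ and let $R \subset I \times J$. To show
$F = \bigcup\{A_i \cap B_j \mid (i, j) \in R\}$ is closed, it suffices to verify that each $x \notin F$ has a
neighborhood not meeting $F$. Since $x \notin (A_i \cap B_j)$ for each $(i, j) \in R$ we have
either $x \notin A_i$ or $x \notin B_j$. Let $p_1: I \times J \to I$ and $p_2: I \times J \to J$ be the projections, let
$I' \subset p_1R$ be those indices for which $x \notin A_i$ and let $J' \subset p_2R$ be those indices for
which $x \notin B_j$. Then the set $W = [Y - \bigcup\{A_i \mid i \in I'\}] \cap [Y - \bigcup\{B_j \mid j \in J'\}]$ is
open, since the families $\mathcal{A}$ and $\mathcal{B}$ are closed and closure preserving, and $W$ clearly
contains $x$. Moreover, $W \cap F = \emptyset$: for, given $(A_i \cap B_j)$ with $(i, j) \in R$, either
$i \in I'$ or $j \in J'$; if $i \in I'$ then $A_i \cap B_j \subset A_i$ whereas $W \subset Y - A_i$, and similarly, if
$j \in J'$ then $A_i \cap B_j \subset B_j$ whereas $W \subset Y - B_j$.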

We remark that every locally finite collection of subsets of a topological 
space is closure preserving but the converse is not true. 


THEOREM. A $T_1$ space $X$ is metrizable if and only if there exists a sequence
$\{\mathcal{F}_n \mid n \text{ a positive integer}\}$ of closed, closure preserving coverings of $X$ such that for
any open set $U$ and $x \in U$ there exists a positive integer $n$ with
$\bigcup\{F \in \mathcal{F}_n \mid x \in F\} \subset U$.


Proof. Simply define $\mathcal{G}_n = \mathcal{F}_1 \wedge \mathcal{F}_2 \wedge \cdots \wedge \mathcal{F}_n$, apply the above theorem, and
without further modification carry out the proof as given in [1, p. 196].

A PHYSICAL APPLICATION OF A REARRANGEMENT INEQUALITY
M. S. KLAMKIN, Ford Motor Company, Dearborn, Mich.


A usual problem in calculus and differential equations is to find the time of 
discharge of a tank with liquid through an orifice. Here the relevant differential 
equation is 


\[ dV/dt = A(h)(dh/dt) = -cA_0\sqrt{2gh}, \]


where
$h$ = height of liquid above orifice,
$A(h)$ = cross-sectional area of tank at height $h$,
$A_0$ = area of orifice,
$g$ = gravitational constant,
$c$ = coefficient of discharge; approximately
= .61 or $\pi/(\pi+2)$ for turbulent flow.

For a nonroutine problem in tank discharge, consider the following exercise 
[1]: 

Shown below are schematic figures for two identical tanks (except for orien- 
tation) which are frustums of a right circular cone. If both tanks are initially full 
of water, which tank will empty out first? (Assume that the orifice areas are 
congruent and that the tanks start discharging at the same time.) 


The usual method of solving this problem is to integrate the two relevant 
differential equations to find the times of discharge and then to compare times. 
A much more elegant solution can be gotten physically by noting that discharge 
rate for tank B is greater than the corresponding discharge rate for tank A when 
compared at equal cross-sections of B which are above its middle cross-section. 
This more than makes up for the low cross-sections when the discharge rate for 
tank B is less than that for tank A. 

The above result will also hold for an arbitrary tank (and its inversion) whose 
cross-sectional area A (h) is a monotonic decreasing function of the height above 
the orifice. Mathematically, this is equivalent to showing that 


\[ \int_0^H \frac{A(h)\,dh}{\sqrt{h}} > \int_0^H \frac{A(H-h)\,dh}{\sqrt{h}}. \]


(Note that here $A(H - h)$ is the area function for the inverted tank.)

The latter inequality will follow from the more general one:
If $D(x)$ and $E(x)$ are arbitrary monotonic decreasing functions, then

\[ I = \int_0^{2a} D(x)E(x)\,dx > J = \int_0^{2a} D(2a - x)E(x)\,dx. \]

Proof. $I$ and $J$ can be rewritten as

\[ I = \int_0^{a} \{D(x)E(x) + D(2a - x)E(2a - x)\}\,dx, \]

\[ J = \int_0^{a} \{D(2a - x)E(x) + D(x)E(2a - x)\}\,dx. \]

Then,

\[ I - J = \int_0^{a} \{D(x) - D(2a - x)\}\{E(x) - E(2a - x)\}\,dx > 0 \]

since the integrand is nonnegative.
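
The tank comparison can also be checked numerically. Separating variables in the discharge equation shows that the emptying time is proportional to $\int_0^H A(h)\,dh/\sqrt{h}$, so comparing the two tanks amounts to comparing two such integrals. The sketch below is illustrative only: the frustum dimensions (radii 2 and 1, height 1) and the names `discharge_integral`, `wide_at_bottom`, and `inverted` are assumptions of the example, and the substitution $h = t^2$ is used to remove the singularity at $h = 0$ before applying Simpson's rule.

```python
import math


def discharge_integral(area, height, steps=2000):
    """Approximate the integral of area(h)/sqrt(h) over [0, height].

    With h = t**2 the integral becomes 2 * integral of area(t**2) dt over
    [0, sqrt(height)], which is evaluated by Simpson's rule (steps must be even).
    """
    upper = math.sqrt(height)
    dt = upper / steps
    total = area(0.0) + area(upper * upper)
    for i in range(1, steps):
        t = i * dt
        total += (4 if i % 2 else 2) * area(t * t)
    return 2.0 * total * dt / 3.0


H = 1.0


def wide_at_bottom(h):
    """Cross-sectional area of a frustum whose radius shrinks from 2 (h = 0) to 1 (h = H)."""
    return math.pi * (2.0 - h) ** 2


def inverted(h):
    """Area function A(H - h) of the same tank turned upside down."""
    return wide_at_bottom(H - h)


time_factor_wide = discharge_integral(wide_at_bottom, H)
time_factor_inverted = discharge_integral(inverted, H)
```

For these dimensions the integrals have the closed forms $86\pi/15$ and $56\pi/15$, so the tank whose cross-section decreases with height above the orifice takes longer to empty, in agreement with the inequality.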

A discrete analog of the previous problem would be to arrange ” persons at 
specified positions on one side of a see-saw to obtain the maximum turning 
moment. Physically, it is intuitive that the heaviest person should be at the 
end position, the next heaviest person at the next to end position, etc. A math- 
ematical proof is given by the following rearrangement theorem for two se- 
quences [2] and is analogous to the previous integral proof: 


If $\{a\}$ and $\{b\}$ are two given sequences except for arrangement, then $\sum ab$ is
greatest when $\{a\}$ and $\{b\}$ are monotonic in the same sense and least when they are
monotonic in opposite senses.


Proof. Suppose that the $a$'s are in ascending order but not the $b$'s. Then there
are a $j$ and a $k$ such that $a_j \le a_k$ and $b_j > b_k$. Since

\[ a_jb_k + a_kb_j - (a_jb_j + a_kb_k) = (a_k - a_j)(b_j - b_k) \ge 0, \]

we do not diminish $\sum ab$ by exchanging $b_j$ and $b_k$. A finite number of such
exchanges leads to an ascending order of the $b$'s. The other half of the theorem
is proved in the same way.
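
For short sequences the rearrangement theorem can be confirmed by brute force over all arrangements, which mirrors the see-saw problem directly. The following is a small illustrative sketch; the positions and weights are made-up sample data and the function names are assumptions of the example.

```python
from itertools import permutations


def sum_of_products(a, b):
    """Return the sum of a_i * b_i for two equally long sequences."""
    return sum(x * y for x, y in zip(a, b))


def extreme_arrangements(a, b):
    """Return (least, greatest) value of sum(ab) over all arrangements of b."""
    a_sorted = sorted(a)
    values = [sum_of_products(a_sorted, perm) for perm in permutations(b)]
    return min(values), max(values)


# Hypothetical see-saw data: distances from the pivot and weights of four persons.
positions = [1, 2, 3, 4]
weights = [50, 80, 65, 70]

least, greatest = extreme_arrangements(positions, weights)
same_sense = sum_of_products(sorted(positions), sorted(weights))
opposite_sense = sum_of_products(sorted(positions), sorted(weights, reverse=True))
```

The greatest turning moment is attained with both sequences ascending (heaviest person at the end position) and the least with the sequences ordered in opposite senses, as the theorem asserts.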

More general rearrangement theorems are given in [2, Chap. X].


GEOMETRY PREPARATION FOR HIGH SCHOOL MATHEMATICS TEACHERS


RUTH E. M. WONG, University of Hawaii


Introduction. The high school geometry course has been the subject of much 
debate in recent years and numerous recommendations have been made by 
individuals and professional groups. Accompanying these diverse proposals for 
modification in the high school geometry program have been concern for and 
indecision regarding the nature of adequate preparation in geometry for math- 
ematics teachers. 

As one aspect of a study on geometry for teachers [1], a survey of selected 
teacher education institutions was conducted during the academic year 1967-68, 
to ascertain the nature of geometry preparation provided and to obtain recom- 
mendations for the future. 

Stratified sampling was used with the following three categories: state uni- 
versities, teachers colleges, and other teacher education institutions. For pur- 
poses of this investigation, “state university” was defined as a state institution 
which offers graduate degrees at the doctoral level in liberal arts, general, and 
professional fields; “teachers college” meant not only an institution with “teach- 
ers” in its name, but any state college or institution which prior to 1963 was 
primarily concerned with teacher preparation as indicated by its accreditation 
in teacher education only; “other” included liberal arts colleges and private 
universities with general and professional programs. Only institutions accredited 
by nationwide or regional accrediting associations were considered for possible 
inclusion. Wherever possible at least one institution in each category was 
selected from each state. 

Two forms of a questionnaire were developed, one each for the mathematics 
and education departments in each institution. Respondents were generally 
chairmen of their respective departments or individuals responsible for teacher 
training. Replies were received from at least one department in each of 155 
institutions, making a 92.2 percent return from the institutions contacted, and 
from both departments in 130 institutions. Of the 130 institutions, 48 were state 
universities, 43 were teachers colleges, and 39 were other teacher education 
institutions. Since one or two questions were not answered on most forms, the 
total number of responses varied from item to item. Percents were calculated 
on the basis of total responses for a given item. 


Geometry Courses for Teachers. Prospective secondary school mathematics
teachers were required to take (or majority took) at least one course in geometry
in 93 percent of the 123 institutions reporting. In most cases, institutions offered
more geometry courses than were required of prospective teachers. As shown in 
Table 1, the number of required courses varied from zero to three while the 
number of available courses ranged from zero to five. 


TABLE 1. Number of Institutions Offering Designated Number of Geometry Courses

                                        Number of Courses
                              0        1        2        3        4        5
                            f   %    f   %    f   %    f   %    f   %    f   %
a. Required of Teachers     8   7   62  50   44  36    9   7    0        0
b. Available to Teachers    6   5   32  27   48  37   25  19   12   9    4   3


In a breakdown by type of institution, it appeared that state universities 
offered a wider range of geometry courses than either teachers colleges or liberal 
arts and private colleges, state universities being the only ones offering as many 
as five courses. This was not surprising since state universities, by the definition 
used in this study, offered the widest range of fields of study and generally had 
larger enrollments and faculties than the others. The few institutions which had 
no geometry courses available were among the ‘other’ teacher education institu- 
tions. Again, it would seem that the nature of their program, as in the case of 
small liberal arts colleges, influenced their restricted offering. 

Table 2 shows the number of geometry courses available to prospective 
mathematics teachers by type of institution. The number of courses most com- 
monly offered by both state universities and teachers colleges was two, available 
in approximately 43 percent of them. The same percentage of other institutions 
offered one course. 


TABLE 2. Number of Geometry Courses Available to Prospective Mathematics Teachers

                            Institutions
Number of      State        Teachers       Other          Total
Courses       f     %       f     %       f     %        f     %
0             0     0       0     0       6    4.6       6    4.6
1             5    3.8     11    8.5     19   14.6      35   26.9
2            19   14.6     19   14.6     10    7.8      48   36.9
3            11    8.5      9    6.9      5    3.8      25   19.2
4             5    3.8      4    3.1      3    2.3      12    9.2
5             4    3.1      0     0       0     0        4    3.1
Total        44   33.8     43   33.1     43   33.1     130   99.9


A classification of existing courses, based on specific textbooks reported and
course descriptions and outlines provided, was made as follows: “contemporary
geometry” was defined as a course primarily concerned with axiomatic structure 
and a re-examination of elementary Euclidean geometry, “survey of geometry” 
as a course which treats several geometries, and “college geometry” as the tra- 
ditional advanced Euclidean geometry. This admittedly rough classification 
revealed that of 121 institutions reporting, 66 percent offered contemporary 
geometry, 40 percent projective geometry, 34 percent survey of geometry, 12 
percent college geometry, 11 percent non-Euclidean geometry, and 10 percent 
differential geometry. 

Institutions were generally in favor of emphasizing transformations in college 
level geometry, 60 percent indicating this. The majority were also in favor of 
approaching transformations from algebraic and synthetic points of view. 

The chi square statistic was used to test the independence of these responses 
from department and type of institution. 

There was no significant difference between departments or among institu- 
tions in response to the question regarding emphasis on transformations in 
geometry for teachers. However, it may be noted that there was a higher percentage
of ‘yes’ and lower percentages of ‘no’ and ‘undecided’ responses from
mathematics departments than from education departments. More specifically, 
Table 3 shows that 68/100 or 68 percent of the mathematics departments advo- 
cated emphasis on transformations, while 49/95 or approximately 50 percent 
of the education departments favored it. Six out of 100 or 6 percent of the math- 
ematics departments and 11/95 or approximately 12 percent of the education 
departments were not in favor of transformations, and 26/100 or 26 percent of 
the mathematics departments and 35/95 or approximately 36 percent of the 
education departments were undecided. 

Among the institutions, state universities had the highest percentage of 
‘yes’ and other institutions the highest percentage of ‘undecided’ responses. 


TABLE 3. Emphasis on Transformations in College Geometry by Departments

                      Departments
Response       Mathematics      Education          Total
                f      %         f      %         f      %
Yes            68    34.9       49    25.1       117    60.0
No              6     3.1       11     5.6        17     8.7
Undecided      26    13.3       35    17.9        61    31.3
Total         100    51.3       95    48.7       195   100.0


There were significant differences between responses from the two departments
regarding algebraic and synthetic approaches to transformations at
P < .05; i.e., chi square values as large as 8.847 for Table 4 and 6.793 for Table 5
could occur by random sampling alone less than five out of a hundred times.
While the majority of each of the departments were in favor of an algebraic 
approach to transformations, mathematics departments gave a much higher 
‘yes’ response and education departments a relatively high ‘undecided’ response. 
As shown in Table 4, 65 out of 91 or approximately 71 percent of the mathemat- 
ics departments advocated an algebraic approach while 49/89 or approximately 
55 percent of the education departments were in favor of it. ‘Undecided’ re- 
sponses were given by 15/91 or approximately 16 percent of the mathematics 
departments in contrast to 32/89 or approximately 36 percent of the education
departments. 


TABLE 4. Transformations Developed from Algebraic Point of View by Departments

                      Departments
Response       Mathematics      Education          Total
                f      %         f      %         f      %
Yes            65    36.1       49    27.2       114    63.3
No             11     6.1        8     4.4        19    10.6
Undecided      15     8.3       32    17.8        47    26.1
Total          91    50.6       89    49.4       180   100.0


The significant differences between departments on the synthetic approach 
to transformations were again due to a high percentage (50/83 or 60%) of ‘yes’ 
responses from mathematics departments and a higher percentage (33/81 or 
41%) of ‘undecided’ responses from education departments as shown in Table 5. 
These differences may have been due to the division existing in many institu- 
tions between subject matter and professional training, with decisions as to 
subject matter left wholly to mathematics departments. 


TABLE 5. Transformations in College Geometry Approached Synthetically by Departments

                      Departments
Response       Mathematics      Education          Total
                f      %         f      %         f      %
Yes            50    30.5       34    20.7        84    51.2
No             14     8.5       14     8.5        28    17.1
Undecided      19    11.6       33    20.1        52    31.7
Total          83    50.6       81    49.4       164   100.0
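

The chi square values quoted for Tables 4 and 5 can be recomputed from the tabulated frequencies. The sketch below is an illustration, not the author's computation: it applies the usual Pearson statistic for a test of independence to the 3 x 2 frequency tables (two degrees of freedom), and uses the fact that for two degrees of freedom the tail probability is exactly exp(-x/2). The function and variable names are assumptions of the example.

```python
import math


def chi_square(table):
    """Pearson chi square statistic for independence in a table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    return stat


def p_value_two_df(stat):
    """Tail probability of the chi square distribution with 2 degrees of freedom."""
    return math.exp(-stat / 2.0)


# Rows: Yes, No, Undecided. Columns: Mathematics, Education.
table3 = [[68, 49], [6, 11], [26, 35]]
table4 = [[65, 49], [11, 8], [15, 32]]
table5 = [[50, 34], [14, 14], [19, 33]]

chi3, chi4, chi5 = (chi_square(t) for t in (table3, table4, table5))
```

With these frequencies the statistic comes out at about 8.847 for Table 4 and 6.793 for Table 5, both with tail probability below .05, while the value for Table 3 falls short of the .05 critical value of 5.991, consistent with the report of no significant difference between departments on that question.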


There was no real difference among institutions on questions of algebraic
and synthetic approaches to transformations. However, the highest percentage
of ‘yes’ responses was given by state universities and the highest percentage of
‘undecided’ by ‘other’ institutions.

Of 210 respondents, 62 percent were satisfied with the geometry courses 
being offered to prospective teachers in their institutions. Despite this high 
degree of satisfaction expressed, when asked for recommendations for change, 
only 20 percent recommended no change. Thirty-three percent felt that there 
was a need to change emphases in existing courses, 31 percent recommended 
adding courses to those presently available or requiring additional courses, 11 
percent suggested both a change of emphasis and additional coursework, and 
5 percent suggested other changes. 

There were 134 institutions which responded to an invitation to comment 
on the teacher education program in geometry. Remarks made by eight or more 
individuals were as follows: 


a. Satisfied with present program                                                22
b. Use variety of approaches to geometry; include study of various geometries   14
c. Teacher emphasis rather than content; teacher needs to acquire “spirit” of
   geometry, appreciation of mathematics                                        13
d. Relate geometry to rest of mathematics, to other subject areas               10
e. Stress axiomatic structure, logic, proof                                     10
f. Change emphasis from that of developing research mathematicians;
   relevance to high school                                                      8
g. Increase requirements in geometry                                             8
g. Increase requirements in geometry 8 


Of those expressing satisfaction with their present program, 72.7 percent 
were from institutions offering a contemporary geometry course as described 
earlier. Mathematics departments advocated more strongly relating geometry 
to the rest of mathematics and to other subject matter areas, stressing axiomatic 
structure and proof, and increasing requirements. The greatest discrepancies 
among institutions occurred in the following ways: frequent mention by ‘other’ 
institutions of stress on axiomatic structure in contrast to one such response 
from state universities; a high frequency of suggestions for teacher emphasis 
rather than content only on the part of teachers colleges compared to a single 
similar suggestion from ‘other’ institutions; most recommendations for change 
of emphasis from that of developing research mathematicians from state univer- 
sities in contrast to none from teachers colleges. Some of these differences may 
be a natural consequence of the program emphasized in each type of institution. 
For example, the training of research mathematicians may have dominated the 
graduate program in mathematics in state universities and may have caused 
concern on the part of those involved in teacher education that emphasis in 
undergraduate courses may not be appropriate for prospective teachers. On the 
other hand, teachers colleges whose primary focus has always been teacher 
preparation might have found this of less concern. Also, teachers colleges would 
probably have given greater consideration to aspects of the teacher preparation 
program beyond content of mathematics courses than ‘other’ institutions where 
the major emphasis is liberal arts. 


Views on High School Geometry. Since teacher preparation in geometry 
cannot be isolated from the high school geometry program, recommendations
on secondary school geometry were also solicited from the same institutions. 

There was no marked expression of satisfaction or dissatisfaction toward 
the high school geometry program as exemplified by SMSG geometry. Of the 
201 responding to the question of whether they were satisfied, 35 percent said 
‘yes,’ 30 percent ‘no,’ and 34 percent ‘undecided.’ There were no significant 
differences in response between departments or among institutions. This was 
the first item, however, in which there was a higher percentage of ‘undecided’ 
responses from mathematics departments than from education departments. 
This was probably due to the fact that the secondary schools and their programs 
are of major concern to those in education departments engaged in the training 
of high school teachers and hence there was more familiarity on their part with 
SMSG geometry and the benefits and problems of teaching it to high school 
students. 

In analyzing responses by type of institution, it is interesting to note that 
while differences were very slight, the highest frequency among state univer- 
sities was in a ‘no’ response, the highest among teachers colleges ‘undecided,’ 
and the highest among other institutions ‘yes.’ 

Responses regarding various changes recommended in high school geometry 
are summarized in Table 6. 


TABLE 6. Changes Recommended in High School Geometry

                                           Response
Change Advocated                   Yes         No       Undecided    Total
                                  f    %     f    %      f    %
Less Emphasis on Formal Proof     58   31    99   53     30   16      187
More Coordinate Geometry         139   72    25   13     28   15      192
Vector Approach                   73   40    41   22     71   38      185
Transformations Approach          82   44    37   20     68   36      187
More Rigor                        41   22    97   53     45   25      183


There were significant differences between departments on the first two 
items in Table 6. The great majority (73%) of mathematics departments felt 
that there should not be less emphasis on formal proof whereas more education 
departments favored less emphasis than not as shown in Table 7. The differences 
were significant at the .001 level; hence they could have arisen by chance less 
than once in a thousand times. 

Also, while coordinate geometry was favored by the group as a whole, a 
higher percentage of education departments advocated it and a higher percent- 
age of mathematics departments were against it. The latter result was especially 
surprising in the light of the strong algebraic point of view expressed by math- 
ematics departments earlier in relation to transformations in geometry courses 
TABLE 7. Less Emphasis on Formal Proof Advocated for High School Geometry

                     Departments
Response       Mathematics     Education         Total
                f      %        f      %        f       %
Yes            14     7.5      44    23.5      58     31.0
No             69    36.9      30    16.0      99     52.9
Undecided      11     5.9      19    10.2      30     16.0

Total          94    50.3      93    49.7     187    100.0


for teachers. The differences between departments shown in Table 8 were signifi- 
cant at the .01 level. 


TABLE 8. More Coordinate Geometry Advocated for High School Geometry

                     Departments
Response       Mathematics     Education         Total
                f      %        f      %        f       %
Yes            64    33.3      75    39.1     139     72.4
No             20    10.4       5     2.6      25     13.0
Undecided      12     6.3      16     8.3      28     14.6
Total          96    50.0      96    50.0     192    100.0


Reactions to questions relating to vector and transformations approaches 
to high school geometry were characterized by a relatively high percentage of 
‘undecided’ responses. One possible explanation for the high ‘undecided’ re- 
sponse is that unlike coordinate geometry which was recommended as early as 
1959 and has been incorporated in instructional materials for high school which 
have been available for several years, vector geometry and transformation 
geometry for high school can still be considered in the experimental stages with 
few materials readily available. 

Institutions differed significantly (P <.05) in their response to more rigor. 
As shown in Table 9, the majority of state universities did not advocate more 
rigor, but less than half of the institutions in each of the other categories ex- 
pressed similar views. More teachers colleges were in favor of more rigor than 
any of the other institutions and the high ‘undecided’ response from ‘other’ 
institutions was evident again. 

In considering the question of more rigor in relation to the earlier one on less 
emphasis on formal proof, one might be tempted to think of them as opposites 
and assume that the majority view against less emphasis on formal proof would
imply support of more rigor in high school geometry. This was not the case. The 
majority did not advocate more rigor. It appeared that either respondents were 
making a distinction between emphasis on formal proof and rigor, or were saying 
that the present high school geometry program was satisfactory in terms of 
proof and rigor (neither more nor less) and that no change was recommended. 


TABLE 9. More Rigor Advocated for High School Geometry by Type of Institution

                               Institutions
Response         State        Teachers        Other           Total
               f     %       f     %       f     %        f       %
Yes           11    6.0     16    8.7     14    7.7      41     22.4
No            49   26.8     25   13.7     23   12.6      97     53.0
Undecided     14    7.7     11    6.0     20   10.9      45     24.6
Total         74   40.4     52   28.4     57   31.1     183    100.0
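
The significance levels quoted for Tables 7, 8, and 9 can be checked from the printed frequencies. The article does not name the test used; the sketch below assumes a Pearson chi-square test of independence on each table. The function name `chi_square` and the constants are illustrative, and the critical values are the standard tabulated ones for 2 and 4 degrees of freedom.

```python
def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


# Rows are Yes / No / Undecided; columns follow the printed tables.
TABLE_7 = [[14, 44], [69, 30], [11, 19]]              # Mathematics, Education
TABLE_8 = [[64, 75], [20, 5], [12, 16]]               # Mathematics, Education
TABLE_9 = [[11, 16, 14], [49, 25, 23], [14, 11, 20]]  # State, Teachers, Other

# Standard chi-square critical values, keyed by (degrees of freedom, level).
CRITICAL = {(2, 0.01): 9.210, (2, 0.001): 13.816,
            (4, 0.05): 9.488, (4, 0.01): 13.277}

if __name__ == "__main__":
    for name, table in (("Table 7", TABLE_7), ("Table 8", TABLE_8), ("Table 9", TABLE_9)):
        stat, df = chi_square(table)
        print(f"{name}: chi-square = {stat:.2f} with {df} degrees of freedom")
```

Under this assumption the three statistics fall on the sides of the critical values that the text reports: beyond the .001 value for Table 7, between the .01 and .001 values for Table 8, and between the .05 and .01 values for Table 9.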


Summary. The high rate of return in the survey of colleges and universities 
was interpreted as an indication of interest in and concern for the problem of 
teacher preparation in geometry. At the same time the relatively high percent- 
age of ‘undecided’ and omitted responses on many questions confirmed the 
general indecision indicated in the literature. 

The survey revealed that present training of mathematics teachers at most 
of the selected institutions includes at least one required course in geometry, 
with Euclidean geometry from the contemporary viewpoint being the most 
frequent offering. While there seemed to be general satisfaction with teacher 
preparation in geometry, the majority hoped to see some change in the existing 
programs. 

The inclusion of transformations in geometry for teachers was supported 
on the basis of strong recommendations from mathematicians and educators. 

The high school geometry course desired by most university mathematicians 
and educators seemed to be one similar to SMSG geometry in degree of emphasis 
on formal proof and rigor but with more coordinate geometry incorporated. 

Differences of opinion between mathematicians and educators may have 
been due to a number of factors, including lack of knowledge or appreciation 
of the situation. In any case, it must be recognized that these individuals occupy 
positions in their respective institutions where they can influence decisions on 
the teacher preparation program. Significant differences of opinion among such 
persons seem to suggest the need for careful and thoughtful consideration of 
the bases for all recommendations. They point to the advisability of continued 
collaboration among university mathematicians, educators, and competent 
experienced teachers, not only in preparing instructional materials for high
school geometry and determining the content of mathematics courses for 
teachers, but in examining the total preparation program of mathematics 
teachers. Each one—the mathematician, the mathematics educator, the high 
school teacher—has deep interest in, intimate knowledge of, and rich experience 
in his own specialized area, possessed by no one outside his field. Each should 
thus have a unique contribution to make in this determination. 
ELEMENTARY PROBLEMS


Solutions of Elementary Problems should be sent to Problems Group, Mathematics
Department, University of Maine, Orono, ME 04473. To facilitate their consideration,
solutions of Elementary Problems in this issue should be typed (with double spacing)
on separate, signed sheets and should be mailed before April 30, 1970. Contributors
(in the United States) who desire acknowledgement of receipt of their solutions are
asked to enclose self-addressed stamped postcards.


E 2208. Proposed by D. Rameswar Rao, Secunderabad, India 


Let $n \ge 5$ and $2 \le b \le n$. Prove

$$\left[\frac{(n-1)!}{b}\right] \equiv 0 \pmod{b-1}.$$
E 2209. Proposed by M. S. Klamkin, Ford Scientific Laboratory


Determine the locus of the centroids of all triangles similar to a given triangle 
and inscribed in another given triangle. 


E 2210. Proposed by R. N. Lloyd, University of Nottingham, England 


While the Dormouse lay sleeping in the middle of the table, the Mad 
Hatter and the March Hare sat down for tea. A number of places had been laid, 
and the Hatter and the Hare each moved alternately to a seat next to his old 
one. Each changed his direction of movement when, and only when, either the 
other had just changed his or else so as to avoid their both sitting on the same 
chair. Tea finished when both the Hatter and the Hare were again sitting in 
their initial positions. After how many moves was the tea-party over? 


E 2211. Proposed by M. D. Landau, Lafayette College


An exercise in Rudin, Principles of Mathematical Analysis, states
$\int_x^{x+1} \sin t^2 \, dt < 2/x$ for $x > 0$. Can this bound be improved to some
smaller multiple of $1/x$?


E 2212. Proposed by G. E. Peterson, University of Missouri at St. Louis


If $\{s_n\}$ and $\{t_n\}$ are sequences such that $\sum_{k=1}^{\infty} |t_k|$ converges and
$\lim_{n\to\infty} \left(\sum_{k=1}^{n} s_k t_k\right)/s_n$ exists and is nonzero, then
$\lim_{n\to\infty} s_n$ exists and is nonzero.


E 2213. Proposed by H. Demir, Middle East Technical University, Ankara, 
Turkey 


Let us say that a (planar) polygon has the Nagel property if the lines through 
the vertices of the polygon and bisecting the perimeter of the polygon are con- 
current. It is known that all triangles have the Nagel Property and that not all 
quadrilaterals have the property. Determine the simple nondegenerate quadri- 
laterals that have the Nagel property. 


E 2214. Proposed by M.S. Klamkin, Ford Scientific Laboratory and B. Ross 
Taylor, York High School 


It is intuitive that every simple n-gon (n > 3) possesses at least one interior
diagonal. For a simple n-gon what is the least number of diagonals which, except 
for their endpoints, lie wholly in its interior? 

SOLUTIONS OF ELEMENTARY PROBLEMS 
A Necessary and Sufficient Condition 


E 2143 [1969, 82, 413]. Proposed by Peter Kornya, University of British
Columbia


In a triangle with sides a, b, c the line joining the centroid and the incenter
is perpendicular to the bisector of the angle opposite side c. Show that the
arithmetic mean of a, b, c equals the harmonic mean of a and b.
I. Solution by F. Leuenberger, Feldmeilen, Switzerland. Let G denote the
centroid, I the incenter, $\Delta$ the area, $h_a$ and $h_b$ the altitudes to sides a and b,
and r the inradius of triangle ABC. Let P and Q denote the points of intersection
of line GI with sides BC and CA; note that CP = CQ, because PQ is perpendicular
to the bisector of angle C. Since triangles GPC and GQC have the same
total area as triangles IPC and IQC, we have $\tfrac{1}{3}h_a + \tfrac{1}{3}h_b = 2r$. The desired
equation now follows immediately from

$$h_a = \frac{2\Delta}{a}, \qquad h_b = \frac{2\Delta}{b}, \qquad \text{and} \qquad r = \frac{2\Delta}{a+b+c}.$$


II. Solution by L. D. Goldstone, Watervliet, N. Y. In trilinear normals, the
line through $P(f, g, h)$ parallel to $l\alpha + m\beta + n\gamma = 0$ is

$$\frac{lf + mg + nh}{l\alpha + m\beta + n\gamma} = \frac{af + bg + ch}{a\alpha + b\beta + c\gamma}$$

(cf. Smith, Conic Sections, p. 346). GI is parallel to the external bisector of
angle C (for which $\alpha + \beta = 0$), hence take $l = m = 1$, $n = 0$. Let $P = I(1, 1, 1)$,
$f = g = h = 1$. Hence

$$\frac{1+1}{\alpha+\beta} = \frac{a+b+c}{a\alpha + b\beta + c\gamma}.$$

The line passes through $G(1/a, 1/b, 1/c) = (\alpha, \beta, \gamma)$, which implies

$$\frac{1+1}{(1/a) + (1/b)} = \frac{a+b+c}{1+1+1}.$$


Also solved by Anders Bager (Denmark), Leon Bankoff, M. G. Greening (Australia), C. V. 
Subba Rama Iyer (India), Vinay G. Kane (India), Norman Miller, Simeon Reich (Israel), Sister 
Stephanie Sloyan, P. Naga Sundaram (India), C. S. Venkataraman (India), A. W. Walker, Charles 
Wexler, Mark Yu, and the proposer. 


Polynomial Approximations on All of the Real Line 
E 2165 [1969, 413]. Proposed by J. L. Kazdan, University of Pennsylvania 


If $f \in C(R)$ can be uniformly approximated throughout R by polynomials,
then f is itself a polynomial. Supply a proof, or a counterexample.


Solution by D. M. Bloom, Brooklyn College. The statement is true. If polynomials
$P_i$ approach f uniformly, then for some n we have $|P_i(x) - f(x)| < 1$
for all $i \ge n$, all x in R. Hence for all $i \ge n$, $|P_i(x) - P_n(x)| < 2$ (all real x), so that
$P_i - P_n$ is a bounded polynomial, and hence is constant. Taking limits as
$i \to \infty$, $f - P_n$ is constant, and hence $f = P_n + C$ is a polynomial.


Also solved by S. M. Ambler, Joel Anderson, Anders Bager (Denmark), D. R. Brillinger, 
Charles Dunham, R. L. Enison, G. F. Feissner, D. E. Frohardt, D. S. Greenstein, D. W. Hadwin, 
D. A. Herrero, Ellen Hertzmark, G. A. Heuer, Kenneth Lang & Steven Minsker, Douglas Lind,
Dan Marcus, M. D. Mavinkurve (India), Henrik Meyer (Denmark), Bryan Powers, Steve Rohde, 
R. A. Struble, Philip Trauber, J. E. Wilkins, Jr., Mark Yu, and the proposer. 
Tossing a Polygon onto a Checkerboard


E 2166 [1969, 413]. Proposed by Michael Stolnicki, Oakland Community 
College, Michigan 

Let a checkerboard consist of squares with sides of length 4. A regular 
4n-gon with radius 1 is tossed on the board. Determine the probability that the 
polygon will cross a line of the checkerboard. 


Solution by Clyde Kessel, student, John Hersey High School, Prospect Heights, 
Ill. Without loss of generality we can assume the center of the polygon falls on 
or within one given square, at a distance d from one of the closest sides. Erect 
the perpendicular through the center of the polygon to the given side of the 
square. This perpendicular will form angles with the radii of the polygon. One
of the angles with the smallest absolute value will be labelled $\theta$. The vertex
associated with this radius is the closest vertex to the given side since its
distance is $d - \cos\theta$ and is a minimum when $\cos\theta$ is a maximum.

If the polygon crosses the given edge of the square d must be less than $\cos\theta$.
Since the number of sides of the polygon is a multiple of 4, the situation with
respect to all four sides of the square is equivalent. Therefore the center of the
polygon must be within $\cos\theta$ units from a side of the square. The probability
that this will occur is

$$(16\cos\theta - 4\cos^2\theta)/16. \tag{1}$$

This is the ratio of the area of the region in which the center may lie so that the
polygon will intersect the square, to the area of the square.

$|\theta|$ must be less than $\pi/4n$. Therefore the probability that an angle close to
any given $\theta$ will occur is

$$4n\,d\theta/\pi. \tag{2}$$

The probability that $\theta$ will occur and the polygon will intersect the square is the
product of (1) and (2). The required probability is the sum of this product for
all values of $|\theta|$, that is

$$\frac{4n}{\pi}\int_0^{\pi/4n}\left(\cos\theta - \frac{1}{4}\cos^2\theta\right)d\theta = \frac{4n}{\pi}\sin\frac{\pi}{4n} - \frac{n}{4\pi}\sin\frac{\pi}{2n} - \frac{1}{8}.$$
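
As a hedged numerical check of the closing formula, the sketch below evaluates the closed form $\frac{4n}{\pi}\sin\frac{\pi}{4n} - \frac{n}{4\pi}\sin\frac{\pi}{2n} - \frac{1}{8}$ and compares it with a midpoint-rule approximation of $\frac{4n}{\pi}\int_0^{\pi/4n}(\cos\theta - \frac{1}{4}\cos^2\theta)\,d\theta$. The function names and the step count are illustrative choices, not part of the published solution.

```python
import math


def crossing_probability(n: int) -> float:
    """Closed form for a regular 4n-gon of radius 1 tossed on squares of side 4."""
    return ((4 * n / math.pi) * math.sin(math.pi / (4 * n))
            - (n / (4 * math.pi)) * math.sin(math.pi / (2 * n))
            - 1 / 8)


def crossing_probability_numeric(n: int, steps: int = 20000) -> float:
    """Midpoint-rule value of (4n/pi) * integral_0^{pi/4n} (cos t - cos^2 t / 4) dt."""
    upper = math.pi / (4 * n)
    h = upper / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        total += math.cos(t) - 0.25 * math.cos(t) ** 2
    return (4 * n / math.pi) * total * h


if __name__ == "__main__":
    for n in (1, 2, 5, 50):
        print(n, crossing_probability(n), crossing_probability_numeric(n))
```

As n grows the polygon approaches a circle of radius 1, whose center must fall within 1 unit of a side of a square of side 4; that region has area 16 - 4 = 12, so the values should approach 12/16 = 3/4.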


Also solved by Slobodan Cuk (Yugoslavia), Jordi Dou (Spain), Thomas Hughes, M. S. Klam- 
kin, Harry Lass, and the proposer. 


A Condition on a Subset of a Group 
E 2168 [1969, 413]. Proposed by Rosta János, Central Research Institute for
Physics, Budapest, Hungary


Let G denote a group and K a subset of G. Prove the following theorem:
If for a given natural number $n \ge 2$, the relations $K^{-1} \subset K$ and $K^n \subset K$ hold,
then $K^{n-1}$ is a subgroup of G and $K^{n-2}$ is identical with some (e.g., left) coset of
$K^{n-1}$ in G. (Here $K^s$ denotes as usual the set of all products of the form
$k_1 k_2 \cdots k_s$ with $k_j \in K$, $j = 1, 2, \cdots, s$.)


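Solution by Douglas Lind, Stanford University. If $k = k_1 \cdots k_{n-1} \in K^{n-1}$,
then $k_j^{-1} \in K^{-1} \subset K$, so $k^{-1} = k_{n-1}^{-1} \cdots k_1^{-1} \in K^{n-1}$. Also, if $m = m_1 \cdots m_{n-1}
\in K^{n-1}$, then $k m_1 \in K^n \subset K$, so $km = (k m_1)(m_2 \cdots m_{n-1}) \in K K^{n-2} = K^{n-1}$. Hence
$K^{n-1}$ is a subgroup of G.

Let e be the identity of G. Since $K^{-1} \subset K$, $e \in K^2$, so $K^{r-2} \subset K^r$ for all $r \ge 2$.
Hence if $n = 2s$ is even, then $K^2 \subset K^4 \subset \cdots \subset K^{2s} \subset K$, which implies K is a
subgroup of G, and the result is immediate except in the trivially false case
$n = 2$. If $n = 2t+1$ is odd, then $K \subset K^3 \subset \cdots \subset K^{2t+1} \subset K$, so $K = K^3 = \cdots
= K^n$. Then for any $k \in K$, $kK^{n-1} \subset K^n = K^{n-2}$, while $K^{n-2} = k(k^{-1}K^{n-2}) \subset kK^{n-1}$,
so $K^{n-2} = kK^{n-1}$ is a coset of $K^{n-1}$.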
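
A small finite example may make the statement concrete. The sketch below is only an illustration under stated assumptions, not part of Lind's argument: it works in additive cyclic groups, and the helper names (`cyclic_group`, `set_power`, `check_theorem`) are hypothetical. It checks the hypotheses $K^{-1} \subset K$ and $K^n \subset K$ by brute force and then tests both conclusions for $n \ge 3$.

```python
def cyclic_group(m):
    """Operation, inverse and identity of the additive group of integers mod m."""
    return (lambda x, y: (x + y) % m), (lambda x: (-x) % m), 0


def set_power(K, s, op):
    """All products k_1 * ... * k_s with every k_j in K (s >= 1)."""
    result = set(K)
    for _ in range(s - 1):
        result = {op(x, k) for x in result for k in K}
    return result


def is_subgroup(H, op, inv, identity):
    return (identity in H
            and all(inv(h) in H for h in H)
            and all(op(x, y) in H for x in H for y in H))


def check_theorem(K, n, op, inv, identity):
    """For n >= 3: True when the hypotheses hold and both conclusions are verified."""
    K = set(K)
    if n < 3 or not K:
        raise ValueError("this sketch only handles n >= 3 and nonempty K")
    if not all(inv(k) in K for k in K):
        return False                      # K^{-1} is not contained in K
    if not set_power(K, n, op) <= K:
        return False                      # K^n is not contained in K
    H = set_power(K, n - 1, op)           # candidate subgroup K^{n-1}
    lower = set_power(K, n - 2, op)       # K^{n-2}
    k = next(iter(K))
    coset = {op(k, h) for h in H}         # the left coset k K^{n-1}
    return is_subgroup(H, op, inv, identity) and lower == coset


if __name__ == "__main__":
    op, inv, e = cyclic_group(6)
    print(check_theorem({1, 3, 5}, 3, op, inv, e))
```

With K the odd residues mod 6 and n = 3, $K^2$ is the subgroup of even residues and K itself is the coset $1 + K^2$, which is the odd case of the solution.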

Also solved by Anders Bager (Denmark), W. N. Bell, Orin Chein, J. P. Comiskey, Ted Cullen,
Stuart DeSousa, Neal Felsinger, W. F. Fox, Robert Gilmer, M. G. Greening (Australia), D. W.
Hadwin, Lee Hagglund, G. A. Heuer, H. S. Lieberman, Donna Martellotto & Kathy Shiple, J. V.
Michalowicz, J. H. Oppenheim, E. F. Schmeichel, D. A. Sibley, R. C. Singal (India), Judith
Soriano, Stephen Spindler, E. J. Taft, W. A. Thrash, Jr., Philip Trauber, and the proposer.

Sibley remarks that K" is normal in the subgroup of G generated by K. That the second 


statement of the problem is meaningless for n = 2 was pointed out by the proposer but
inadvertently omitted in the printing. Most of the solvers, however, noted the omission.


A Convergent Series of Arcs 


E 2169 [1969, 414]. Proposed by J. J. Hirstein, Illinois State University 


Let U be a unit circle. Let $C_1$ be a circle whose diameter is half the diameter
of U. Recursively, let $C_{n+1}$ be a circle whose diameter is half the diameter of $C_n$.
Put $C_1$ tangent to U on the inside (at $a_1$). Put $C_2$ tangent to $C_1$ on the outside
and tangent to U on the inside (at $a_2$). Continue in this manner, putting $C_n$
tangent to $C_{n-1}$ on the outside and tangent to U on the inside (at $a_n$). Let $s_i$ be
the arc length $a_i a_{i+1}$. Does $\sum_{i=1}^{\infty} s_i$ converge? If so, what is the limit?


Solution by Simeon Reich, Israel Institute of Technology, Haifa. Let O be the
center of U, and let $O_n$ be the center of the circle $C_n$. Applying the law of
cosines to the triangle $O_n O O_{n+1}$, we obtain

$$s_n = \angle O_n O O_{n+1} = \arccos\left(1 - \frac{2}{(2^n - 1)(2^{n+1} - 1)}\right) < \arccos\left(1 - \frac{1}{2^{2(n-1)}}\right) = t_n.$$

By l'Hospital's rule, $\lim_{n\to\infty} t_{n+1}/t_n = \tfrac{1}{2}$. By the ratio test, $\sum_{n=1}^{\infty} t_n$ converges.
Since $s_n > 0$, the series in question also converges. Its sum is 2.04900825
approximately.

Also solved by Anders Bager (Denmark), Michael Goldberg, D. A. Herrero, G. A. Heuer, and 
Thomas Hughes. 


Goldberg gives the sum as approximately $2\pi/3$; Hughes and this editor agree upon 2.0490114
with error less than $10^{-7}$.
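
The competing numerical values can be compared by direct summation. The sketch below assumes the law-of-cosines expression $s_n = \arccos\bigl(1 - 2/((2^n - 1)(2^{n+1} - 1))\bigr)$, which follows from $OO_n = 1 - 2^{-n}$ and $O_nO_{n+1} = 2^{-n} + 2^{-(n+1)}$. To avoid cancellation near 1 it uses the equivalent half-angle form $\arccos(1-u) = 2\arcsin\sqrt{u/2}$. The function names and the number of terms are illustrative choices.

```python
import math


def arc(n: int) -> float:
    """Arc length s_n on the unit circle between the tangency points a_n and a_{n+1}."""
    d = (2 ** n - 1) * (2 ** (n + 1) - 1)
    # arccos(1 - 2/d) rewritten as 2*asin(1/sqrt(d)) for numerical stability.
    return 2.0 * math.asin(1.0 / math.sqrt(d))


def arc_sum(terms: int = 200) -> float:
    """Partial sum s_1 + ... + s_terms; the terms decay roughly like 2**(-n)."""
    return math.fsum(arc(n) for n in range(1, terms + 1))


if __name__ == "__main__":
    print(arc_sum())
```

Because the terms shrink geometrically, a few dozen of them already fix the sum to double precision.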
A Divergent Series


E 2170 [1969, 414]. Proposed by M. Slater, University of Bristol, England 
Let $\{a_n\}$ be an increasing series of positive reals. Suppose $\lim_{n\to\infty} a_n = \infty$.
Show that $\sum_{n=1}^{\infty} \cos^{-1}(a_n/a_{n+1}) = \infty$.


I. Solution by E. F. Schmeichel, The College of Wooster. If $0 < x_1 < x_2 < x_3$, it is
readily verified that $\cos^{-1}(x_1/x_3) \le \cos^{-1}(x_1/x_2) + \cos^{-1}(x_2/x_3)$. (Simply take the
cosine of both sides.) It follows easily by induction that if $0 < x_1 < x_2 < \cdots
< x_n$, then

$$\cos^{-1}\left(\frac{x_1}{x_n}\right) \le \cos^{-1}\left(\frac{x_1}{x_2}\right) + \cos^{-1}\left(\frac{x_2}{x_3}\right) + \cdots + \cos^{-1}\left(\frac{x_{n-1}}{x_n}\right). \tag{1}$$

Set $k_0 = 1$. Let $k_1$ be the least positive integer such that $a_{k_1} \ge 2a_{k_0}$, and in
general, let $k_j$ be the least positive integer such that $a_{k_j} \ge 2a_{k_{j-1}}$. Note that
$a_{k_{j-1}}/a_{k_j} \le \tfrac{1}{2}$.

If $S_m$ denotes the mth partial sum of the given series, then by (1) we have

$$S_{k_n - 1} = \sum_{i=1}^{k_n - 1} \cos^{-1}\left(\frac{a_i}{a_{i+1}}\right) \ge \sum_{j=1}^{n} \cos^{-1}\left(\frac{a_{k_{j-1}}}{a_{k_j}}\right) \ge n\cos^{-1}\left(\frac{1}{2}\right) > n.$$

So the partial sums of the series are unbounded, implying divergence as required.


II. Solution by D. A. Zave, UNIVAC, Roseville, Minn. Since the second
derivative of $\cos^{-1}$ is negative on (0, 1), we observe that $\cos^{-1}$ is convex upward
on [0, 1]. It follows that

$$\cos^{-1}(x) \ge \tfrac{1}{2}\pi(1 - x) \ge 1 - x, \qquad x \in [0, 1].$$

If m and n are positive integers with $m \le n$, then

$$\sum_{k=m}^{n} \cos^{-1}\left(\frac{a_k}{a_{k+1}}\right) \ge \sum_{k=m}^{n}\left(1 - \frac{a_k}{a_{k+1}}\right) \ge \sum_{k=m}^{n} \frac{a_{k+1} - a_k}{a_{n+1}} = 1 - \frac{a_m}{a_{n+1}}.$$

Letting $n \to \infty$, we observe that the tail of the series is never less than 1. It
follows that the series does not converge.


Also solved by Anders Bager (Denmark), D. E. Frohardt, Leon Gerber, M. S. Klamkin, Douglas
Lind, Dan Marcus, M. D. Mavinkurve (India), Jernej Polajnar (Yugoslavia), Simeon
Reich (Israel), R. A. Struble, J. E. Wilkins, Jr., and the proposer.

Note. Solution II shows that $\cos^{-1}$ may be replaced by any function f such that $f(x) \ge 1 - x$ on
(a, 1], where $0 \le a < 1$.


A Minimum Partition Problem 


E 2171 [1969, 414]. Proposed by Kenneth Jackman, Federal Electric Corpora- 
tion, Fairbanks, Alaska 


Given N, what is the smallest W for which $B_1 + B_2 + \cdots + B_c = W$, and
$B_1 B_2 \cdots B_c \ge N$, with all $B_i$ positive integers?
Note. The statement of the problem is ambiguous since it is not clear whether
the integer c is fixed. Solutions were submitted for both cases.


I. (c fixed.) Solution by David Zeitlin, Minneapolis, Minnesota. From the
arithmetic-geometric inequality, we have

$$\frac{W}{c} = \frac{B_1 + B_2 + \cdots + B_c}{c} \ge \sqrt[c]{B_1 B_2 \cdots B_c} \ge \sqrt[c]{N}.$$

Thus, $W = c\sqrt[c]{N}$, if integral; otherwise, $W = [c\sqrt[c]{N}] + 1$.


II. (c not fixed.) Solution by M. S. Klamkin, Ford Scientific Laboratory. The
dual of this problem is to find the largest number which can be obtained as the
product of positive integers whose sum is S. This problem was proposed by
Leo Moser and solved by L. Carlitz [Problem 125, Pi Mu Epsilon Journal,
Fall, 1961]. If P(S) denotes the maximum product, it was shown that

$$P(S) = \begin{cases} 3^m & \text{if } S = 3m, \\ 4 \cdot 3^{m-1} & \text{if } S = 3m + 1, \\ 2 \cdot 3^m & \text{if } S = 3m + 2. \end{cases}$$

Here S is partitioned into as many 3's as possible.
It now follows immediately that if $P(S) + 1 \le N \le P(S + 1)$, then $W_{\min} = S + 1$
(the corresponding partition is not unique in general).
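
The quoted formula for P(S) and the resulting rule for $W_{\min}$ can be checked by brute force. The sketch below is illustrative only: the function names are hypothetical, P(1) = 1 is treated as a separate case on the assumption that the formula is meant for $S \ge 2$, and the dynamic program uses the recurrence "best product for s = max over a first part j of j times the best product for s - j".

```python
def max_product_closed_form(S: int) -> int:
    """P(S) from the quoted formula (S = 1 handled separately as an assumption)."""
    if S == 1:
        return 1
    m, r = divmod(S, 3)
    if r == 0:
        return 3 ** m
    if r == 1:
        return 4 * 3 ** (m - 1)
    return 2 * 3 ** m


def max_product_table(limit: int) -> list:
    """best[s] = largest product of positive integers with sum s (best[0] = 1)."""
    best = [1] * (limit + 1)
    for s in range(1, limit + 1):
        best[s] = max(j * best[s - j] for j in range(1, s + 1))
    return best


def w_min(N: int) -> int:
    """Smallest sum W whose best product reaches N (number of parts not fixed)."""
    S = 1
    while max_product_closed_form(S) < N:
        S += 1
    return S


if __name__ == "__main__":
    table = max_product_table(20)
    print([(s, table[s], max_product_closed_form(s)) for s in range(1, 21)])
    print(w_min(10))
```

For example, P(6) = 9 and P(7) = 12, so any N with $10 \le N \le 12$ gives $W_{\min} = 7$, in agreement with the rule stated above.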


Also solved by M. T. Bird, Slobodan Cuk & Jernej Polajnar (Yugoslavia), Michael Goldberg, 
M. G. Greening, (Australia), G. A. Heuer, T. F. Hughes, Douglas Lind, Henrik Meyer (Denmark), 
Norman Miller, E. F. Schmeichel, C. S. Venkataraman (India), and the proposer. 
All solutions of Advanced Problems should be sent to J. Barlaz, Rutgers—The State
University, New Brunswick, NJ 08903. To facilitate their consideration, solutions of Advanced
Problems in this issue should be typed (with double spacing) on separate signed sheets and
should be mailed before April 30, 1970. Contributors (in the United States) who desire
acknowledgement of receipt of their solutions are asked to enclose self-addressed stamped
postcards.


5707. Proposed by W. A. Vasconcelos, Rutgers—The State University 


Let R be an integral domain and G a finite group. Assume the characteristic
of R does not divide |G|. Prove that each R-derivation of R[G] is inner.


5708. Proposed by C. W. Avery, San Jose State College 


Let K be a finite extension of the field k, complete in a non-Archimedean
valuation. Let $\bar{K}$ and $\bar{k}$ denote the residue class fields. Problem 16, p. 129 of
P. J. McCarthy, Algebraic Extensions of Fields asserts that K is separable over
k if $\bar{K}$ is separable over $\bar{k}$. Disprove.
5709. Proposed by W. A. J. Luxemburg, California Institute of Technology


For all x >0, determine 


lim i fF expl exp] —(wp-t -°- + 4m) [ders - ++ dXn, 
n> ar Dy (2) 


<x 


5710. Proposed by R. E. Shafer, Lawrence Radiation Laboratory, Livermore, 
California 


where 


+t oan 


Dal) = {en Lee, = = 


It is well known that 


[R — 2Rrcosé+ 7 


=0 


la < | R| . Re) > —1, Re(v) #0. Find the set of functions F,(r, R) independent 
of @ such that 


[R° — 2Rrcosé+ r'| = DF, (r, R)C, (cos 0), Re(u) > — 1. 


5711. Proposed by Dan Marcus, York University, Toronto 


Let $A = (a_{m,n})_{m,n=1}^{\infty}$ be an infinite matrix of nonzero integers such that for
each m, the set of prime divisors of numbers in the mth row is finite. Prove that
the system of congruences $x_{m+n} \equiv a_{m,n} \pmod{x_m}$ is solvable in primes.


5712. Proposed by Dan Marcus, York University, Toronto 


Is it possible to topologize the integers in such a way that the connected sets 
are the sets of consecutive integers? Generalize to the lattice points of m-space. 


5713. Proposed by D. P. Geller, University of Michigan 
For any graph G with p points, q lines, and chromatic number $\chi$, show

$$\chi \ge p^2/(p^2 - 2q).$$


SOLUTIONS OF ADVANCED PROBLEMS 


Multiplicative Solutions of Some Number Theoretic Equations 


5653 [1969, 200]. Proposed by Richard Stanley, Harvard University 


Find in each case the real-valued multiplicative number-theoretic function f
which satisfies the stated condition. $\mu$ is the Möbius function, $\phi$ is the Euler
totient function, d(n) is the number of divisors of n, and $\sigma(n)$ is the sum of the
divisors of n.


(A) $\sum_{d \mid n} f(d) = \mu(n) f(n)$.        (C) $\sum_{e \mid n} \phi(e) f(n/e) = d(n) f(n)$.
(B) $|f(n)| = \sum_{d \mid n} f(d)$.            (D) $\sum_{d \mid n}$ [summand illegible in source] $= \sigma(n) f(n)$.


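Solution by M. S. Demos, Villanova University. Since f is multiplicative, it is
enough to find $f(p^\alpha)$ where p is a prime. In all cases f(1) is arbitrary, except as
noted in case (B). If we set in (A) and (B) $n = 1, p, p^2, \cdots, p^\alpha$ in succession,
we obtain easily:


(A) $f(1) = c$, $f(p) = f(p^2) = -c/2$, $f(p^\alpha) = 0$ for $\alpha \ge 3$.
(B) $f(1) = c \ge 0$, $f(p^\alpha) = -c/2^\alpha$ for $\alpha \ge 1$.
(C) Setting $n = p^\alpha$, the equation becomes

$$(\alpha + 1) f(p^\alpha) = \sum_{j=0}^{\alpha} \phi(p^j) f(p^{\alpha-j}) = f(p^\alpha) + (p-1) f(p^{\alpha-1}) + \sum_{j=2}^{\alpha} \phi(p^j) f(p^{\alpha-j}).$$

We replace j by j+1, and notice that $\phi(p^{j+1}) = p\phi(p^j)$ for $j \ge 1$, thus obtaining

$$\alpha f(p^\alpha) = (p-1) f(p^{\alpha-1}) + p\sum_{j=1}^{\alpha-1} \phi(p^j) f(p^{\alpha-1-j}) = (p-1) f(p^{\alpha-1}) + p\{\alpha f(p^{\alpha-1}) - f(p^{\alpha-1})\}.$$

Then $f(p^\alpha) = (\alpha p - 1) f(p^{\alpha-1})/\alpha$, and for $\alpha \ge 1$

$$f(1) = c, \qquad f(p^\alpha) = c\,\frac{(p-1)(2p-1)\cdots(\alpha p-1)}{\alpha!}.$$


(D) Set $n = p^\alpha$. Then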


p*f( pe) _ po} — | 
p—-i p—-1 
whence [f(p2) = — pf(pt"1)/(p*—-1). Finally for a2 1, 
(— 1)" | 
(p—1)(p? — 1) +++ (pe? -— 21) 


f(e*) _ f(£9, 


fl) = 6, f(p) = 


Also solved by T. M. Apostol, Anders Bager (Denmark), Joel Berman, W. J. Blundon, L. 
Carlitz, M. D. Chowdhury, Josef Dane’ (Czechoslovakia), K. J. Davis, Ray Glenn, M. G. Greening 
(Australia), Emil Grosswald, O. P. Lossers (Netherlands), E. J. F. Primrose (England), Simeon 
Reich (Israel), F. G. Schmitt, Jr., A. P. Shah (India), D. A. Smith, Al Somayajulu, Philip Trauber, 
E., W. Trost (Switzerland), C. S. Venkataraman (India), and the proposer. 


The proposer establishes the solution for all parts of the problem by obtaining generating 
functions similar to the methods used in solving 5293 [1966, 555] and 5446 [1967, 1274]. 
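
The prime-power values found in parts (A), (B), and (C) can be checked by brute force for small n. The sketch below is an illustration under stated assumptions: it takes c = 1 so that f(1) = 1, builds f multiplicatively from its prime-power values, and uses exact rational arithmetic; all function names are hypothetical. Part (D) is not covered.

```python
from fractions import Fraction
from math import gcd


def prime_power_factors(n):
    """Return {prime: exponent} for n >= 1 by trial division."""
    factors, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]


def mobius(n):
    fac = prime_power_factors(n)
    return 0 if any(e > 1 for e in fac.values()) else (-1) ** len(fac)


def totient(n):
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)


def multiplicative(prime_power_value):
    """Build f with f(1) = 1 from its values on prime powers."""
    def f(n):
        value = Fraction(1)
        for p, a in prime_power_factors(n).items():
            value *= prime_power_value(p, a)
        return value
    return f


def value_a(p, a):
    return Fraction(-1, 2) if a <= 2 else Fraction(0)


def value_b(p, a):
    return Fraction(-1, 2 ** a)


def value_c(p, a):
    value = Fraction(1)
    for j in range(1, a + 1):
        value *= Fraction(j * p - 1, j)   # (p-1)(2p-1)...(ap-1) / a!
    return value


f_a, f_b, f_c = multiplicative(value_a), multiplicative(value_b), multiplicative(value_c)


def holds_a(n):
    return sum(f_a(d) for d in divisors(n)) == mobius(n) * f_a(n)


def holds_b(n):
    return abs(f_b(n)) == sum(f_b(d) for d in divisors(n))


def holds_c(n):
    lhs = sum(totient(e) * f_c(n // e) for e in divisors(n))
    return lhs == len(divisors(n)) * f_c(n)


if __name__ == "__main__":
    print(all(holds_a(n) and holds_b(n) and holds_c(n) for n in range(1, 201)))
```

Checking composite n as well as prime powers exercises the multiplicativity of both sides of each identity, not only the prime-power recurrences.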
Application of a Lemma of Gauss in a Sum of Sines


5656 [1969, 200]. Proposed by P. A. Catlin, Carnegie-Mellon University 


Let a and n be positive integers such that 2n+1 is prime, and let the
brackets denote the greatest integer function. When is $\sum_{j=1}^{n} [\sin(2\pi aj/(2n+1))]$
an even integer?


Solution by Joel Berman, University of Washington. Put $S = \sin(2\pi aj/(2n+1))$.
If a and 2n+1 are not relatively prime, then 2n+1 divides a, S = 0, and
$\sum [S] = 0$, which is even.

If (a, 2n+1) = 1, then [S] = 0 if $aj \equiv m \pmod{2n+1}$ where $0 \le m \le n$; and
[S] = -1 if $aj \equiv m \pmod{2n+1}$ where $n < m < 2n+1$. Hence $\sum [S]$ is even if
and only if the number $\nu$ of values of j, $1 \le j \le n$, for which aj is congruent to
numbers greater than n and less than 2n+1 is even. By a lemma of Gauss (cf.
Niven and Zuckerman, An Introduction to the Theory of Numbers, p. 70), it
follows that the Legendre symbol $\left(\frac{a}{2n+1}\right) = (-1)^\nu$, so that $\nu$ is even if and only if a is a quadratic
residue modulo 2n+1.
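
The conclusion can be tested for small primes. The sketch below evaluates the sum through the residues of aj modulo 2n+1, as in the solution, rather than through floating-point sines (which can give a tiny negative value where the sine is exactly 0). It then compares the parity with Euler's criterion, used here as an independent test for quadratic residues; that choice and the function names are assumptions of the sketch, not part of the published solution.

```python
def floor_sine_sum(a: int, n: int) -> int:
    """Sum over j = 1..n of [sin(2*pi*a*j/(2n+1))], computed from residues of a*j."""
    p = 2 * n + 1
    total = 0
    for j in range(1, n + 1):
        m = (a * j) % p
        if m > n:
            total -= 1   # angle lies in (pi, 2*pi): sine is negative, integer part is -1
        # for 0 <= m <= n the sine lies in [0, 1), so its integer part is 0
    return total


def is_quadratic_residue(a: int, p: int) -> bool:
    """Euler's criterion for an odd prime p and a not divisible by p."""
    return pow(a, (p - 1) // 2, p) == 1


def sum_is_even(a: int, n: int) -> bool:
    return floor_sine_sum(a, n) % 2 == 0


if __name__ == "__main__":
    n = 5                      # 2n + 1 = 11 is prime
    print([a for a in range(1, 11) if sum_is_even(a, n)])
```

The printed list can be compared with the quadratic residues modulo 11 obtained by squaring 1 through 5.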


Also solved by Merrill Barnebey, Robert Breusch, Arthur Gittleman, M. G. Greening (Aus- 
tralia), Henrik Meyer (Denmark), A. P. Shah & C. G. Khatri (India), Joel Spencer, and the pro- 
poser. 
Introduction to Complex Analysis. By Rolf Nevanlinna and V. Paatero. Translated
by T. Kövari and G. S. Goodman. Addison-Wesley, Reading, Mass.,
1969. ix+348 pp. $11.50. (Telegraphic Review, August/September 1969.)


A book on complex analysis having Rolf Nevanlinna as one of the authors 
has to be treated with respect if not veneration. It is one of the masters of the 
craft that speaks, one to whom the most spectacular advance in complex func- 
tion theory of this century is due. What is said is said with great precision, and 
it is well said. One could only wish that more were said. 
This is a book that should be at the beck and call of every mathematician
who ventures to give a course in the theory of functions of a complex variable. 
It is full of novel points of view, alternate modes of attack, and 320 excellent 
problems. A knowledgeable teacher with a good class can use it as a text, and 
if he knows the why and wherefore of the various steps, he has here the basis for 
a very stimulating course. But he has to be prepared to provide the missing 
prose of a descriptive and motivational nature. A somewhat sophisticated stu- 
dent may use it for self-study. He must not be repelled by the terseness of the 
style, and he should be willing to work all the problems. This will give him a 
superb grasp of the subject and the satisfaction of having mastered a rigorous 
up-to-date introduction to analytic function theory.

The book is singularly autonomous (a reference to a proof of the Jordan 
curve theorem is about the only concession). Historical information is doled out 
sparingly; only in the chapters on the Riemann zeta function and conformal 
mapping is the allowance more liberal. There is no reference to Rolf Nevanlinna 
in the text; for American readers this is a deplorable omission. 

The reviewer would have liked a fuller treatment of the graph theoretical 
description of Riemann surfaces in Chapters 6 and 12. This is one of the many 
tools used by the “Finnish school” in the profound investigations of the value 
distribution problem for meromorphic functions, and here is one of the places 
where more descriptive prose would fill a need. 

Section 10.5 has the title “Laplace Integrals.” Granted that Laplace may 
have studied them; surely they are not what most mathematicians would under- 
stand by a Laplace integral. A word of caution to the reader would have been 
in order. 

The elegant introduction of complex numbers in Chapter 1 as a two dimen- 
sional division algebra appears to be due to Weierstrass of all people. The use 
of homotopy theory in complex integration is welcome. So is the extensive use 
of harmonic functions and harmonic measure and many other innovations. 

EINAR HILLE, University of New Mexico 


Calculus, Volume 1 (One Variable Calculus, with an Introduction to Linear 
Algebra), 2nd ed. By Tom M. Apostol. Blaisdell, Waltham, Mass., 1967.


This classic book, one of the first in the new wave of calculus texts which
broke several years ago, has been improved in its new edition, with “... smaller
chapters, ... mean value theorem and routine applications ... introduced
... earlier, ... new illustrative examples, ... (expanded) applications to
physics and engineering, and many new and easier exercises.” (Emphasis mine.)
It is now easier to use in the classroom and no less stimulating. A prospective 
user should consult both editions and the review of the first (this MONTHLY, 
69 (1962) pp. 449-451) for information supplementing my comments, which are 
based primarily on my unorthodox but successful use of the book as a text for 
a second year course for freshmen and sophomores with varied backgrounds. 
I taught about ¢ of the course with no text. 
I began with polynomial approximations (Chapter 7). This discussion of 
Taylor polynomials and error approximation precedes the introduction of se- 
quences and series (Chapters 11 and 12), a splendid pedagogical device which 
stresses the importance of the remainder and the techniques required to esti- 
mate it. Similar careful preparation for the later introduction of sophisticated 
subjects is evident in the treatment of differential equations (Chapter 8). The 
stress on existence and uniqueness theorems in less than full generality and 
the numerous examples were both a good review and a base to which I could 
easily anchor the more advanced material appropriate to a second year course. 

This edition contains a thorough and useful introduction to vector algebra 
and the study of curves as vector valued functions (Chapters 12-16) which was 
more than sufficient to prepare the students for my subsequent introduction of 
the Frechet derivative for scalar valued functions of a vector and a geometric 
discussion of line integrals and exact vector fields. 

I could comfortably assign sections to students to read. There are no 
formulae for cranking out answers to the problems. The arguments are correct, 
clear, and analytical, though they rely perhaps a little too heavily on carefully 
designed notation and previously proved theorems. Students with no “mathe- 
matical maturity” might find them very hard going. I felt free—and compelled 
—to devote my lectures to the intuitive and geometric content of the material. 

I thoroughly enjoyed using this book, although I did not “follow it.” (Nor 
can I imagine one I would follow.) I recommend it for an honors class of fresh- 
men or for students with some familiarity with college mathematics, for exam- 
ple, advanced placement students or high school teachers at a summer institute. 
All instructors should own a copy to use as a reference work. Let us profit from 
the author’s labors. 

E. D. BOLKER, Bryn Mawr College 


An Introduction to Real Analysis. By Burton Randol. Harcourt, Brace & World, 
New York, 1969. xi+112 pp. $6.50. (Telegraphic Review, Aug./Sept. 1969.) 


A popular approach to analysis involves a two semester course in elementary 
calculus followed by a two semester course in advanced calculus or real analysis 
with considerable attention given to modern aspects of topology. This approach 
has the mathematics major as its target. The non-math major usually takes one 
or two applied math courses to complete his mathematics minor. 

The present text could be used effectively as a basis for a one-semester course 
following elementary calculus. It is written with a clear and concise style. The 
topics considered are few in number but are prudently selected so as to allow 
maximum flexibility in designing a course which is student oriented. Its theorem- 
proof-remark format is particularly well-suited for such a course. The text is 
designed to continue and amplify the student's work in elementary calculus. 
This text will occupy an important place in the analysis curriculum. 

EDWARD PERESSINI, College of Great Falls, Montana 


SOME ASPECTS OF FOURIER SERIES 


CASPER GOFFMAN, Purdue University 
DANIEL WATERMAN, Syracuse University 


A continuous function whose Fourier series diverges may be “corrected” so 
that the Fourier series of the new function converges uniformly, either by an 
appropriate change of variable or by a modification of the given function on 
a suitable set of arbitrarily small measure. 

Our purpose is to give an exposition of some mathematics which is more or 
less related to these phenomena. Certain of the matters we discuss seem to be 
ready for a more thorough treatment than they have as yet been given in the 
literature. 


1. Preliminaries. We first remind the reader that a continuous function may 
have a Fourier series which diverges at some points. Let $f$ be a continuous 
function of period $2\pi$. Its Fourier series is given by 

$$\frac{a_0}{2} + \sum_{n=1}^{\infty} a_n \cos nx + b_n \sin nx,$$

where 

$$a_n = \frac{1}{\pi}\int_{-\pi}^{\pi} f(t)\cos nt\,dt, \qquad b_n = \frac{1}{\pi}\int_{-\pi}^{\pi} f(t)\sin nt\,dt, \qquad n = 0, 1, 2, \ldots.$$

Then, if 

$$s_n(x) = \frac{a_0}{2} + \sum_{k=1}^{n} (a_k \cos kx + b_k \sin kx), \qquad n = 0, 1, 2, \ldots,$$

each $s_n$ has the form $s_n(x) = \int_{-\pi}^{\pi} D_n(x - t) f(t)\,dt$, where the functions 
$D_n$, $n = 0, 1, 2, \ldots$, are called the Dirichlet kernels. The partial sums $s_n$ of 
the Fourier series of $f$ are thus the convolutions $s_n = D_n * f$. 
If, instead of the partial sums $s_n$, we consider their averages, 

$$\sigma_n = \frac{s_0 + s_1 + \cdots + s_n}{n + 1},$$


Casper Goffman received his PhD at Ohio State Univ. under Henry Blumberg. He has held 
positions at Westinghouse, Univ. Kentucky, Univ. Oklahoma, Wayne, and Purdue, and has spent 
leaves at the Inst. Adv. Study and Westfield College, London. He has published extensively in real 
analysis and authored several books: Real Functions, First Course in Functional Analysis (with 
G. Pedrick), Calculus of Several Variables, and Introduction to Real Analysis. 

Daniel Waterman wrote his Chicago thesis under Antoni Zygmund and has continued re- 
search in Fourier analysis and related fields. He has held positions with the Cowles Commission 
we obtain 

$$\sigma_n = \frac{D_0 + D_1 + \cdots + D_n}{n + 1} * f = K_n * f.$$


The functions $K_n$ are called the Fejér kernels. They are positive, whereas the 
$D_n$ are oscillatory. For each $n = 0, 1, 2, \ldots$, 

$$\int_{-\pi}^{\pi} D_n(t)\,dt = \int_{-\pi}^{\pi} K_n(t)\,dt = 1.$$

However, $\int_{-\pi}^{\pi} |D_n(t)|\,dt$ goes to infinity as $n$ increases. Since the $K_n$ are positive, 
we have $\int_{-\pi}^{\pi} |K_n(t)|\,dt = 1$. Moreover, it is easy to see that $K_n(t)$ converges 
uniformly to zero on $|t| > \delta$, for each $\delta > 0$. This implies that $\sigma_n$ converges uniformly 
to $f$, for each continuous $f$. 

We indicate why the fact that $L_n = \int_{-\pi}^{\pi} |D_n(t)|\,dt$ goes to infinity as $n$ 
increases implies that there is a continuous $f$ whose Fourier series diverges. We 
work at $x = 0$. Then, let 

$$\phi_n(f) = s_n(0) = D_n * f(0).$$

For each $n$, $\phi_n$ is a linear functional on the space of periodic continuous functions 
of period $2\pi$. A simple computation shows that $L_n$ is the norm of $\phi_n$. By the 
Banach-Steinhaus theorem, the unboundedness of $\{L_n\}$ implies that there is an 
$f$ for which $\{\phi_n(f)\}$ is unbounded. The Fourier series of $f$ is accordingly 
divergent at $x = 0$. The numbers $L_n$ are called Lebesgue constants. Indeed, the above 
argument of elementary functional analysis was introduced by Lebesgue [L] in 
1908. 
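
The growth of the Lebesgue constants, and the positivity of the Fejér kernels, can be observed numerically. The Python sketch below assumes the normalization $D_n(t) = \sin((n + \tfrac12)t) / (2\pi \sin(t/2))$, which is consistent with the statement that the integral of $D_n$ over $[-\pi, \pi]$ equals 1. The function names and the midpoint quadrature are illustrative choices and not part of the exposition above.

```python
import math

def dirichlet(n, t):
    """Dirichlet kernel, normalized so that its integral over [-pi, pi] is 1."""
    s = math.sin(t / 2.0)
    if abs(s) < 1e-12:
        # Limiting value of the kernel at t = 0.
        return (2 * n + 1) / (2.0 * math.pi)
    return math.sin((n + 0.5) * t) / (2.0 * math.pi * s)

def fejer(n, t):
    """Fejer kernel K_n = (D_0 + ... + D_n) / (n + 1)."""
    return sum(dirichlet(k, t) for k in range(n + 1)) / (n + 1)

def integrate(g, a, b, steps=20000):
    """Composite midpoint rule for the integral of g over [a, b]."""
    h = (b - a) / steps
    return h * sum(g(a + (i + 0.5) * h) for i in range(steps))

def lebesgue_constant(n):
    """Numerical approximation of L_n, the integral of |D_n| over [-pi, pi]."""
    return integrate(lambda t: abs(dirichlet(n, t)), -math.pi, math.pi)
```

Evaluating `lebesgue_constant(n)` for increasing `n` shows a slow, logarithmic growth, while `fejer(n, t)` remains nonnegative for every `t`.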


2. A theorem of Pál and Bohr. Although there are continuous functions whose 
Fourier series do not converge, the situation may be amended by a simple 
change of variable as shown by the following result of Pál [P] and Bohr [B]: 


THEOREM 1. If $f$ is a continuous function on $[-\pi, \pi]$, with $f(-\pi) = f(\pi)$, there 
is a homeomorphism $g$ of $[-\pi, \pi]$ with itself such that the Fourier series of $f \circ g$ 
converges uniformly. 


The only known proof of this fact is accomplished by complex variable 
methods. We indicate the salient features of the proof, which involve a quite 
remarkable theorem of Fejér. The Riemann mapping theorem asserts that if D 
is a simply connected open set, not the whole plane, there is a conformal one- 
one mapping f of the unit disk onto D. Moreover, if D is a bounded set in the 
plane and its boundary is a simple Jordan curve, Carathéodory has shown that 
f may be extended to a homeomorphism between the closure of the unit disk 
and the closure of $D$. The theorem of Fejér asserts that if $\sum_{n=0}^{\infty} a_n z^n$ is the 
power series expansion of $f(z)$ in $|z| < 1$, then this series converges uniformly to 
the extension of $f$ on $|z| \le 1$. The proof of this theorem of Fejér is elementary. 
It involves the simple fact that the area of $D$ is given by $\pi \sum_{k=1}^{\infty} k|a_k|^2$ together 


with the following Tauberian theorem (which is also easy to prove): If the 
series $\sum a_k e^{ikt}$ is uniformly (C, 1) summable and $\sum k|a_k|^2 < \infty$, then the series 
converges uniformly. 

The theorem of Pál and Bohr follows from that of Fejér. By adding a 
continuous periodic function of bounded variation, the given function may be 
changed into a function $f$ for which $f(-\pi) = f(\pi) = f(\xi)$ for exactly one 
$\xi \in (-\pi, \pi)$, unless $f$ is identically a constant. It suffices to prove the theorem 
for such an $f$. Now let $h$ be continuous and periodic, increasing on $(-\pi, \xi)$ and 
decreasing on $(\xi, \pi)$. The mapping of $[-\pi, \pi]$ into the plane given by 
$z = h(t) + i f(t)$ defines a simple closed curve. Let $w = F(z)$ map $|z| < 1$ onto the domain 
bounded by this curve. $F$ extends to a homeomorphism on $|z| \le 1$. Thus we have 
the function $F(e^{it})$ mapping $[-\pi, \pi]$ one-one onto the curve $z = h(t) + i f(t)$. 
We may select $F$ so that $F(-1) = z(-\pi)$. Then if we let 

$$g(t) = z^{-1} \circ F(e^{it}),$$

we have the theorem of Pál and Bohr. 

It would be of interest to obtain a real variables proof of this theorem. In 
particular, extensions to other kinds of series might then become possible. 
Another question concerns the same phenomenon for complex valued functions. 
This amounts to the existence of a single homeomorphism $g$ for which $f_1 \circ g$ and 
$f_2 \circ g$ have uniformly convergent Fourier series, for any two continuous real 
functions with $f_1(-\pi) = f_1(\pi)$ and $f_2(-\pi) = f_2(\pi)$. Of course, this question may 
be asked for $n$ functions. 

The recent theorem of Carleson, which says that if $f \in L^2$ then its Fourier 
series converges almost everywhere, implies that, for every finite-valued 
measurable $f$, there is a homeomorphism $g$ such that the Fourier series of $f \circ g$ 
converges almost everywhere, since $g$ may be taken so that $f \circ g \in L^2$. There should, 
however, be a direct proof of this fact. 


3. The Salem criterion. Related to the above problem is that of the existence 
of continuous functions whose Fourier series converge uniformly after every 
change of variable. In this context, a criterion of Salem [S] on the uniform 
convergence of Fourier series is useful. 


THEOREM 2. If $f$ is continuous, $f(-\pi) = f(\pi)$, and if the sum 

$$T_n(x) = \frac{f(x) - f(x + \pi/n)}{1} + \frac{f(x + 2\pi/n) - f(x + 3\pi/n)}{2} + \cdots + \frac{f(x + (n-1)\pi/n) - f(x + \pi)}{\tfrac12 (n + 1)}$$

($n$ odd) as well as the corresponding sum obtained by substituting $-\pi$ for $\pi$, 
converge uniformly to zero as $n$ increases, the Fourier series of $f$ converges uniformly. 

The proof of this theorem involves rather tedious but quite natural 
modifications of the condition that 

$$\int_0^{\pi} \{f(x + t) + f(x - t) - 2f(x)\}\,\frac{\sin nt}{t}\,dt$$

converges uniformly to zero if and only if the Fourier series of $f$ converges 
uniformly. 

This criterion has interesting consequences. First, if $\omega$ is the modulus of 
continuity of $f$ there is an absolute constant $C$ such that 

$$|T_n(x)| \le C \log n \cdot \omega(\pi/n).$$

It follows that if $\omega(\pi/n) = o(1/\log n)$, then the Fourier series of $f$ converges 
uniformly. This is the Dini-Lipschitz test. 
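
To see the size of the Salem sum in a concrete case, the Python sketch below evaluates it for odd $n$. It assumes that the sum reads $T_n(x) = \sum_{k=1}^{(n+1)/2} [f(x + (2k-2)\pi/n) - f(x + (2k-1)\pi/n)]/k$; this indexing is a reading of the display in Theorem 2, and the function name `salem_sum` is hypothetical.

```python
import math

def salem_sum(f, x, n, sign=1.0):
    """Salem sum T_n(x) for odd n (assumed indexing, see the lead-in).

    sign=-1.0 gives the companion sum obtained by replacing pi with -pi.
    """
    if n % 2 == 0:
        raise ValueError("n must be odd")
    step = sign * math.pi / n
    total = 0.0
    for k in range(1, (n + 1) // 2 + 1):
        # k-th term: a difference of f over an interval of length pi/n, weighted by 1/k.
        total += (f(x + (2 * k - 2) * step) - f(x + (2 * k - 1) * step)) / k
    return total
```

For a function with Lipschitz constant 1, such as $|\sin x|$, each difference is at most $\pi/n$ in absolute value, so the sum is bounded by $(\pi/n)(1 + \log n)$ and tends to zero, in line with the Dini-Lipschitz test.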

As a second application of the Salem criterion we obtain the standard 
Jordan condition. Clearly, 

$$|T_n(x)| \le \omega\!\left(\frac{\pi}{n}\right)\left(1 + \frac12 + \cdots + \frac1m\right) + \frac{V}{m + 1},$$

where $\omega$ is the modulus of continuity of $f$, $V$ is the variation of $f$, and $m$ is any 
integer smaller than $\tfrac12 (n + 1)$. If we choose $m$, as we may, so that $m$ goes to 
infinity, and $\omega(\pi/n) \log m$ goes to zero, as $n$ increases, we obtain the fact that if $f$ 
is continuous, of bounded variation, and $f(-\pi) = f(\pi)$, then the Fourier series 
of $f$ converges uniformly. 


4. The Garsia-Sawyer criterion. An interesting generalization of this last 
condition also follows easily from the Salem criterion. This involves functions 
complementary in the sense of W. H. Young (see e.g., [Zy]), which we now 
define. 

Let $\phi(u)$, $u \ge 0$, be a continuous, strictly increasing function such that 
$\phi(0) = 0$, and $\lim_{u \to \infty} \phi(u) = \infty$, and let $\psi(v)$ be the inverse of $\phi$. Now, let 

$$\Phi(u) = \int_0^u \phi(t)\,dt, \qquad \Psi(v) = \int_0^v \psi(t)\,dt, \qquad u, v \ge 0.$$

The functions $\Phi$ and $\Psi$ are said to be complementary in the sense of Young. We 
have Young's inequality: If $a \ge 0$, $b \ge 0$, then 

$$ab \le \Phi(a) + \Psi(b).$$


By an elaboration of the above method, the following generalization of the 
Jordan condition is obtained. First, we say that a function $f$ is of bounded 
$\Phi$-variation if 

$$\sup \sum_{i=0}^{n-1} \Phi(|f(x_{i+1}) - f(x_i)|) < M$$

for some $M > 0$, the sup taken over all partitions 

$$-\pi = x_0 \le x_1 \le \cdots \le x_n = \pi.$$


THEOREM 3. If $f$ is continuous, of bounded $\Phi$-variation, and the complementary 
function $\Psi$ of $\Phi$ is such that $\sum_{n=1}^{\infty} \Psi(1/n) < \infty$, then the Fourier series of $f$ 
converges uniformly. 


The special case $\Phi(u) = u^p$, $p > 1$, was given by L. C. Young [Y] before the 
work of Salem. 
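
As a worked instance of the definitions, consider the power case, normalized here as $\Phi(u) = u^p/p$. This normalization is an assumption made so that $\Phi$ arises from the integral definition; the constant factor affects neither bounded $\Phi$-variation nor the convergence of the series in Theorem 3.

```latex
\begin{align*}
\phi(u) &= u^{p-1}, & \psi(v) &= v^{1/(p-1)} = v^{q-1}, \qquad \tfrac1p + \tfrac1q = 1,\\
\Phi(u) &= \int_0^u t^{p-1}\,dt = \frac{u^p}{p}, & \Psi(v) &= \int_0^v t^{q-1}\,dt = \frac{v^q}{q},\\
ab &\le \frac{a^p}{p} + \frac{b^q}{q}, & \sum_{n=1}^{\infty} \Psi(1/n) &= \frac1q \sum_{n=1}^{\infty} n^{-q} < \infty \quad (q > 1).
\end{align*}
```

Thus the hypothesis on $\Psi$ in Theorem 3 holds for every $p > 1$.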

A number of questions remain. For example, if $\Phi$ is such that its 
complementary function $\Psi$ does not satisfy the above condition, is there an $f$ of 
bounded $\Phi$-variation whose Fourier series does not converge uniformly? 

A result on uniform convergence of Fourier series involving a different 
generalization of bounded variation has recently been given by Garsia and Sawyer 
[GS]. They start with the fact that a continuous $f$ is of bounded variation if 
and only if 

$$\int_0^1 n(y)\,dy < \infty,$$

where we assume $[0, 1]$ is the exact range of $f$, and $n(y)$ is the cardinality of 
$f^{-1}(y)$ when this is finite and $\infty$ otherwise. They make a slight change in the 
form of this condition. For each $y$, let $E_y = \{x : f(x) > y\}$. Then $E_y$ is open. Let 
$N(y)$ be the number of components of $E_y$, considered modulo $2\pi$. Then $f$ is of 
bounded variation if and only if 

$$\int_0^1 N(y)\,dy < \infty.$$

(We note that this is the one-dimensional case of an n-dimensional result of 
Fleming and Rishel [FR], where the role of $N(y)$ is taken by the perimeter of a 
set in the sense of de Giorgi.) 
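
Both integral conditions can be checked numerically on a sampled function. The Python sketch below replaces $f$ by the polygon through equally spaced samples, counts $n(y)$ as the number of crossings of the level $y$, and counts $N(y)$ as the number of upcrossings, which equals the number of components of $\{f > y\}$ on the circle for levels strictly between the minimum and the maximum. All function names are hypothetical, and the sampling is an assumption of the sketch.

```python
import math

def sample_periodic(f, count):
    """Values of f at count + 1 equally spaced points of [-pi, pi]."""
    return [f(-math.pi + 2 * math.pi * i / count) for i in range(count + 1)]

def crossings(values, y):
    """n(y): number of solutions of f = y on the sampled polygon (generic y)."""
    return sum(1 for a, b in zip(values, values[1:]) if (a - y) * (b - y) < 0)

def components_above(values, y):
    """N(y): components of {f > y} on the circle, counted as upcrossings of y."""
    return sum(1 for a, b in zip(values, values[1:]) if a <= y < b)

def integrate_levels(counter, values, levels=200):
    """Midpoint rule for the integral over 0 <= y <= 1 of counter(values, y)."""
    return sum(counter(values, (j + 0.5) / levels) for j in range(levels)) / levels

def total_variation(values):
    """Variation of the sampled polygon."""
    return sum(abs(b - a) for a, b in zip(values, values[1:]))
```

For $f(x) = (1 + \cos 3x)/2$, whose exact range is $[0, 1]$, the integral of $n(y)$ reproduces the total variation of $f$, and the integral of $N(y)$ is half of it, since each component of $\{f > y\}$ contributes one upcrossing and one downcrossing.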

Garsia and Sawyer suppose $N(y)$ to be finite almost everywhere, and obtain 
an interesting decomposition of $f$. For each $n$ and $x$, they let $f_n(x)$ be the measure 
of the set of points $y$, in the ordinate set $[0, f(x)]$ of $f$ at $x$, for which $N(y) = n$. 
Then $f_n(x)$ is continuous, and by considering integral means, is easily seen to be 
of bounded variation. Obviously, $f = \sum_{n=1}^{\infty} f_n$. By applying to this decomposition 
the fact that if $E$ is an open set, with $n$ components, then all the partial sums of 
the Fourier series of the characteristic function of $E$ are bounded by 

$$A + B \log n,$$

where $A$ and $B$ are absolute constants, they obtain the following result: 


THEOREM 4. If $f$ is continuous, with $f(-\pi) = f(\pi)$, and such that 

$$\int_0^1 \log N(y)\,dy < \infty,$$

then the Fourier series of $f$ converges uniformly. 

It would be interesting to know if this result is best possible, and what con- 
nection there is between conditions such as those of Theorem 3 and those of 
Theorem 4. 

5. Analogues for other groups. Since the behavior of the Walsh-Fourier 
series parallels that of the trigonometric Fourier series in many ways, it is 
natural to ask if Theorems 3 and 4 have analogues for Walsh series. 

Suppose $x \in [0, 1]$ with dyadic expansion 

$$x = \sum_{n=1}^{\infty} \frac{d_n(x)}{2^n},$$

where $d_n(x) = 0$ or 1. We assume that the terminating expansion is chosen for 
dyadic rationals. Let 

$$\psi_0(x) = 1, \qquad \psi_{2^n}(x) = e^{\pi i d_{n+1}(x)},$$

and for $m = 2^{n_1} + 2^{n_2} + \cdots + 2^{n_k}$, where $0 \le n_1 < \cdots < n_k$, 

$$\psi_m(x) = \psi_{2^{n_1}}(x)\,\psi_{2^{n_2}}(x) \cdots \psi_{2^{n_k}}(x).$$

These are the Walsh functions. The functions $r_n = \psi_{2^n}$ are the Rademacher 
functions. The Walsh functions form a complete orthonormal system in $L^2[0, 1]$. If 
$f \in L^1[0, 1]$, we will denote the $n$th partial sum of its Walsh-Fourier series by 

$$S_n(x, f).$$


The Walsh functions may be regarded as the characters of the direct product 
of a countable infinity of cyclic groups of order 2, $2^{\omega}$. In his now classical paper 
on Walsh-Fourier series, Fine [F1] restructured the interval $[0, 1]$ by 
introducing the topology and addition of the group $2^{\omega}$. Following his terminology, 
we define 

$$x \dotplus x' = \sum_{n=1}^{\infty} \frac{1}{2^n} \{(d_n(x) + d_n(x')) \bmod 2\}.$$

The topology is determined by the metric 

$$\rho(x, x') = \sum_{n=1}^{\infty} \frac{1}{2^n} |d_n(x) - d_n(x')|.$$

The following analogue of Theorem 2 has been demonstrated by C. 
Onneweer [O1]: 


THEOREM 5. Let $f$ be a continuous function of period 1. Let 

$$U_n(x) = \frac{f(x + 2/2^{n+1}) - f(x + 3/2^{n+1})}{1} + \frac{f(x + 4/2^{n+1}) - f(x + 5/2^{n+1})}{2} + \cdots + \frac{f(x + (2^{n+1} - 2)/2^{n+1}) - f(x + (2^{n+1} - 1)/2^{n+1})}{2^n - 1}.$$

Then if $U_n(x)$ converges uniformly to zero as $n$ increases, $S_n(x, f)$ converges 
uniformly to $f$. 

From this result we can easily derive the Dini-Lipschitz and Jordan tests 
for uniform convergence. An analogue of Theorem 3 on bounded ®-variation 
also holds for Walsh-Fourier series as a consequence of Theorem 5. 

The validity of the theorem of Salem for the characters of the group $2^{\omega}$ 
suggests the problem of extending that result to other groups. 

Let us suppose that $G$ is a totally disconnected compact abelian group 
satisfying the second axiom of countability. Let $X$ be its character group; then 
$X$ is a discrete, countable, abelian torsion group. $X$ can be written as the union 
of an increasing sequence of subgroups $X_n$ such that $X_0 = \{\chi_0\}$, where $\chi_0$ is the 
identity character, and $X_n / X_{n-1}$ is cyclic of prime order $p_n$. We suppose 
$\sup p_n = p < \infty$. The $X_n$ are chosen so that there is a sequence $\{\phi_n\}$ of characters such 
that $\phi_n \in X_{n+1} \setminus X_n$ and $\phi_n^{p_{n+1}} \in X_n$. Let $m_n = \prod_{i=1}^{n} p_i$, $m_0 = 1$. We may order $X$ as 
follows. Each non-negative integer $k$ has a unique representation, $k = \sum_{i=0}^{s} a_i m_i$ 
with $0 \le a_i < p_{i+1}$ for $0 \le i \le s$. We define 

$$\chi_k = \phi_0^{a_0} \phi_1^{a_1} \cdots \phi_s^{a_s}.$$

We note that $\chi_{m_n} = \phi_n$ and $X_n = \{\chi_i : 0 \le i < m_n\}$. 
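
The enumeration of the characters rests on a mixed-radix expansion of the index $k$. The Python sketch below computes the digits $a_i$ from a finite list of primes, where `primes[i]` plays the role of $p_{i+1}$; the function names are hypothetical.

```python
def mixed_radix_digits(k, primes):
    """Digits a_0, a_1, ... with k = sum of a_i * m_i, where m_i = p_1 * ... * p_i.

    primes[i] plays the role of p_{i+1}, so 0 <= a_i < primes[i].
    """
    digits = []
    for p in primes:
        digits.append(k % p)
        k //= p
    if k:
        raise ValueError("k is too large for the given list of primes")
    return digits

def from_digits(digits, primes):
    """Inverse map: rebuild k from its mixed-radix digits."""
    total, m = 0, 1          # m runs through m_0 = 1, m_1 = p_1, m_2 = p_1 * p_2, ...
    for a, p in zip(digits, primes):
        total += a * m
        m *= p
    return total
```

With `primes = [2, 3, 5]`, the index $k = m_2 = 6$ has digits `[0, 0, 1]`, which matches the remark that $\chi_{m_n} = \phi_n$.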

Now let $G_n$ be the annihilator of $X_n$, i.e., $G_n = \{x \in G : \chi_k(x) = 1 \text{ for } 0 \le k < m_n\}$. 
Then $G = G_0 \supset G_1 \supset G_2 \supset \cdots$ and $\bigcap_n G_n = \{0\}$. It is easily seen that the $G_n$ 
form a basis for the neighborhoods of 0 in $G$. Choose $x_n \in G_n \setminus G_{n+1}$ such that 
$\chi_{m_n}(x_n) = \exp(2\pi i / p_{n+1})$. Each $x \in G$ has a unique representation $x = \sum_{i=0}^{\infty} b_i x_i$, 
$0 \le b_i < p_{i+1}$. This representation defines an ordering in $G$ by means of the 
lexicographical ordering of the sequences $\{b_i\}$. It is clear that 

$$G_n = \Bigl\{x \in G : x = \sum_{i=0}^{\infty} b_i x_i, \text{ with } b_0 = \cdots = b_{n-1} = 0\Bigr\}.$$

Thus each coset of $G_n$ has a unique representation $z + G_n$, where $z = \sum_{i=0}^{n-1} b_i x_i$, 
$0 \le b_i < p_{i+1}$. We denote these $z$, ordered lexicographically, by $\{z_\alpha^{(n)}\}$, $0 \le \alpha < m_n$. 
Let $\omega_k = \exp(2\pi i / p_{k+1})$. 


The Fourier series of an integrable function $f$ is given by 

$$\sum_{n=0}^{\infty} c_n \chi_n(x), \qquad \text{where } c_n = \int_G f(t)\,\overline{\chi_n(t)}\,dt.$$


Fourier series over the group G have been studied by Vilenkin [V]. 
The following generalization of the theorem of Salem has been shown by 
Onneweer and Waterman [OW]: 


THEOREM 6. Suppose $f$ is continuous on $G$. Then the Fourier series of $f$ 
converges uniformly if 

$$\sum_{\alpha=1}^{m_k - 1} \frac{1}{\alpha} \Bigl| \sum_{j=0}^{p_{k+1} - 1} f(u - z_\alpha^{(k)} - j x_k)\,\omega_k^{j a_k} \Bigr|$$

converges to zero uniformly in $u$ and $a_k \in \{1, 2, \ldots, p_{k+1} - 1\}$ as $k \to \infty$. 


From this result one can readily obtain tests for uniform convergence and 
results on bounded $\Phi$-variation analogous to those already obtained for 
trigonometric and Walsh series. Onneweer [O2] has also succeeded in extending 
Theorem 4 to this setting. 


6. A theorem of Goffman and Waterman. Theorem 3 yields a wide class of 
functions with the property that the Fourier series of f o g converges uniformly 
for every homeomorphism g. This suggests the question as to whether or not one 
can characterize the functions with this property. In this regard, a charac- 
terization has been obtained by Goffman and Waterman [GW,] of the con- 
tinuous functions f, with f(—7) =f(7), for which the Fourier series of f o g con- 
verges everywhere (perhaps not uniformly) for every homeomorphism g. It is 
obtained by an elaboration of the method of Salem. 

We define right and left systems of intervals at a point $x$. Let $\{k_n\}$ be a 
sequence of positive integers such that $\lim_n k_n = \infty$ and $\lim_n (k_n / n) = 0$. For 
each $n$ and $m = 1, \ldots, k_n$, let $I_{nm}$ be disjoint closed intervals such that $I_{n,m-1}$ 
is to the left of $I_{nm}$. If for some real $x$ the collection 

$$\mathcal{I} = \{I_{nm} : n = 1, 2, \ldots;\ m = 1, \ldots, k_n\}$$

has the property that, for each $\epsilon > 0$, there is an $N$ such that $I_{nm} \subset (x, x + \epsilon)$ for 
all $n > N$, then $\mathcal{I}$ is called a right system of intervals at $x$. For each right system 
$\mathcal{I}$, consider the sequence 

$$\alpha_n(\mathcal{I}) = \sum_{m=1}^{k_n} \frac{1}{m} f(I_{nm}),$$

where $f(I) = f(b) - f(a)$ for any interval $I = [a, b]$. Left systems, $\mathcal{I}$, and the 
numbers $\alpha_n(\mathcal{I})$ are defined similarly. 

An elaboration and suitable modification of the technique of the Salem 
proof shows that if $f$ is continuous, $f(-\pi) = f(\pi)$, and $\lim_n \alpha_n(\mathcal{I}) = 0$ for every 
right and left system $\mathcal{I}$, then the Fourier series of $f$ converges everywhere. If, 
conversely, there is an $\mathcal{I}$ for which $\lim_n \alpha_n(\mathcal{I}) \ne 0$, then there is a homeomorphism 
$g$ such that the Fourier series of $f \circ g$ diverges somewhere. Concerning the 
construction of $g$, we only note that the following lemma is used. The symbol 
$\sum(f, k, \xi, \theta)$ denotes 


LG) CS +} 


LEMMA 1. Let $\{k_n\}$ be a sequence of integers with the properties $\lim_n k_n = \infty$ 
and $\lim_n k_n / n = 0$. There is a sequence $\{\xi_n\}$, $0 < \xi_n < \pi/n$, such that for every 
function $h$, continuous in a neighborhood of zero, with $h(0) = 0$, there is a sequence 
$\{\theta_n\}$, $0 < \theta_n < (\pi/n) - \xi_n$, such that 

$$\int_0^{(2k_n + 1)\pi/n} h(t)\,\frac{\sin nt}{t}\,dt \quad \text{and} \quad \frac{1}{\pi} \sum(h, k_n, \xi_n, \theta_n)$$

are equiconvergent. 
We thus have the following result: 


THEOREM 7. A continuous $f$, with $f(-\pi) = f(\pi)$, is such that $f \circ g$ has an 
everywhere convergent Fourier series for each homeomorphism $g$ if and only if 
$\lim_n \alpha_n(\mathcal{I}) = 0$ for every right and left system $\mathcal{I}$. 


The analogous question with uniform convergence remains open. There are 
continuous functions whose Fourier series converge everywhere without con- 
verging uniformly [Zy]. Is there a continuous function f such that the Fourier 
series of f o g converges everywhere, for every homeomorphism g, but does not 
converge uniformly for some g? 


7. The Menchov Theorem. We turn now to the second way in which the 
divergence of the Fourier series of a continuous function may be corrected. This 
is given by the following theorem of Menchov [Ba]. 


THEOREM 8. If $f$ is a measurable function on $[-\pi, \pi]$ and $\epsilon > 0$, then there is a 
continuous function $g$ such that $f(x) = g(x)$ except on a set of measure less than $\epsilon$, 
and the Fourier series of $g$ converges uniformly. 


The proof of this theorem is quite complicated. There seems to be little hope 
of proving it by finding a g which solves the problem by being in a class for which 
Fourier series converge uniformly. In particular, for each modulus of continuity 
$\omega$, there is a continuous $f$ such that any $g$ whose modulus is $\omega$ differs from $f$ 
almost everywhere [G]. This theorem also holds for Walsh series, the proof 
being similar in principle and more transparent. We shall accordingly discuss 
this case, which was given by Kotlyar [K] and in sharper form by Price [Pr]. 

We indicate the argument given by Price. Let $V(n)$ be the vector space 
determined by the functions $\psi_i$, $i < 2^n$, and for $n < k$, let $V(n, k)$ be the vector space 
determined by the functions $\psi_i$, $2^n \le i < 2^k$. Then $V(n, k)$ is the orthogonal 
complement of $V(n)$ in $V(k)$. Let $g_{nj}$ be the characteristic function of the interval 
$[j/2^n, (j+1)/2^n]$. The following lemma allows enough freedom to prove the 
adjustment theorem: 


LEMMA 2. For each $g_{nj}$, each $N \ge n$, and each $r > 0$, there is a function 
$g \in V(N, N + r)$, which vanishes off $[j/2^n, (j+1)/2^n]$, is equal to $g_{nj}$ except on a 
set of measure less than $1/2^{n+r}$, and is such that $g$ and the partial sums of its Walsh 
series are all bounded by $2^r$. 


By an ingenious iterated application of this lemma, using the fact that, for 
each n, the corresponding N can be taken to be arbitrarily large, Price obtains 
the following noteworthy result. 


THEOREM 9. If $f$ is measurable and finite a.e. on $[0, 1]$, then for each $\epsilon > 0$ and 

$$p_1 < q_1 < \cdots < p_n < q_n < \cdots,$$

where $\{q_n - p_n\}$ is increasing, there is a set $E$ of measure less than $\epsilon$, and a function 
$g$, which differs from $f$ only on $E$, whose Walsh-Fourier series converges uniformly 
and involves only Walsh functions $\psi_n$ for which $p_i < n < q_i$. Moreover, for continuous 
$f$, the set $E$ depends only on the modulus of continuity of $f$. 


The scope of validity of the phenomenon of Theorems 8 and 9 remains to 
be determined. In particular, it would be interesting to find out whether it 
belongs to either the theory of real functions or to harmonic analysis. That is to 
say, is it concerned with orthonormal systems, or with group characters? 


8. Walsh-like systems. Another interesting problem is to study the effect of 
changes of variable on orthogonal systems. Suppose $\{\phi_n\}$ is a system of 
functions on $(0, 1)$ with $\phi_0 \equiv 1$. From this system we form $\{\psi_n\}$ just as the Walsh 
functions are obtained from the Rademacher functions. Thus for $n = 2^{k_1} + \cdots + 2^{k_s}$ 
with $0 \le k_1 < k_2 < \cdots < k_s$, we set 


If $\{\psi_n\}$ is an orthogonal system on $(0, 1)$ it is called a W-system. We assume 
also that there is a $K$ and an $n_0$ such that 


$$(*) \qquad \int_0^1 |\psi_n| \ge K \quad \text{for } n \ge n_0.$$


The study of these systems has shown that they are essentially of two types 
depending on whether or not we assume 


$$(**) \qquad |\phi_n(x)| \le 1 \ \text{a.e. for every } n.$$


Under this assumption, the results on W-systems largely parallel those for the 
Walsh system [A, 185-196]. Without (**) the behavior may be like that of the 
systems generated by the strongly lacunary trigonometric sequences 
$\{2^{1/2} \cos m_n x\}$ and $\{2^{1/2} \sin m_n x\}$, with $m_{n+1}/m_n \ge 3$ [A, 190-191]. 

It has been shown by Waterman [W] that a W-system satisfying (*) and 
(**) behaves like the Walsh system because it is obtained from the Walsh sys- 
tem by a suitable change of variable. 

It is not difficult to see that there is a set $E$ of measure $K$ in $(0, 1)$ such that 

(1) $|\psi_n| = 1$ on $E$ for every $n$, 

(2) $m(\{\psi_n \ne 0\} \cap E^c) > 0$ for only finitely many $n$, 

(3) $\{\psi_n\}$ is orthogonal relative to $E$. 

If we now let $A$ be a 1-1 measure preserving map of $(0, 1)$ onto itself such that 
$A((0, K)) = E$, we see that $\{\psi_n \circ A(Kx)\}$ is a W-system on $(0, 1)$ and 
$|\psi_n \circ A(Kx)| = 1$ a.e. for all $n$. We may therefore assume $|\psi_n(x)| = 1$ a.e. for all $n$. We can 
show that we can define a function on $(0, 1)$ by means of the dyadic 
representation of its values, 

$$y(x) = .a_1 a_2 a_3 \cdots$$

with $a_n = (1/\pi i) \log \phi_n(x)$. It may be shown that for every measurable $E \subset (0, 1)$, 
$y^{-1}(E)$ is measurable and $m(y^{-1}(E)) = m(E)$. Clearly 

$$\psi_n(x) = w_n \circ y(x) \ \text{a.e. for all } n,$$

where $w_n$ denotes the $n$th Walsh function. 
Further we have the following result: 


THEOREM 10. $\{\psi_n\}$ is complete if and only if there is a metric automorphism 
$\eta$ of $(0, 1)$ such that $\psi_n(x) = w_n \circ \eta(x)$ a.e. for every $n$, where $w_n$ is the $n$th Walsh function. 


Stein [St] has shown that there is a summable function whose Walsh- 
Fourier series diverges a.e., and Billard [Bi] has shown that the Walsh-Fourier 
series of a function of class $L^2$ converges a.e. As corollaries of these results and 
Theorem 10 we obtain: 


COROLLARY 1. For any W-system there is an integrable function whose W- 
Fourier series diverges a.e. 


COROLLARY 2. The W-Fourier series of an $L^2$ function converges a.e. 


9. Almost everywhere convergence. Menchov proved Theorem 8 in connec- 
tion with a problem of Lusin on convergence of trigonometric series. Lusin’s 
question was the following: If f is a measurable function, is there a trigonometric 
series, with coefficients converging to zero, which converges almost everywhere 
to f? This was answered in the affirmative by Menchov [Ba] for functions 
which are finite almost everywhere, using Theorem 8 as a tool in the proof. An 
analogous fact has not been established for the Walsh functions, and seems to 
be difficult. Thus, the theorem of Section 7 seems to be easier for Walsh series 
than for trigonometric series, while the theorem on the existence of a series 
converging almost everywhere to a given measurable function seems to be 
harder for Walsh series. 

Regarding the almost everywhere convergence of the Fourier series of a 
function in $L^2$, the proof has been recently given for Walsh series by Billard 
[Bi]. The proof, though by no means easy, is substantially less difficult than 
for the trigonometric case. Along the same lines, the theorem that a Fourier 
series is always (C, $\alpha$) summable, $\alpha > 0$, turned out to be a major effort for the 
case of Walsh series. It was proved by Fine [F2]. 

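The only complete orthonormal systems for which it is known that every 
finite measurable function is given by an almost everywhere convergent series 
are the trigonometric functions and the Haar functions. We recall that the 
Haar functions are the following complete orthonormal set on the interval 
$[0, 1]$: 

$$h_{00}(x) = 1, \qquad h_{01}(x) = \begin{cases} 1, & x \in [0, \tfrac12) \\ -1, & x \in (\tfrac12, 1] \\ 0, & x = \tfrac12, \end{cases}$$

$$h_{nk}(x) = \begin{cases} \sqrt{2^n}, & x \in \left(\dfrac{2k-2}{2^{n+1}}, \dfrac{2k-1}{2^{n+1}}\right) \\ -\sqrt{2^n}, & x \in \left(\dfrac{2k-1}{2^{n+1}}, \dfrac{2k}{2^{n+1}}\right) \\ 0, & \text{elsewhere}, \end{cases} \qquad n = 1, 2, \ldots;\ k = 1, \ldots, 2^n.$$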
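
The definition can be checked numerically. The Python sketch below evaluates $h_{nk}$ and is meant only as an illustration; it uses open intervals throughout, so its values at the dyadic endpoints may differ from the display on a set of measure zero, and the function name `haar` is hypothetical.

```python
import math

def haar(n, k, x):
    """Haar function h_{nk} on [0, 1]; the pair (0, 0) is the constant function 1.

    For n >= 0 and 1 <= k <= 2 ** n the support is ((2k - 2) / 2^(n+1), 2k / 2^(n+1)).
    Endpoint values are set to 0 here, which differs from the text only on a null set.
    """
    if (n, k) == (0, 0):
        return 1.0
    scale = 2 ** (n + 1)
    left, mid, right = (2 * k - 2) / scale, (2 * k - 1) / scale, (2 * k) / scale
    if left < x < mid:
        return math.sqrt(2 ** n)
    if mid < x < right:
        return -math.sqrt(2 ** n)
    return 0.0
```

Since each $h_{nk}$ with $n \le 3$ is constant on dyadic intervals of rank 4, averaging products over the midpoints of the 32 dyadic intervals of rank 5 computes the inner products exactly and confirms orthonormality.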
The fact that, for each finite measurable $f$, there is a Haar series which 
converges almost everywhere was obtained by Bari [T1] as a consequence of the 
theorem of Lusin [H] which asserts that, for every such $f$, there is a continuous 
$F$ such that $F' = f$ almost everywhere. The desired Haar series has as its 
coefficients rather evident linear combinations of the values of $F$ at end points of 
dyadic intervals. By modifying F by a suitable Cantor function, it is clear that 
we may omit any finite set of Haar functions and obtain, for each finite measur- 
able f, a Haar series which converges almost everywhere to f and involves only 
the remaining Haar functions. Whether or not certain infinite sets of Haar func- 
tions may be omitted has not been considered. 

The fact that a finite measurable function may be represented as an almost 
everywhere limit of Haar functions, with a finite set of functions omitted, as- 
sures the existence of sets of multiplicity of measure 0 for the Haar functions. 
We recall that a set $S$ is a set of uniqueness for a system $\{\phi_n\}$ if the convergence 
to zero of $\sum_{n=1}^{\infty} a_n \phi_n$ on the complement of $S$ implies that the coefficients $a_n$ 
are all zero. Also $S$ is a set of multiplicity if it is not a set of uniqueness. Since 
the above implies the existence of a nonzero Haar series converging almost 
everywhere to zero, it assures the existence of sets of multiplicity of measure 
zero. 

Whether or not every measurable function is the almost everywhere limit 
of a series of trigonometric functions with finitely many omitted, is not known. 


10. Convergence in measure. The original Lusin problem included functions 
which may be infinite on sets of positive measure. It has recently been shown by 
Arutyanyan and Talalyan [AT] that if f is infinite on a set of positive measure, 
then no Haar series converges almost everywhere to f; and the same applies to 
Walsh series. Considerable light has been cast on this matter by Gundy [Gu], 
who has related it to martingale theory. 

In regard to Lusin’s question, Menchov asked whether the answer is affirma- 
tive using convergence in measure instead of almost everywhere convergence. 
He showed [M] that for each measurable function $f$, finite or infinite, there is a 
trigonometric series which converges in measure to $f$. The proof given by 
Menchov is grimly complicated. Fortunately, Talalyan [T1] gave an elegant, 
understandable, although by no means easy, proof of the theorem in a more general 
setting. 


THEOREM 11. If $\{\phi_n\}$ is a sequence in $L^p[a, b]$, $p > 1$, of functions of norm 1, 
which is a Schauder basis for $L^p[a, b]$ then, for each measurable $f$, finite or infinite, 
there is a series $\sum_{n=1}^{\infty} a_n \phi_n$, with $\lim_n a_n = 0$, which converges in measure to $f$. 


A central role in the proof of this theorem is assumed by a lemma. 


LEMMA 3. If $G$ is a bounded measurable set of positive measure, $\{\phi_n\}$ is a 
sequence in $L^p(G)$, $p > 1$, of functions of norm 1, which is a Schauder basis for $L^p(G)$, 
and $f$ is finite and measurable, then for each $\epsilon > 0$ and positive $n$, there is a 
measurable set $E \subset G$ with $m(E) < \epsilon$, and there are reals $a_k$ ($k = n+1, \ldots, m$) with 
$|a_k| < \epsilon$ such that 

$$\Bigl\| \sum_{k=n+1}^{m} a_k \phi_k - f \Bigr\|_{G - E} < \epsilon \quad \text{and} \quad \Bigl\| \sum_{k=n+1}^{s} a_k \phi_k \Bigr\|_{F} < \|f\|_{F} + \epsilon,$$

for each measurable $F \subset G - E$ and $s = n+1, \ldots, m$. 


From the manner in which this lemma is used in the proof of the theorem,
it follows that any finite subset of $\{\phi_n\}$ can be dropped and the theorem remains
valid. It is again not clear whether or not certain infinite sets can be
dropped.

It seems that the phenomenon of Theorem 11 should have no relation to
Schauder bases in $L_p$. However, the proof of Talalyan is rigidly bound to this
special class of systems. A new method would be needed to go further.

Lemma 3 has further interest. Indeed, the existence of universal series
for all possible systems was established by Talalyan [T.] with its use. It turns 
out, however, that the existence of such series is quite easy to establish, as 
noted by Goffman and Waterman [GW,]. We discuss the matter in Section 12. 


11. The Müntz Theorem. We consider first the functions $1, x, \dots, x^n, \dots$
defined on [0, 1]. The Weierstrass polynomial approximation theorem asserts
that each continuous function on [0, 1] is the uniform limit of finite linear
combinations of these functions. On the other hand, only analytic functions
are uniform limits of series $\sum_{n=0}^{\infty} a_nx^n$. A similar fact holds for $L_p[0, 1]$, $p\ge 1$.
Indeed, if $\sum_{n=0}^{\infty} a_nx^n$ converges at $x\ne 0$, it must converge uniformly to an analytic
function on each interval $(-|x|+\delta, |x|-\delta)$ for small $\delta>0$.

Some of the functions in the set $1, x, x^2, \dots$ may be removed, and each
continuous function is still a uniform limit of finite linear combinations of the
remaining functions. We have the marvelous theorem of Müntz [Fe]:


The finite linear combinations of $1, x^{n_1}, \dots, x^{n_k}, \dots$ are dense in C[0, 1]
if and only if $\sum_{k=1}^{\infty} 1/n_k=\infty$.


We say that a sequence is total in a space X if its finite linear combinations
are dense in X. Müntz's Theorem holds in $L_2[0, 1]$, with the difference that
the function 1 does not play a special role. Thus $x^{n_1}, \dots, x^{n_k}, \dots$ is total in
$L_2[0, 1]$ if and only if $\sum_{k=1}^{\infty} 1/n_k=\infty$. The same holds for any $L_p[0, 1]$, $p\ge 1$.
If $\sum_{k=1}^{\infty} 1/n_k<\infty$, the uniform closure of the span of $x^{n_1}, \dots, x^{n_k}, \dots$ is a
subset of the set of analytic functions. This fact was proved by Clarkson and
Erdős [CE]. In the same work, they showed that Müntz's Theorem holds for
any positive interval in the place of [0, 1]. It would be interesting to know if
it holds for arbitrary measurable subsets of [0, 1] of positive measure. This
would imply that the theorem holds for the space M of equivalence classes of
measurable functions, with almost everywhere convergence or convergence
in measure.
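
The dichotomy in the square-integrable version of Müntz's Theorem can be watched numerically. The sketch below does not come from the text. It relies on the classical Gram determinant formula for the distance in the square-integrable norm on [0, 1] from a power x^m to the span of finitely many other powers, and the function name and the two exponent sequences are hypothetical choices made only for illustration.

```python
import math

def l2_distance(m, exponents):
    """Distance in L_2[0, 1] from x**m to the span of {x**n : n in exponents}.

    Uses the classical Gram-determinant closed form
        (1 / sqrt(2m + 1)) * prod |m - n| / (m + n + 1),
    valid for distinct exponents n > -1/2 (an assumption of this sketch).
    """
    d = 1.0 / math.sqrt(2 * m + 1)
    for n in exponents:
        d *= abs(m - n) / (m + n + 1)
    return d

# Hypothetical exponent sequences chosen for illustration.
divergent = [2 * k for k in range(1, 2001)]   # sum of 1/n_k diverges
convergent = [2 ** k for k in range(1, 60)]   # sum of 1/n_k converges

# Approximate the function x (m = 1) by more and more even powers.
for count in (10, 100, 1000, 2000):
    print(count, l2_distance(1, divergent[:count]))

# The same target approximated by the lacunary powers x**2, x**4, x**8, ...
print("lacunary:", l2_distance(1, convergent))
```

With the even exponents the printed distances shrink toward zero as more powers are admitted, while with the lacunary exponents the distance stays bounded away from zero, in line with the divergence criterion.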

Müntz's Theorem assures the existence of a universal series. That is to say,
there is a series

$$a_0 + a_1x + \cdots + a_nx^n + \cdots$$

such that, if $\{s_n\}$ is its sequence of partial sums, each $f\in L_2[0, 1]$ is the limit,
in the $L_2$ norm, of a subsequence of $\{s_n\}$. Universal series exist in the space of
measurable functions for every total set.


12. The space M. We consider the space M of measurable functions on
[0, 1], with the convergence in measure metric. This is a topological vector space
which is not locally convex. It has a dual, the space of continuous linear functionals,
which consists of the zero functional alone. In order to see this, let $\phi$ be
a nonzero linear functional on M. There is then a positive $f$ such that $\phi(f)\ne 0$.
For each $n$, and $i=0,\dots,n-1$, let $s_{ni}$ be the characteristic function of
$[i/n, (i+1)/n]$. There is an $i=i_n$ and a real number $a_n$ such that $\phi(a_ns_{ni_n}f)=1$.
Now $\{a_ns_{ni_n}f\}$ converges to zero in M, so that $\phi$ is not continuous.
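
The two steps left implicit in this argument can be written out as follows. This is only a sketch: $\phi$ is the nonzero functional, $s_{ni}$ the characteristic function of the $i$-th interval of length $1/n$, and the abbreviation $g_n$ is introduced here for convenience.

```latex
% Step 1: choice of the index i_n and the scalar a_n.
% Since f = \sum_i s_{ni} f almost everywhere, linearity of \phi gives
\[
  0 \ne \phi(f) = \sum_{i=0}^{n-1} \phi(s_{ni} f),
\]
% so \phi(s_{n i_n} f) \ne 0 for at least one index i_n, and we may put
\[
  a_n = \frac{1}{\phi(s_{n i_n} f)}, \qquad g_n = a_n s_{n i_n} f, \qquad \phi(g_n) = 1 .
\]
% Step 2: convergence in measure, whatever the size of a_n.
% g_n vanishes off the interval [i_n/n, (i_n+1)/n], so for every \epsilon > 0
\[
  m\{x : |g_n(x)| > \epsilon\} \le \frac{1}{n} \longrightarrow 0 .
\]
% Hence g_n \to 0 in M while \phi(g_n) = 1 for all n:
% \phi is not continuous at 0.
```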

We also need the fact that if $\{x_n\}$ is total in a topological vector space X,
and Y is the closed span of $\{x_2, x_3, \dots\}$ in X, then Y has deficiency 0 or 1.

From these two facts, we almost immediately have the following result: 


THEOREM 12. If S is total in M, and F is a finite subset of S, then $S-F$ is total
in M.
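
One way the two facts may be combined is sketched below. The symbols $x_1$, $Y$, $q$, and $\psi$ are introduced here for illustration, and the sketch assumes the standard fact that a one-dimensional Hausdorff topological vector space is isomorphic to the real line.

```latex
% Remove a single element x_1 from the total set S and let Y be the
% closed span of S - \{x_1\}.  By the deficiency fact,
\[
  \dim (M/Y) \in \{0, 1\}.
\]
% Suppose \dim(M/Y) = 1.  Since Y is closed, M/Y is a one-dimensional
% Hausdorff topological vector space, so there is an isomorphism
% j : M/Y \to \mathbb{R}.  With q : M \to M/Y the quotient map,
\[
  \psi = j \circ q
\]
% is a continuous linear functional on M with kernel Y \ne M, hence
% \psi \ne 0.  This contradicts the fact that the dual of M is trivial.
% Therefore \dim(M/Y) = 0, that is, Y = M and S - \{x_1\} is total.
% Repeating the argument once for each element of a finite set F
% shows that S - F is total.
```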


The two corollaries below follow readily.


COROLLARY 1. If S is total in M, there is an infinite $T\subset S$ such that $S-T$ is
total in M.


COROLLARY 2. If $\{\phi_n\}$ is total in M, there is a series $\sum_{n=1}^{\infty} a_n\phi_n$ whose sequence
of partial sums has the property that, for each measurable $f$, a subsequence converges
almost everywhere to $f$.


Little is known concerning the related matter for series. We pointed out 
that if a finite set of Haar functions is removed, every $f\in M$ has a series in the
remaining Haar functions which converges almost everywhere to f. No analogous 
property is known for trigonometric functions. The general problem has connec- 
tions with the existence of sets of multiplicity, as has already been noted. 

For the case of the Schauder functions, a system which is total in each
$L_p[0, 1]$, $p>1$, but which will not be defined here, Zink [Z] has given necessary
and sufficient conditions that a subsequence $\{\phi_{n_k}\}$ of the Schauder system $\{\phi_n\}$
be such that every measurable function $f$, finite or infinite, have a series expansion

$$\sum_{k} a_k\phi_{n_k},$$
which converges almost everywhere to f.


THE WORK OF NICHOLAS BOURBAKI*
JEAN A. DIEUDONNE (translated by Linda Bennison) 


Dear President and friend, 
Ladies and Gentlemen, 


Thank you for your kind words on my behalf. I must admit that it is a great 
pleasure for me to escape from my duties as dean and spend a week in such a 
warm and friendly atmosphere. The tradition of friendship between French and 
Roumanian scientists, particularly between our mathematicians, is old. I am 
very happy to be a link in this chain, which I hope will continue, stronger and 
more cordial, in the years to come. Well, if you don’t mind, I shall not make a 
very long speech, and I should be happy to answer at the end of it the questions 
which, no doubt, will be put to me. I don’t pretend to deal with all the history 
of the works of Bourbaki, and I shall give you all a chance to ask questions on 
the points I merely touch upon. 


To understand the origins of Bourbaki, we shall have to go back to the 
years that Mr. Nicolescu was recalling a few moments ago. These were the 
years when we were students, the years after the 1914 war; and this war, we 
can very well say, was extremely tragic for the French mathematicians. I 
shall not try to judge or give a moral assessment of what happened at that time. 
In the great conflict of 1914-18, the German and French governments did not 
see things in the same way where science was concerned. The Germans put 
their scholars to scientific work, to raise the potential of the army by their dis- 
coveries and by the improvement of inventions or processes, which in turn 
served to augment the German fighting power. The French, at least at the 
beginning of the war and for a year or two, felt that everybody should go to the 


It is hardly necessary to identify Prof. Dieudonné to our readers; still a few facts may prove 
interesting. Prof. Dieudonné studied at the Ecole Normale supérieure from 1924-1927, was a 
fellow at Princeton, Berlin, and Zürich, and received his doctorate in 1931. He served on the faculties
at Bordeaux, Rennes, Nancy, Sao-Paulo, Michigan, Northwestern, l'Institut des Hautes Études
Scientifiques, and presently is the Dean of the Faculty at Nice. He held visiting professorships at
Columbia, Johns Hopkins, Rio de Janeiro, Buenos Aires, Pisa, Maryland, Tata Institute Bombay, 
Notre Dame, and Washington. His honors include the Order of the Legion of Honor, the Order of 
the Academic Palms, and membership in the Academy of Sciences. He served as President of the 
Mathematical Society of France in 1964-65. 

Prof. Dieudonné has published a number of books and about 135 research articles on analysis, 
topology, spectral theory, classical groups, formal Lie groups, and non-commutative rings. 

It is believed by some that the original Bourbaki members were C. Chevalley, J. Delsarte, 
J. Dieudonné, and A. Weil. Apparently the name “Nicolaus Bourbaki” was that of a 19th century 
French general. It was part of an “initiation” of first year mathematics students that an upper 
classman, pretending to be a famous foreign mathematician, would deliver a lecture to them in 
which the theorems bore the names of famous generals and were all wrong in a nontrivial way. 
Editor.

front; so the young scientists, like the rest of the French, did their duty at the
front line. This showed a spirit of democracy and patriotism that we can only 
respect, but the result was a dreadful hecatomb of young French scientists. 
When we open the war-time directory of the Ecole Normale, we find enormous 
gaps which signify that two-thirds of the ranks were mowed down by the war. 
This situation had unfortunate repercussions for French mathematics. We 
others, too young to have been in direct contact with the war, but entering the 
University in the years after the war ended, should have had as our guides 
these young mathematicians, certain of whom we are sure would have had 
great futures. These were the young men who were brutally decimated and 
whose influence was destroyed. 

Obviously, people of previous generations were left, great scholars whom we 
all honour and respect. Masters like Picard, Montel, Borel, Hadamard, Denjoy, 
Lebesgue, etc., were living and still extremely active, but these mathematicians 
were nearly fifty years old, if not older. There was a generation between them 
and us. I am not saying that they did not teach us excellent mathematics: we 
all took first-class courses from these mathematicians (as Mr. Nicolescu is a 
witness), but it is indubitable (and true for that matter of every period) that 
a 50-year-old mathematician knows the mathematics he learned at 20 or 30, 
but has only notions, often rather vague, of the mathematics of his epoch, i.e., 
the period of time when he is 50. It is a fact we have to accept such as it is, we 
cannot do anything about it. 

So we had excellent professors to teach us the mathematics of let us say up 
to 1900, but we did not know very much about the mathematics of 1920. As I 
said before, the Germans went about things in a different way, so that the Ger- 
man mathematics school in the years following the war had a brilliance which 
was altogether exceptional. We only need to think of the mathematicians of 
the highest order who illustrated this point: C. L. Siegel, E. Noether, E. Artin, 
W. Krull, H. Hasse, etc., of whom we in France knew nothing. Not only this, 
but we also knew nothing of the rapidly developing Russian school, the bril- 
liant Polish school, which had just been born, and many others. We knew 
neither the work of F. Riesz nor that of von Neumann, etc. We had been closed 
in on ourselves and, in our world, the theory of functions reigned supreme. The 
only exception was Elie Cartan; but being 20 years ahead of his time, he was 
understood by no one. (The first to understand him after Poincaré was Hermann 
Weyl, and for 10 years he was the only one, so how could we poor little students 
have known enough to understand him?) So, apart from E. Cartan, who at this 
time didn’t count—he only started to count 20 years later, but since then his 
influence has grown steadily—we were entirely folded in on that theory of 
functions, which, while being important, represented only a part of mathematics. 


Our only opening onto the outside world at this time was the seminar of 
Hadamard, a professor, but not a very brilliant teacher, at the Collège de
France. (He was a great enough scholar for me to be able to say this without
harming his reputation.) He had the idea (apparently taken from abroad, because
this had never been done in France) of inaugurating a seminar of analysis
of current mathematical work. At the beginning of the year he distributed, to 
all those who wanted to speak on the subject, what he judged to be the most 
important memoirs of the past year, and they had to explain them at the black- 
board. It was a novelty for the time, and to us an extremely precious one, be- 
cause there we met mathematicians of many different origins. Also, it soon 
became a center of attraction for foreigners; they came in crowds. (Mr. Onicescu 
reminded me that he himself gave lectures at the Hadamard seminar in Paris.) 
So it was for us young students a source of acquaintances and views that we 
did not find in the formal mathematics courses given at the University. This 
state of affairs lasted several years, until certain of us—starting with A. Weil, 
then C. Chevalley, having been out of France meeting Italians, Germans, Poles, 
etc.—realized that if we continued in this direction, France was sure to arrive 
at a dead end. We would no doubt continue to be very brilliant in the theory 
of functions, but for the rest, French mathematicians would be forgotten. This 
would break a two-hundred-year-old tradition in France, because from Fermat 
to Poincaré, the greatest of the French mathematicians had always had the 
reputation of being universal mathematicians, as capable in arithmetic as in 
algebra, or in analysis, or in geometry. So we had this warning of the bubbling 
of ideas that was beginning to be seen outside, and several of us had the chance 
to go and see and learn at first hand the development that was going on outside 
our walls. After Hadamard retired in 1934, the seminar was carried on, in a 
slightly different form, by G. Julia. This consisted of studying in a more systematic
manner the great new ideas which were coming in from all directions. This
is when the idea arose of drawing up an overall work which, no longer in the shape of
a seminar, but in book form, would encompass the principal ideas of modern
mathematics. From this was born the Bourbaki treatise. I must say that the
collaborators of Bourbaki were very young at the time and doubtless they 
would never have started this job had they been older and better informed. In 
the first meetings for the project, the idea was that it would be finished in three 
years, and in this time we should draft the basic essentials of mathematics. 
Events and history decided differently. Little by little, as we became rather 
more competent and more aware, we realized the enormity of the job that had 
been taken on, and that there was no hope of finishing it as quickly as that. 


It is true that there were already excellent monographs at the time and, in 
fact, the Bourbaki treatise was modelled in the beginning on the excellent alge- 
bra treatise of Van der Waerden. I have no wish to detract from his merit, but 
as you know, he himself says in his preface that really his treatise had several 
authors, including E. Noether and E. Artin, so that it was a bit of an early 
Bourbaki. This treatise made a great impression. I remember it—I was working 
on my thesis at that time; it was 1930 and I was in Berlin. I still remember the 
day that Van der Waerden came out on sale. My ignorance in algebra was such
that nowadays I would be refused admittance to a university. I rushed to those
volumes and was stupefied to see the new world which opened before me. At 
that time my knowledge of algebra went no further than mathématiques spéciales, 
determinants, and a little on the solvability of equations and unicursal curves. 
I had graduated from the Ecole Normale and I did not know what an ideal was, 
and only just knew what a group was! This gives you an idea of what a young 
French mathematician knew in 1930. So we tried to follow Van der Waerden, 
but in effect he only covered algebra, and even then just a small part of algebra. 
(Since then, algebra has developed considerably, partly because of Van der 
Waerden’s treatise, which is still an excellent introduction. I am often asked for 
advice on how to start out studying algebra, and to most people I say: First 
read Van der Waerden, in spite of what has been done since.) 


So we intended to do something of this kind. Now Van der Waerden uses 
very precise language and has an extremely tight organization of the develop- 
ment of ideas and of the different parts of the work as a whole. As this seemed 
to us to be the best way of setting out the book, we had to draft many things 
which had never before been dealt with in detail. General topology could only 
be found in a few memoirs and in Fréchet’s book, which was, in effect, a com- 
pilation of an enormous quantity of results, without any kind of order. I can 
say the same of Banach’s book, which is admirable for research but completely 
disorganized; in other subjects such as integration (as presented by Bourbaki) 
and certain algebra questions, there was nothing. Before the chapter of Bour- 
baki on multilinear algebra, I don’t think there was a didactic work in the world 
that explained what exterior algebra was. We had to refer to the work of Grass- 
mann, which is not particularly clear. Thus we quickly realized that we had 
rushed into an enterprise which was considerably more vast than we had im- 
agined, and you know that this enterprise is still far from finished. In my brief- 
case I have the proofs of the 34th volume, which is devoted to three chapters 
of the theory of Lie groups. There are others, many others, being prepared; 
there are already three or four editions of preceding volumes, and the end of 
the work is not in sight. 

We had to have a starting point—we had to know what we wanted to do. 
Of course, there was the idea of the Encyclopedia, which, in fact, already ex- 
isted. As you know, it had been started by the Germans in 1900, and despite 
their proverbial tenacity and ardour for work, in 1930, after several editions 
and alterations, etc., it was hopelessly behind in comparison to the mathematical 
science of that time. Nowadays, nobody would think of starting on such an 
impossible enterprise, knowing the vast amount of mathematical publications 
released every year. I believe that we shall have to wait for the day when com- 
puters have minds and are able to assimilate all that in a few minutes. For the 
time being we have not progressed that far, nor had we gone that far in 1930. 
Moreover, it would have been useless to redo something which despite its merits 
had failed. The Encyclopedia, even at that period, was above all useful as a
bibliographical reference, to find out where such and such a result could be
found. But naturally, it contained no proofs, because if the Encyclopedia, al- 
ready gigantic with its 25-30 volumes, had included proofs it would have been 
ten times larger. No, we did not want to produce a work of bibliographic reference,
but one which would be a demonstrative mathematical text from beginning
to end. And this forced us into making an extremely strict selection. What selec- 
tion? Well, that is the crucial part in Bourbaki’s evolution. The idea which soon 
became dominant is that the work had to be primarily a tool. It had to be some- 
thing usable not only in a small part of mathematics, but also in the greatest 
possible number of mathematical places. So if you like, it had to concentrate 
on basic mathematical ideas and essential research. It had to reject completely 
anything secondary that had no immediately known application and that did 
not lead directly to conceptions of known and proved importance. There was 
much sifting, which started innumerable discussions among the collaborators, 
and which also earned Bourbaki a great deal of hostility. Because as the works 
of Bourbaki became known, all those who found that their favourite subject was 
not included were not inclined to do much propaganda in his favor. So I think 
that we can attribute much of the hostility that has been shown toward Bour- 
baki at certain periods, and which is still widespread in certain countries, to this 
extremely strict selection. 


So how do we choose these fundamental theorems? Well, this is where a new 
idea came in: that of mathematical structure. I do not say it was an original idea 
of Bourbaki—there is no question of Bourbaki’s containing anything original. 
Bourbaki does not attempt to innovate mathematics, and if a theorem is in 
Bourbaki, it was proved 2, 20, or 200 years ago. What Bourbaki has done is 
to define and generalize an idea which had already been widespread for a long time.
Since Hilbert and Dedekind, we have known very well that large parts of mathe- 
matics can develop logically and fruitfully from a small number of well-chosen 
axioms. That is to say, given the bases of a theory in an axiomatic form, we can 
develop the whole theory in a more comprehensible way than we could other- 
wise. This is what gave the general idea of the notion of mathematical structure. 
Let us say immediately that this notion has since been superseded by that of 
category and functor, which includes it under a more general and convenient 
form. It is certain that it will be the duty of Bourbaki, who, as I shall explain 
later, never fears change, to incorporate the valid ideas of this theory in his 
works. 

Once this idea had been clarified, we had to decide which were the most im- 
portant mathematical structures. Naturally, this was the root of many discus- 
sions before we found ourselves in agreement. I might say that Bourbaki does 
not pretend to be infallible; he has been mistaken several times about the future 
of structures, and apologized when it was necessary, withdrawing his original 
ideas. Successive editions trace some changes clearly. Bourbaki does not pretend
to want to fix or nail down mathematics; that would be exactly contrary
to his original purpose. But if one does not recoil from new ideas, even when they
go beyond Bourbaki, one has no respect for tradition. Consequently this open 
systematic attitude of Bourbaki has also been a cause of hostility, this time on 
the part of people of previous generations, who criticized the liberties Bourbaki 
took with the mathematics of their time. In particular, the choice of definitions 
and the order in which the subjects were arranged were decided according to 
a logical and rational scheme. If this did not agree with what was done previ- 
ously, well, it means that what was done previously had to be thrown overboard, 
without sparing even long-established traditions. To give you an example: 
Bourbaki refuses to say non-decreasing when referring to an increasing function 
because this would be a total absurdity. We know that this term means what 
we want to say only when talking about linear (total) order relations. (If one 
says non-decreasing in the setting of a non-linear order relation, this hardly 
means increasing but not strictly increasing.) So Bourbaki purely and simply
abolished this terminology, as he did many others. He also invented terminol- 
ogy, using Greek when it was necessary, but also using many words from ordi- 
nary speech, which made traditionalists wince. They did not admit easily that 
what we now call boule or pavé used to be called hypersphéroide or parallélotope, 
and their reaction was: “This work is not to be taken seriously.” A little book 
came out recently, which we liked very much. It is called “Le Jargon des 
Sciences” by Etiemble, vigilant guardian of the French language. He insists on 
preserving it in its original purity and is up in arms against the gibberish of most 
scientists. Happily, he makes an exception of French mathematicians, saying 
that they had the good sense to take simple, authentic French words from 
ordinary speech, sometimes changing their meaning. He cites attractive examples,
recent titles such as Platitude et privilège and Sur les variétés riemanniennes
non suffisamment pincées. This is the style in which Bourbaki is written—in a 
recognizable language and not in a jargon sprinkled with abbreviations, as in 
Anglo-Saxon texts where you are told about the C.F.T.C. which is related to 
an A.L.V. unless it is a B.S.F. or a Z.D., etc. After ten pages of this you have 
no idea what they are talking about. We think that ink is cheap enough to 
write things in full, with a well-chosen vocabulary. 


I told you then that we made a selection. I shall explain this choice in more 
detail, using a metaphor. We realized very quickly that despite introducing the 
idea of structure, which was meant to clarify and separate things, mathematics 
refused to separate into small pieces. On the other hand, it was clear that the 
old divisions, Algebra, Arithmetic, Geometry, Analysis were out of date. We 
had no respect for them and abandoned them from the start, to the fury of 
many. For example, it is well known that euclidean geometry is a special case 
of the theory of hermitian operators in Hilbert spaces. The same goes for the 
theories of algebraic curves and numbers, which come essentially from the same 
structures. I compare the old mathematical divisions with the divisions of the 
ancient zoologists, who, seeing that a dolphin and a shark or a tuna-fish were
similar animals, said: These are fish because they all live in the sea and have
similar shapes. It was quite a while before they realized that the structures of 
these animals were not at all similar, and they had to be classified very differ- 
ently. Algebra, Arithmetic, Geometry and all that nonsense compare easily to 
this. One has to look at the structure of each theory and classify it in this way. 
In spite of everything though, it does not take long to make one realize that 
despite this effort towards the isolation of structures, they have a way of mixing 
very quickly and extremely fruitfully. One could say that the great ideas in 
mathematics have come when several very different structures met. So here is 
my picture of mathematics now. It is a ball of wool, a tangled hank where all 
mathematics react one upon another in an almost unpredictable way. Unpredict- 
able, because a year almost never passes without our finding new reactions of 
this kind. And then, in this ball of wool, there are a certain number of threads, 
coming out in all directions and not connecting up with anything else. Well, the 
Bourbaki method is very simple—we cut the threads. What does this mean? 
Let us look at what remains; then we make a list of what remains and a list 
of what is eliminated. What remains: The archiclassic structures (I don’t speak 
of sets, of course), linear and multilinear algebra, a little general topology (the 
least possible), a little topological vector spaces (as little as possible), homo- 
logical algebra, commutative algebra, non-commutative algebra, Lie groups, 
integration, differentiable manifolds, riemannian geometry, differential topol- 
ogy, harmonic analysis and its prolongations, ordinary and partial differential 
equations, group representation in general, and in its widest sense, analytical 
geometry. (Here of course I mean in the sense of Serre, the only tolerable sense. 
It is absolutely intolerable to use analytical geometry for linear algebra with co- 
ordinates, still called analytical geometry in the elementary books. Analytical 
geometry in this sense has never existed. There are only people who do linear 
algebra badly, by taking coordinates and this they call analytical geometry. 
Out with them! Everyone knows that analytical geometry is the theory of ana- 
lytical spaces, one of the deepest and most difficult theories of all mathematics.) 
Algebraic geometry, its twin sister, is also included, and finally the theory of 
algebraic numbers. 

This makes an imposing list. Let us now see what is excluded. The theory of 
ordinals and cardinals, universal algebra (you know very well what that is), 
lattices, non-associative algebra, most general topology, most of topological vec- 
tor spaces, most of the group theory (finite groups), most of number theory 
(analytical number theory, among others). The processes of summation and 
everything that can be called hard analysis—trigonometrical series, interpola- 
tion, series of polynomials, etc.; there are many things here; and finally, of 
course, all applied mathematics. 

There I wish to explain myself a little. I absolutely do not mean that in 
making this distinction Bourbaki makes the slightest evaluation on the ingeniousness
and strength of theories catalogued in this way. I am convinced that
the theory of finite groups, for example, is at the present time one of the deepest
and richest in extraordinary results, while theories like non-commutative algebra
are of medium difficulty. And if I had to make an evaluation I should probably
say that the most ingenious mathematics is excluded from Bourbaki, the
results most admired because they display the ingenuity and penetration of its 
discoverer. 

We are not talking about classification then, the good on my right, the bad 
on my left—we are not playing God. I just mean that if we want to be able to 
give an account of modern mathematics which satisfies this idea of establishing 
a center from which all the rest unfolds, it is necessary to eliminate many things. 
In group theory, despite the extraordinary penetrating theorems which have 
been proved, one cannot say that we have a general method of attack. We 
have several of them, and one always has the impression that one is working 
like a craftsman, by accumulating a series of stratagems. This is not something 
which can be set forth by Bourbaki. Bourbaki can only and only wants to set 
forth theories which are rationally organized, where the methods follow natu- 
rally from the premises, and where there is hardly any room for ingenious 
stratagems. 

So, I repeat, those which Bourbaki proposes to set forth are generally 
mathematical theories almost completely worn out already, at least in their 
foundations. This is only a question of foundations, not details. These theories 
have arrived at the point where they can be outlined in an entirely rational way. 
It is certain that group theory (and still more analytical number theory) is just 
a succession of contrivances, each one more extraordinary than the last, and 
thus extremely anti-Bourbaki. I repeat, this absolutely does not mean that it is 
to be looked down upon. On the contrary, a mathematician’s work is shown 
in what he is capable of inventing, even new stratagems. You know the old 
story—the first time it is a stratagem, the third time a method. Well, I believe 
that greater merit comes to the man who invents the stratagem for the first 
time than to the man who realizes after three or four times that he can make a 
method from it. The second step is Bourbaki’s aim: to gather from the diverse 
processes used by mathematicians whatever can be shaped into a coherent 
theory, logically arranged, easily set forth and easily used. 


The work method used in Bourbaki is a terribly long and painful one, but is 
almost imposed by the project itself. In our meetings, held two or three times 
a year, once we have more or less agreed on the necessity of doing a book or 
chapter on such and such a subject (generally, we foresee a certain number of 
chapters for a book), the job of drafting it is put into the hands of the collabora- 
tor who wants to do it. So he writes one version of the proposed chapter or 
chapters from a rather vague plan. Here, generally, he is free to insert or 
neglect what he will, completely at his own risk and peril, as you will see. 
After one or two years, when the work is done, it is brought before the Bourbaki 
Congress, where it is read aloud, not missing a single page. Each proof is examined,
point by point, and is criticized pitilessly. One has to see a Bourbaki
Congress to realize the virulence of this criticism and how it surpasses by far
any outside attack. The language cannot be repeated here. The question of age 
does not come into it. The ages of the Bourbaki members vary considerably— 
later I shall tell you the maximum age limit—but even when two men have a 
20-year age difference, this does not stop the younger from hauling the elder, 
who he feels has understood nothing of the question, over the coals. One has to 
know how to take it, as one should, with a smile. In any case, the reply is never 
late in coming, no one can boast of being infallible before Bourbaki members, 
and in the end, everything works out fine, despite the very long and extremely 
animated arguments. 

Certain foreigners, invited as spectators to Bourbaki meetings, always come 
out with the impression that it is a gathering of madmen. They could not imagine
how these people, shouting, sometimes three or four at the same time,
about mathematics, could ever come up with something intelligent. It is per- 
haps a mystery but everything calms down in the end. Once the first version 
has been torn to pieces—reduced to nothing—we pick a second collaborator to 
start it all over again. This poor man knows what will happen because although 
he sets off following the new instructions, meanwhile the ideas of the Congress 
will change and next year his version will be torn to bits. A third man will start,
and so it will go on. One would think it was an unending process, a continuous 
recurrence, but in fact, we stop for purely human reasons. When we have seen 
the same chapter come back six, seven, eight, or ten times, everybody is so 
sick of it that there is a unanimous vote to send it to press. This does not mean 
that it is perfect, and very often we realize that we were wrong, in spite of all 
the preliminary precautions, to start out on such and such a course. So we 
come up with different ideas in successive editions. But certainly the great- 
est difficulty is in the delivery of the first edition. 

An average of 8-12 years is necessary from the first moment we set to work 
on a chapter to the moment it appears in the bookshop. The ones that are com- 
ing out now are the ones that were discussed for the first time about 1955. 


I said earlier that there is a maximum age limit. This was recognized quite 
quickly for the reason I was speaking about at the start of this talk—a man of 
over 50 can still be a very good and extremely productive mathematician but 
it is rare for him to adapt to the new ideas, to the ideas of people 25 and 30 years 
younger than he. Now, an enterprise like Bourbaki seeks to be permanent. There 
is no question of saying that we nail down mathematics to such or such a period. 
If the mathematics set forth by Bourbaki no longer corresponds to the trends of 
the period, the work is useless and has to be redone. This has already happened, 
for that matter, with several volumes of Bourbaki. If there were elderly mem- 
bers of Bourbaki, they would tend to put a brake on this healthy tendency, 
believing that everything being fine at the time of their youth, there is no reason 
for change. This would be disastrous. So, to avoid tensions such as this, which 
sooner or later would cause Bourbaki’s break-up, it was decided at the time
the question arose, that all the Bourbaki collaborators retire at 50.

And it is so; the present Bourbaki collaborators are all under 50. The founder- 
members, of course, retired almost ten years ago, and even those who not long 
ago were considered young are already past—or about to reach—retiring age. 
So it is a question of replacing the members who leave. How do we do that? 
Well, there are no rules, because in Bourbaki the only formal rule is the one I 
have just told you, retirement at 50. Apart from this, we can say that the only 
rule is that there are no rules. There are no rules in the sense that there is never 
a vote, we have to have unanimity on every point. Each member has the right 
to veto any chapter he feels is bad. The veto simply signifies that we do not 
allow the printing of the chapter and we have to go back and re-study it. This 
explains the lengthiness of the process—the fact that we have such a hard time 
agreeing on a final version. 

We are concerned then with replacing members affected by the age limit. 
We do not replace them formally (this would be a rule and there are no rules). 
There is no vacant seat, as with an academy. As most of the members of 
Bourbaki are professors—many in Paris—they have a chance to see at close 
range the young mathematicians, the youths who are just starting mathemati- 
cal research. A youth of value who shows promise of a great future is quickly 
noticed. When this happens, he is invited to attend one of the Congresses as a 
guinea pig. This is the traditional method. You all know what a guinea pig is— 
the small animal that we use to test all the viruses. Well, it is much the same 
thing; the wretched young man is subjected to the ball of fire which constitutes 
a Bourbaki discussion. Not only must he understand, but he must also partici- 
pate. If he is silent, he is simply not invited again. 

He must also show a certain quality. The absence of this tendency has 
stopped many great and valuable mathematicians from joining Bourbaki. Dur- 
ing a Congress, the chapters come up in the order of the day, in no particular 
order, and we never know in advance if we shall be doing only differential 
topology at this Congress, or if at the next one we shall be doing commutative 
algebra. No, everything is mixed—I cite the same example, the symbol that 
could be thought of as the Bourbaki symbol, the ball of wool. Consequently a 
Bourbaki member is supposed to take an interest in everything he hears. If he 
is a fanatical algebraist and says “I am interested in algebra and nothing else,” 
fair enough, but he will never be a member of Bourbaki. One has to take an 
interest in everything at once. Not to be capable of creating in all fields, that 
is all right. There is no question of asking everyone to be a universal mathemati- 
cian; this is reserved for a small number of geniuses. But still, one should take 
an interest in everything, and be able, when the time comes, to write a chapter 
of the treatise, even if it is not in one’s speciality. This is something which has 
happened to practically every member, and I think most of them have found it 
extremely beneficial. 

In any case, in my personal experience, I believe that if I had not been submitted
to this obligation to draft questions I did not know a thing about,
and to manage to pull through, I should never have done a quarter or even a
tenth of the mathematics I have done. When one starts to write on questions 
one does not know and if one is a mathematician, one is forced to put questions 
to oneself. This is characteristic of the mathematician. Consequently one tries 
to solve them, and this leads to personal work, independent of Bourbaki, and 
more or maybe less valuable, but which was born of Bourbaki. So one cannot 
say that this is a bad system. But there are excellent minds which cannot adapt 
to this sort of obligation, profound minds which are first-class in their field, but 
to whom one must not mention other fields. There are unbending algebraists 
who will never be made to swallow analysis, and analysts for whom the field 
of quaternions is a monstrosity. These mathematicians may be first-class mathe- 
maticians, superior to most Bourbaki members—we admit it freely and I 
could give you illustrious examples—but they could never be members of 
Bourbaki. 

To return to the guinea pig. When he is invited, we start by looking for this 
quality of adaptation. Often it is not there, so we wish him luck and he goes on 
his way. Fortunately one finds from time to time among the youths this tendency,
this appetite for universal knowledge of mathematics and adaptation
to diverse theories. After a very short time, if we find that he gives a good return, 
he becomes a member without any voting, election, or ceremony. Bourbaki, I 
repeat, has one rule, which is not to have rules, except for retirement at 50.


To end, I should like to reply to a recent attack on Bourbaki by certain 
young men of a certain country. Bourbaki is accused of sterilizing mathematical 
research. I must say that I completely fail to comprehend this, since Bourbaki 
has no pretension of being a work stimulating to research. I was saying earlier 
that Bourbaki can only allow himself to write on dead theories, things which 
have been definitely settled and which only need to be gleaned (except for the 
unexpected, of course). Actually one must never speak of anything dead in 
mathematics, because the day after one says it, someone takes this theory, 
introduces a new idea into it, and it lives again. Rather let us say theories dead 
at the time of writing, that is to say, nobody has made any significant discover- 
ies in these theories Bourbaki develops for 10, 20, or 50 years, whereas they are 
in the part judged important and central, serving as tools for research elsewhere. 
But they are not necessarily stimulants for research. Bourbaki is concerned 
with giving references and support to anyone who wants to know the essentials 
in a theory. He is concerned with knowing that when one wants to work, for 
example, on topological vector spaces there are three or four theorems one has 
to know: Hahn-Banach, Banach-Steinhaus, the closed graph; it is a question 
of finding them somewhere. But nobody has the idea of ameliorating the theo- 
rems; they are what they are, they are extremely useful (this is the fundamental 
point) so they are in Bourbaki. This is the important thing. As for stimulating 
research, if open problems exist in an old theory, obviously they are pointed 
out, but this is not the aim of Bourbaki.
The aim is, I repeat, to provide worktools, not to give stimulating speeches
on the open problems of the new mathematics, because these open problems 
are in general much farther than Bourbaki can go. This is living mathematics 
and Bourbaki does not touch living mathematics. He cannot when, by defini- 
tion, it changes each year. If one wrote a book on that, following Bourbaki’s 
method, i.e., taking eight or ten years to work it out, you can imagine the book 
after twelve years. It would represent absolutely nothing. It would have to be 
modified continually and would be like the old Encyclopedia, never finished. 

Those are the few explanations I wanted to give you. Now I shall be very 
happy to answer questions, to add to what I have said. 


Answers to questions. 


... Bourbaki sets off, if you like, from a basic belief, an unprovable meta- 
physical belief we willingly admit. It is that mathematics is fundamentally 
simple and that for each mathematical question there is, among all the possible 
ways of dealing with it, a best way, an optimal way. We can give examples where 
this is true and examples where we cannot say, because up to now we have not 
found the optimal method. 

I cited, for example, group theory and analytical number theory, which are 
characteristic. In both one has a quantity of methods, each one more clever than 
the last. This is splendid and ingenious and of a complexity never before known, 
but we are sure that this is not the final way to deal with the question. On the 
other hand, take algebraic number theory. Since Hilbert, it is so systematized 
that we know there is a right way to handle its questions. We change them some- 
times, but in the end, little by little, we manage to find one way which is better 
than the others. This is only a belief, I repeat, a metaphysical belief. 


... On foundations we believe in the reality of mathematics, but of course 
when philosophers attack us with their paradoxes we rush to hide behind 
formalism and say: “Mathematics is just a combination of meaningless sym- 
bols,” and then we bring out Chapters 1 and 2 on set theory. Finally we are left 
in peace to go back to our mathematics and do it as we have always done, with 
the feeling each mathematician has that he is working with something real. 
This sensation is probably an illusion, but is very convenient. That is Bourbaki’s 
attitude towards foundations. 


* An address before the Roumanian Institute of Mathematics, Bucharest, Oct. 1968. 


VISIBLE SHORELINES 
F. A. VALENTINE, University of California, Los Angeles 


What simple combinatorial geometric condition will enable a collection of 
swimmers to swim in straight paths to a common point on the shoreline of a 
convex island? If we assume that the sea is flat, the answer is simple although 
unexpected. It is true that if every five swimmers can swim linearly to some com- 
mon point of the shoreline, then all can swim linearly to some common point. 
As we shall see, if the boundary of the island has four, three or two vertices, then 
the number “five” can be reduced to “four.” If the boundary has at most one 
vertex, then the corresponding condition on every “three” swimmers suffices. 
Of course, we are assuming the swimmers are points. To make this precise, the 
following concepts are used, in which $E^n$ is n-dimensional Euclidean space.


DEFINITION. The interior, closure, boundary, and relative interior of a set C
in $E^n$ are denoted by int C, cl C, bd C, intv C respectively. Let $\emptyset$ denote the
empty set. A point x exterior to a closed set C in $E^n$ is said to see a point $p \in C$
via the complement of C if $xp \cap C = \{p\}$, where xp is the closed segment joining
x and p.

A vertex of a closed convex set C in $E^2$ is a point of bd C at which there exists
more than one line of support to C.

The first result is the following in $E^2$, the plane.


1. THEOREM 1. Suppose C is a bounded closed convex set in the plane $E^2$, and also
suppose S is a collection of points in the complement of C.

(a) If every five or fewer points in S can see some point of C via the complement
of C, then there exists at least one point $p \in C$ which all the points of S can see via
the complement of C.

(b) If C contains exactly four, three or two vertices, then (a) holds with the word 
“five” replaced by “four.” 

(c) If C contains at most one vertex then the statement (a) holds with the word 
“five” replaced by “three.” 

The numbers appearing in these three statements are best in the sense that there
exist appropriate sets for which these numbers cannot be reduced. 


The idea behind the proof is a combinatorial one of Helly type. To make this
clear we need to recall that Helly’s theorem [2] on the line states that a collection
of closed connected segments on a line $E^1$ has a nonempty intersection if
and only if every pair of segments has a nonempty intersection. (In this statement
a segment may degenerate to a point.) The corresponding theorem does
not hold for the boundary of a compact convex set C in $E^2$ with int $C \ne \emptyset$,
as is well known for the circle [1]. However, if we delete one point x from the
boundary of C, then a collection of bounded closed arcs of the deleted boundary
bd $C \sim \{x\}$ will have a nonempty intersection if and only if every two members
of the collection have a nonempty intersection. This follows trivially from the
natural 1-1 correspondence that can be set up between $E^1$ and bd $C \sim \{x\}$. Also
it should be noted that if $\mathfrak{M}$ is a collection of bounded closed arcs on bd C and
if some s members of $\mathfrak{M}$ have an intersection with bd C which is a nonempty
closed arc, then all the members of $\mathfrak{M}$ will have a nonempty intersection if every
$s+i$, $i \le 2$, members of $\mathfrak{M}$ have a nonempty connected intersection. To see this,
observe that s members of $\mathfrak{M}$ reduce the situation to one which is equivalent to
Helly’s theorem on $E^1$. This fact will be used to prove Theorem 1 by showing
that in (a) s turns out to be 3, in (b) s=2, and in (c) s=1. Thus, the corresponding
“Helly” numbers are “five,” “four,” and “three” for statements (a), (b), and
(c) respectively.
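
The one-dimensional statement recalled above can be checked mechanically. The following is a minimal sketch, not part of the original argument: segments are assumed to be given as pairs (a, b) with a ≤ b, and the helper names `pairwise_intersect` and `common_point` are hypothetical. Closed segments on a line have a common point exactly when the largest left endpoint does not exceed the smallest right endpoint, and this is equivalent to every pair of segments meeting.

```python
from itertools import combinations


def pairwise_intersect(segments):
    """True if every two closed segments [a, b] share at least one point."""
    return all(max(a1, a2) <= min(b1, b2)
               for (a1, b1), (a2, b2) in combinations(segments, 2))


def common_point(segments):
    """Return a point lying in every closed segment, or None if there is none."""
    left = max(a for a, _ in segments)    # largest left endpoint
    right = min(b for _, b in segments)   # smallest right endpoint
    return left if left <= right else None


# (4, 4) is a segment that has degenerated to a point.
segments = [(0, 5), (3, 8), (4, 4), (2, 6)]
print(pairwise_intersect(segments), common_point(segments))
```

For the sample family every pair meets, and the point 4 lies in all four segments.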

Proof of (a): For $x \in S$, let $C(x)$ denote the set of all points of bd C each of
which x can see via the complement of C. Since C is convex, the set $C(x)$ is
either an arc or a point of bd C. If for every pair $x_1 \in S$, $x_2 \in S$, the set $C(x_1) \cap C(x_2)$
is connected, then the modification of Helly’s theorem [1] as described above
and the hypothesis in (a) imply $\bigcap_{x \in S} C(x) \ne \emptyset$, where $\emptyset$ denotes the empty
set, and therefore (a) holds.


Hence, to complete the proof, suppose there exist points $x_1 \in S$, $x_2 \in S$ such
that $C(x_1) \cap C(x_2)$ is not connected. Since C is convex, this implies that


(1) $C(x_1) \cap C(x_2) = q \cup \widehat{p_1 p_2}$, $q \notin \widehat{p_1 p_2}$,


where q is a vertex of C and where $\widehat{p_1 p_2}$ is an arc (if $p_1 \ne p_2$) or a point (if $p_1 = p_2$)
of bd C. If $p_1 = p_2$, then the hypothesis in (a) implies that every point of S
can see a point p, where $p = q$ or $p = p_1$, so that (a) holds. Therefore, the nontrivial
case is that in which $p_1 \ne p_2$. In this case, q, $p_1$, and $p_2$ are noncollinear.
The three lines determined by q, $p_1$, $p_2$ determine seven regions in $E^2$, as
illustrated.


FIG. 1

To be precise, $V$, $V_1$, and $V_2$ are the open unbounded V-shaped regions abutting
the triangle at q, $p_1$, and $p_2$ respectively. Also $D_1$, $D_2$, and $D_0$ are the closed
three-sided unbounded sets abutting $qp_1$, $qp_2$, and $p_1p_2$ respectively. Since $p_1 \ne p_2$,
condition (1) implies that $x_1$ and $x_2$ may be reindexed so that $x_1 \in V_1$, $x_2 \in V_2$.
(Remember, $x_1$ and $x_2$ can each see q and all of arc $\widehat{p_1 p_2}$.) If for all $x \in S$ we have
$q \in C(x)$, then (a) follows. Hence, suppose there exists a point $x_3 \in S$ such that
$q \notin C(x_3)$. This and the hypothesis in (a) imply that


$C(x_3) \cap \widehat{p_1 p_2}$


is a nonempty connected subset of arc $\widehat{p_1 p_2}$. It should be observed that this nonempty
set is the intersection of three sets $C(x_1)$, $C(x_2)$, and $C(x_3)$, and this corresponds
to s=3 in our opening introductory paragraph preceding the formal
proof. Clearly if $x \in S \sim V$, then x is so situated that $C(x) \cap C(x_3) \cap \widehat{p_1 p_2}$ is connected
and nonempty. This, together with the hypothesis in (a), and the modification
of Helly’s theorem [1] mentioned above applied to $\{C(x) \cap C(x_3) \cap \widehat{p_1 p_2}, x \in S \sim V\}$
implies that

$N \equiv \bigcap_{x \in S \sim V} C(x) \ne \emptyset.$

Now, let $y \in V \cap S$. If both $p_1$ and $p_2$ belong to $C(y)$, then the hypothesis in (a),
together with the facts $x_1 \in V_1$, $x_2 \in V_2$, implies $C(y) \supset \widehat{p_1 p_2}$. If there exists a
point $y \in V \cap S$ such that $p_1 \notin C(y)$, then $p_2 = C(y) \cap N$; if $p_2 \notin C(y)$, then
$p_1 = C(y) \cap N$. These facts together with the hypothesis in (a) imply that if
$V \cap S \ne \emptyset$, then $N \cap (\bigcap_{x \in V \cap S} C(x)) \ne \emptyset$, so that

(2) $\bigcap_{x \in S} C(x) \ne \emptyset.$

Hence, (a) has been established.

To see that the number five in (a) is best, consider the closed convex set C
bounded by a pentagon $A_1B_1C_1D_1E_1$, as illustrated in Fig. 3($\alpha$). The set
$\{x_i, i = 1, \dots, 5\}$ in Fig. 3($\alpha$) is such that every four of its members can see a
point of bd C via the complement of C, yet $\bigcap_{i=1}^{5} C(x_i) = \emptyset$.

Proof of (b): (Four vertices). Designate the four vertices of bd C by $v_i$
$(i = 1, 2, 3, 4)$. If $v_i \in C(x)$ for all $x \in S$ and for a fixed value of i, then (b) follows
trivially. Hence, suppose there exist points $x_i \in S$ such that $v_i \notin C(x_i)$
$(i = 1, 2, 3, 4)$. The hypothesis in (b) implies that


(3) $M \equiv \bigcap_{i=1}^{4} C(x_i) \ne \emptyset.$


The set M has no vertices of C. Since each of the $C(x_i)$ $(i = 1, 2, 3, 4)$ is a
closed arc or point of bd C, condition (3) implies that we may reindex the
$x_i$ $(i = 1, \dots, 4)$ so that


(4) $M$ = component of $C(x_1) \cap C(x_2)$, $C(x_1) \ne C(x_2)$,

or so that

(5) $M = C(x_1)$.

If (5) holds, then since $C(x_1)$ is connected and contains no vertices of bd C, the
hypothesis in (b) and the above-mentioned modification of Helly’s theorem [1]
imply that (2) holds, and hence (b) is satisfied. So we consider the case in (4).

If $C(x_1) \cap C(x_2)$ is connected, then since M contains no vertices of bd C
Helly’s modified theorem can be applied to $\{C(x) \cap C(x_1) \cap C(x_2), x \in S\}$ to
yield (2), so that (b) holds. Hence, suppose $C(x_1) \cap C(x_2)$ is not connected. Since
M contains no vertices of bd C, the nonconnectedness of $C(x_1) \cap C(x_2)$ and the
argument that implied (1) imply there exists a vertex, say $v_3$ (without loss of
generality, since $v_1, v_2 \notin C(x_1) \cap C(x_2)$, by construction), such that
$v_3 \in C(x_1) \cap C(x_2)$, and such that


$C(x_1) \cap C(x_2) = v_3 \cup M$,


where M is given by (4).
Since M is smooth, consider the two tangents $L_1$ and $L_2$ to the extremities of
M ($L_1 = L_2$ if M is a point). See Figure 2.


FIG. 2


Also let $R_1$ and $R_2$ be the two half-lines of tangency to C at $v_3$, as illustrated,
so that the set C is contained in the closed convex region W bounded by $R_1 \cup R_2$.
Since $v_3 \notin C(x_3)$, the point $x_3$ is contained in W. Let $H_i$ be the closed half-space
bounded by $L_i$ which does not contain C. Furthermore, no points of S exist in
the complement of $H_1 \cup H_2 \cup$ convex hull$[M \cup (L_1 \cap L_2)]$, otherwise M would
contain a vertex of bd C, which violates the definition (3). This together with
the facts $x_i \in L_i$ $(i = 1, 2)$, $v_3 \notin C(x_3)$ implies that for $x \in S$ the set $C(x) \cap C(x_3)$ is
connected and nonempty. Hence, since the hypothesis in (b) implies that for all
$x \in S$, $y \in S$ we have $C(x) \cap C(y) \cap C(x_3) \ne \emptyset$ (it is also connected), Helly’s modified
theorem [1] applied to $\{C(x) \cap C(x_3), x \in S\}$ implies (2). Thus (b) is true.

If C has exactly three vertices, then the procedure is the same as that above
in which, however, we use three sets $C(x_i)$ $(i = 1, 2, 3)$ with vertices $v_i \notin C(x_i)$.

If C has exactly two vertices $v_i$ $(i = 1, 2)$, then if (2) fails, there exist sets
$C(x_i)$ $(i = 1, 2)$ such that $C(x_1) \cap C(x_2)$ is connected and such that $v_i \notin C(x_1) \cap C(x_2)$
$(i = 1, 2)$. This implies $C(x) \cap C(x_1) \cap C(x_2)$ is connected, and Helly’s modified
theorem [1] applied to $\{C(x) \cap C(x_1) \cap C(x_2), x \in S\}$ implies (b).

Figures 3$\beta$, 3$\gamma$, 3$\delta$ illustrate that the number “four” is best in statement (b)
since every three of the points $(x_1, x_2, x_3, x_4)$ can see a point of the set C via the
complement of C, yet all four cannot see a point of C via the complement of C.
The points $x_1$, $x_2$, $x_3$ can see the point 123, etc.

Proof of (c). If bd C has one vertex $v_1$, then if $v_1 \notin \bigcap_{x \in S} C(x)$ there exists a set
$C(x_1)$, $x_1 \in S$, such that $v_1 \notin C(x_1)$. Therefore $C(x_1)$ is smooth so that $C(y) \cap C(x_1)$
is connected for all $y \in S$, and Helly’s modified theorem applied to
$\{C(y) \cap C(x_1), y \in S\}$ implies (c).

If bd C is smooth (no vertices) then $C(x) \cap C(y)$ is connected for all $x \in S$,
$y \in S$, and (c) follows from Helly’s modified theorem. Thus (c) has been proved.

If C is a segment (int $C = \emptyset$), then S can clearly see an endpoint of this segment
via the complement of C. The set S consisting of the vertices of a triangle
circumscribed about a smooth convex body C shows that the number “three”
is best in statement (c).
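
The circumscribed-triangle example can be made concrete when the smooth body is taken to be the unit disk. The sketch below is only an illustration under that assumption; the helper names `visible_arc`, `sees` and `common` are hypothetical. From an exterior point at distance d from the center, the visible part of the circle is the closed arc centered on the direction of the point with half-width arccos(1/d); the endpoints are the two points of tangency.

```python
import math
from itertools import combinations


def visible_arc(x, y):
    """Arc of the unit circle seen from the exterior point (x, y),
    returned as (center_angle, half_width) in radians."""
    d = math.hypot(x, y)
    if d <= 1.0:
        raise ValueError("the viewpoint must lie outside the unit disk")
    return math.atan2(y, x), math.acos(1.0 / d)


def sees(arc, theta, tol=1e-9):
    """True if the boundary point at angle theta lies on the closed arc."""
    center, half = arc
    diff = (theta - center + math.pi) % (2 * math.pi) - math.pi
    return abs(diff) <= half + tol


# Vertices of an equilateral triangle circumscribed about the unit circle.
viewpoints = [(2 * math.cos(math.radians(a)), 2 * math.sin(math.radians(a)))
              for a in (90, 210, 330)]
arcs = [visible_arc(*p) for p in viewpoints]

# If closed arcs (none of them the whole circle) share a point, they share
# an endpoint of one of the arcs, so the endpoints suffice as candidates.
candidates = [c + s * h for c, h in arcs for s in (-1, 1)]


def common(arc_subset):
    """Candidate boundary points seen from every viewpoint in the subset."""
    return [t for t in candidates if all(sees(a, t) for a in arc_subset)]


every_pair = all(common(pair) for pair in combinations(arcs, 2))
all_three = bool(common(arcs))
print(every_pair, all_three)
```

Each pair of vertices sees the point where their common side touches the circle, while no boundary point is seen from all three, which is the situation described in the text.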

This completes the proof of Theorem 1. 


FIG. 3


2. $E^n$. The story for $E^n$, $n \ge 3$, is much more difficult, and we complete the
theory only for the case where C is a smooth compact strictly convex set.


THEOREM 2. Let C be a compact convex set in $E^n$ and suppose C is smooth and
strictly convex (i.e., C has a unique plane of support at each boundary point and the
bd C contains no line segments). Also let S be a set exterior to C.

If every n+1 or fewer points of S can see some point of bd C via the complement
of C, then there exists at least one point of bd C which all of S can see via the complement
of C.


Proof. Since C is smooth, we have int $C \ne \emptyset$. Choose an interior point O of
C as origin for a vector space for $E^n$, and consider the contraction of C, namely
$C(\lambda) = \lambda C$, where $0 < \lambda < 1$. Since C is smooth, the set $\lambda C$ is smooth. The smoothness
of C and the hypothesis that every n+1 or fewer points of S can see a point
of bd C via the complement of C imply that every n+1 points of S can be separated
from C by some hyperplane H. However, this separation need not be
strict, so that points of S can lie on H. However, for $\lambda < 1$, each set of n+1 points
of S can be strictly separated from $\lambda C$, $0 < \lambda < 1$. Let T be a collection of n+2
points selected from $S \cup C(\lambda)$. The separation property mentioned above implies
that $T \cap S$ can be strictly separated from $T \cap C(\lambda)$ by a hyperplane $H_T(\lambda)$. Since
this holds for every set of n+2 points of $S \cup C(\lambda)$, a theorem of Kirchberger [3]
(see Appendix) applied to finite subsets of $S \cup C(\lambda)$ implies that each finite subcollection
of S can be strictly separated from each finite subcollection of $C(\lambda)$.
Hence, by another theorem of Kirchberger [3] (see Appendix), S can be separated
from $C(\lambda)$ by a hyperplane $H_1(\lambda)$, not necessarily strictly, however. Let
$H(\lambda)$ be the translate of $H_1(\lambda)$ which supports $C(\lambda)$ and such that $H(\lambda)$ and $H_1(\lambda)$
do not contain $C(\lambda)$ between them. Let $x_\lambda \in C(\lambda) \cap H(\lambda)$. Since $\{x_\lambda, 0 < \lambda < 1\}$
is a bounded set, there exists a sequence $(\lambda_i, i = 1, 2, \dots)$ such that $x_{\lambda_i} \to x \in$ bd C
as $i \to \infty$. Also, since $C(\lambda) \to C$ as $\lambda \to 1$, the tangent plane H to C at x
must separate S and C. Since $H \cap C = x$, the set S can see x via the complement
of C. This completes the proof.

REMARK. The “Helly number” for Theorem 2 in which C is not strictly
smooth is still unknown. Since it is possible to see all but one of the vertices of a
regular icosahedron, the Helly number is at least 12 for compact convex sets C
in $E^3$. A similar analysis for the dodecahedron will probably yield a higher
bound for the best number. The situation in $E^n$, $n \ge 3$, is quite involved since the
intersection of two different sets of visibility on the boundary of C may have
many components. The solution for polyhedra in $E^n$ itself would be of considerable
interest.


Appendix 


THEOREM (Helly [1], [4]). Let $\mathfrak{F}$ be a family of compact convex sets in $E^n$ containing
at least n+1 members. If every n+1 members of $\mathfrak{F}$ have a point in common,
then all the members of $\mathfrak{F}$ have a point in common.


THEOREM (Kirchberger [3], [4]). Let P and Q be two finite collections of points
in $E^n$. Then P and Q can be strictly separated by a hyperplane if and only if for each
subset T of n+2 or fewer points of $P \cup Q$ there exists a hyperplane H(T) which
strictly separates $T \cap P$ from $T \cap Q$.

THEOREM (Kirchberger [3], [4]). Let P and Q be two sets in $E^n$. If for each
set T of 2n+2 or fewer points of $P \cup Q$ there exists a hyperplane H(T) which separates
$T \cap P$ from $T \cap Q$, then there exists a hyperplane which separates P from Q.


The preparation of this paper was supported in part by NSF Grant Number 8922.

AN APPLICATION OF TAUBERIAN METHODS
TO A PROBLEM IN SERIES 


DAVID DRASIN, Purdue University 


1. Introduction. Let $\{a_n\}$ $(n = 1, 2, \dots)$ be a sequence of positive numbers
and


$\psi(x) = \sum_{n \le x} a_n.$


Clearly properties of the function $\psi$ are determined by those of the $\{a_n\}$, and
conversely. For example, in a recent note Niven and Zuckerman [8] proved the 
following result: 


THEOREM A. Let $a_n$ and $\psi$ be defined as above. Furthermore assume


(1.1) $a_1 \ge a_2 \ge \cdots \ge 0.$

Then

(1.2) $\lim_{x \to \infty} \{\psi(\sigma x) - \psi(x)\}$

exists for all $\sigma > 0$ if and only if

(1.3) $\lim_{n \to \infty} n a_n$

exists.

I should like to present another proof of Theorem A in which I view (1.1) 
as a Tauberian condition which allows (1.3) to follow from (1.2). (The deriva- 
tion of (1.2) from (1.3) is more straightforward.) This proof is self-contained 
save for a very sketchy justification of Lemma 1. While no new methods are 
used, I hope that this note will make an important set of ideas (Karamata’s
theory of regularly varying functions) more available to a larger audience.

Rather than working with series, I consider a nonnegative function $f(t)$,
for $1 \le t < \infty$. To recover Theorem A from Theorem B below, just choose
$f(t) = a_n$ $(n \le t < n+1)$ for $n = 1, 2, \dots$.


THEOREM B. Let $f(t)$ be nonnegative and monotone decreasing for $1 \le t < \infty$,
and let $\Psi(x) = \int_1^x f(t)\,dt$. Then


(1.4) $\lim_{x \to \infty} \{\Psi(\sigma x) - \Psi(x)\}$


exists for each $\sigma > 0$ if and only if

(1.5) $\lim_{x \to \infty} x f(x)$

exists.


2. Preliminaries to the derivation of (1.5) from (1.4). I shall reformulate the
hypothesis (1.4), and show its Tauberian nature. Thus, let $f(t)$ be positive and
nonincreasing on $1 \le t < \infty$, and set


(2.1) $\Psi(x) = \int_1^x f(t)\,dt$, $\Phi(x) = e^{\Psi(x)}$.


The basic assumption (1.4) implies that corresponding to each $\sigma > 0$ is an $h(\sigma)$
with the property that


(2.2) $\Phi(\sigma x)/\Phi(x) \to h(\sigma)$ as $x \to \infty$.

Since $\Phi$ increases,

(A) h is monotone increasing.


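Furthermore, it follows from the equation

$\dfrac{\Phi(\sigma\tau x)}{\Phi(x)} = \dfrac{\Phi(\sigma\tau x)}{\Phi(\tau x)} \cdot \dfrac{\Phi(\tau x)}{\Phi(x)}$

that h must satisfy

(B) $h(\sigma\tau) = h(\sigma)h(\tau)$, $0 < \sigma, \tau$.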


It is not hard to see, using (A) and (B), that there must be a nonnegative number
$\rho$ with


(2.3) $h(\sigma) = \sigma^{\rho}$.

Indeed, the substitutions $\sigma = e^{x}$ and $\log h = \varphi$ transform (B) into

(B′) $\varphi(x + y) = \varphi(x) + \varphi(y)$, $-\infty < x, y < \infty$.


Since $\varphi$ increases, it has a point of continuity. It is a very important fact [4,
p. 62] that a solution of (B′) which is continuous at a point (and thus continuous
everywhere) must be of the form $\varphi(x) = \rho x$ for some real $\rho$; this is clearly the
same as (2.3).
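
Written out, the passage between (B), (B′) and (2.3) runs as follows. This is a worked restatement only, assuming (as the substitution suggests) that $\varphi(x)$ stands for $\log h(e^{x})$; the nonnegativity of $\rho$ comes from (A).

```latex
\begin{align*}
\varphi(x) &= \log h(e^{x}), \\
\varphi(x+y) &= \log h(e^{x}e^{y}) = \log h(e^{x}) + \log h(e^{y})
              = \varphi(x) + \varphi(y) && \text{by (B)}, \\
h(\sigma) &= \exp \varphi(\log \sigma) = \exp(\rho \log \sigma) = \sigma^{\rho}
              && \text{since } \varphi(x) = \rho x, \\
\rho &\ge 0 && \text{since } h \text{ is increasing by (A)}.
\end{align*}
```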

REMARK: The use of this argument in such situations also appears in Feller 
[2], [3, p. 269]. 

Equivalently, (2.3) may be written as


(2.4) $\Phi(x) = x^{\rho} L(x)$,

where, for each $\sigma > 0$,

(C) $L(\sigma x)/L(x) \to 1$ $(x \to \infty)$.


Functions satisfying (C) are called slowly-varying, and have been the subject 
of considerable interest (cf. [1], [3], [5], [6], [7], [9]). For present purposes, 
we need only the following important fact: 


LEMMA 1. A measurable function $L(x)$ is slowly varying if and only if it admits
the representation


(2.5) $L(x) = c(x) \exp\left( \int_1^x \{\epsilon(t)/t\}\,dt \right)$,


where $\epsilon(t)$ and $c(t)$ are measurable and $\epsilon(t) \to 0$ and $c(t) \to c > 0$ as $t \to \infty$.


Proof. The result is standard, so only a sketch will be given here. Since
$x^{\rho} L(x)$ increases, it is a simple exercise to verify that the limit in (C) is taken
uniformly as $x \to \infty$ in $K^{-1} \le \sigma \le K$. {In fact, the limit is taken uniformly as
$x \to \infty$ in $K^{-1} \le \sigma \le K$ even if $x^{\rho} L(x)$ is not assumed increasing; for a proof see
[7].} Now


2 Je 
Li(t)dt, 


(2.6) f “L()dt = > 7 


i-1, 


and it is not hard to use this uniformity in each of the integrals on the right side
of (2.6) to obtain


(2.7) $\int_1^x L(t)\,dt \sim x L(x)$;


details appear in [5, p. 25]. An argument due to Karamata [6, pp. 44-45] yields
(2.5) from (2.7).


REMARK. The notion of slowly-varying function was due to Karamata, who 
used (2.7) as his defining property. 
Let us now rewrite (2.4), using (2.1) and (2.5):


(2.8) $\Phi(x) = \exp\left[\int_1^x f(t)\,dt\right] = c(x)\, x^{\rho} \exp\left[\int_1^x \{\epsilon(t)/t\}\,dt\right]$
$= \{c + o(1)\} \exp\left[\int_1^x \{[\rho + \epsilon(t)]/t\}\,dt\right].$

If g is defined by

(2.9) $g(t) = t f(t) - [\rho + \epsilon(t)]$,


it is clear that (2.8) can be written equivalently as

(2.10) $\int_1^x \{g(t)/t\}\,dt \to \log c$ $(x \to \infty)$.


In particular, given $\delta > 0$, there exists an $x_0 = x_0(\delta)$ with the property that if
$y > x > x_0$, then


(D) $\left| \int_x^y \{g(t)/t\}\,dt \right| < \delta.$


3. Proof that (1.4) implies (1.5). Standard Tauberian arguments applied to
(D) will now show that $g(t) \to 0$; i.e.,


(3.1) $t f(t) \to \rho$ $(t \to \infty)$.


It is only in this stage that it is convenient to have f decreasing (but see the
Remark at the end of this note).
We first establish


(3.2) $\limsup_{t \to \infty} t f(t) \le \rho$.


If (3.2) were false, there would exist $\eta > 0$ and a sequence $t_n \to \infty$ such that

$t_n f(t_n) > \rho + \eta$ $(n = 1, 2, \dots)$.

By the monotonicity of f,

$t f(t) \ge t f(t_n) > (t/t_n)(\rho + \eta)$ $(1 \le t \le t_n)$;

hence, if k is given $(0 < k < 1)$, then

$t f(t) > k(\rho + \eta)$ $(k t_n \le t \le t_n)$.

Thus

$\int_{k t_n}^{t_n} \{g(t)/t\}\,dt \ge \int_{k t_n}^{t_n} \{[k(\rho + \eta) - \rho - \epsilon(t)]/t\}\,dt = [k(\rho + \eta) - \rho] \log(1/k) - o(1)$ $(n \to \infty)$,

but this contradicts (D), if k is chosen close enough to one.


A similar argument proves $\liminf_{t \to \infty} t f(t) \ge \rho$, and this together with (3.2)
proves (3.1) and hence also (1.5).


4. Derivation of (1.4) from (1.5). The computation in [7] is probably the 
neatest way of proving the “Abelian” part of Theorem B, but the present proof 
allows another verification. For suppose 


$\rho=\lim_{t\to\infty}tf(t)$,

and define $\epsilon(t)$ so that the function $g(t)$ introduced in (2.9) vanishes identically.
Then

$\Phi(x)=\exp\left[\int_1^x f(t)\,dt\right]=x^{\rho}\exp\left[\int_1^x\{\epsilon(t)/t\}\,dt\right]$.

If one sets

$L(x)=\exp\left[\int_1^x\{\epsilon(t)/t\}\,dt\right]$,

then $L$ clearly satisfies (C), and so $\Phi$ satisfies (2.2) with $h(\sigma)=\sigma^{\rho}$. Note that we
do not require that $f$ be decreasing for this part of the proof.
REMARK. It is left as an exercise for the reader to verify that the hypothesis
of $f$ decreasing in Theorem B may be weakened to


(4.1) $\lim_{\lambda\to 1+}\limsup_{x\to\infty}[f(\lambda x)/f(x)]\le 1$.


If $f$ satisfies (4.1), the substitution $F(x)=\log f(x)$ produces a function $F$ which
is “slowly decreasing.” Slowly decreasing functions have played important
roles in Tauberian arguments. The function


$f(t)=[\mu+\sin t]/t$, $\mu>1$


fails to satisfy (4.1), and yet


$\lim_{x\to\infty}\int_x^{\sigma x}f(t)\,dt=\mu\log\sigma$


exists for all $\sigma>0$.
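The limit in this closing example can be checked numerically. The sketch below is an illustration only and not part of the original note: it approximates the integral of $f(t)=(\mu+\sin t)/t$ over $[x,\sigma x]$ with a composite Simpson rule and compares the result with $\mu\log\sigma$. The helper names `f`, `simpson`, and `window_integral`, the step count, and the sample values $\mu=2$, $\sigma=3$, $x=1000$ are all assumptions chosen for the demonstration.

```python
import math

def f(t, mu):
    """The example f(t) = (mu + sin t) / t from the Remark."""
    return (mu + math.sin(t)) / t

def simpson(func, a, b, steps):
    """Composite Simpson rule on [a, b]; `steps` must be even."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for i in range(1, steps):
        # Odd interior nodes get weight 4, even interior nodes weight 2.
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3

def window_integral(x, sigma, mu, steps=200_000):
    """Approximate the integral of f over [x, sigma * x]."""
    return simpson(lambda t: f(t, mu), x, sigma * x, steps)

# Hypothetical sample values, chosen only for illustration.
mu, sigma, x = 2.0, 3.0, 1000.0
gap = abs(window_integral(x, sigma, mu) - mu * math.log(sigma))
print(gap)
```

Integrating $\sin t/t$ by parts shows that, for $\sigma>1$, the gap is at most $2/x$, so it shrinks as $x$ grows even though the ratio $f(\lambda x)/f(x)$ keeps oscillating.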


I wish to thank Professors R. Bojanic and D. Shea for several helpful suggestions, and the 
National Science Foundation for their support (GP-9454). 
LIFE WITHOUT $T_2$
ALBERT WILANSKY, Lehigh University 


1. Introduction. In 1955, Kelley’s famous book [4] caused some consternation
by the omission of $T_2$ from several of its definitions. Bourbaki, on the other
hand, goes so far as to make $T_2$ part of the definition of “compact.” Dealing with
non-Hausdorff spaces has the advantages of greater generality and of working
with fewer assumptions. Its (possible) disadvantages are that certain theorems
become false when $T_2$ is dropped, and the extra generality is not paid for by a
significant increase in the number of examples covered. We propose in this article
to dispose of both of these objections: we shall mention some significant
non-Hausdorff spaces, and shall give a number of systematic procedures for extending
standard theorems, or modified forms of theorems which, in their usually
presented form, deal with Hausdorff spaces.


2. Use retracts instead of closed subspaces. A retraction $r: X\to S$ is a
continuous map from $X$ to a subspace $S$ satisfying $r(s)=s$ for all $s\in S$. If there exists
such a retraction onto $S$, then $S$ is called a retract of $X$.

Example 2.1. If $(X_\alpha)$ is a family of spaces such that $\prod X_\alpha$ is normal, then each
$X_\alpha$ is normal. If “normal” is replaced by “$T_4$”, which means “normal and
Hausdorff,” this result is proved by the remark that each $X_\alpha$ is homeomorphic with a
closed subspace of $\prod X_\alpha$ (the injection map), and $T_4$ is an F-hereditary property
(i.e., inherited by closed subspaces). Without the $T_2$ assumption, the injection
of $X_\alpha$ into the product need not be closed, so this argument fails. However, the
projection $P_\alpha$ from the product onto $X_\alpha$ is a retraction (if we identify $X_\alpha$ with
its injection into the product), and normal is an r-hereditary property. (A retract
of a normal space is normal.) Thus if $\prod X_\alpha$ is normal, so is each $X_\alpha$.

Example 2.2. The list of r-hereditary properties is very long, including all
F-hereditary $T_2$ properties. It also includes all C-hereditary properties (properties
preserved by taking a continuous image). Thus if $\prod X_\alpha$ is connected, so is each
$X_\alpha$. But it also includes properties which are neither F- nor C-hereditary; one
such is that of being a Baire space. Let $X$ be the space obtained by deleting the
real irrationals from the complex plane; $X$ is a Baire space but its closed
subspace, the real rationals, is not. But Baire is r-hereditary, so if $\prod X_\alpha$ is a Baire
space, so is each $X_\alpha$.

Example 2.3. The classical theorem: “If $X$ is a Hausdorff space, the diagonal
in $X\times X$ is closed” fails for more general spaces. It may be replaced by the
following generalization: The graph of continuous $f: X\to Y$ is a retract of $X\times Y$.
(Proof: Consider $(x, y)\to(x, fx)$.) Thus we obtain: If $X$ and $Y$ are $T_2$ spaces and
$f: X\to Y$ is continuous, then its graph is closed. The first mentioned result is the
application of this to $i: X\to X$.

Using retractions is also a convenient way of proving $T_2$ theorems.

Example 2.4. (See [2], Lemma 6.11.) If $D$ is a dense subspace of a $T_2$ space
$X$, if $\phi: X\to Y$ is continuous, and if $\phi|D$ is a homeomorphism, then
$\phi[X\setminus D]\cap\phi[D]=\emptyset$, i.e., $\phi$ carries $X\setminus D$ and $D$ into disjoint sets. Proof: Let $W=\phi^{-1}[\phi D]$
and $r=(\phi^{-1}|\phi[D])\circ(\phi|W)$. Then $r$ is a retraction of $W$ onto $D$, hence $D$ is
closed and dense, so $D=W$.

Example 2.5. Let each $X_\alpha$ be a compact $T_2$ space and $X=\prod X_\alpha$. Extend each
projection $P_\alpha: X\to X_\alpha$ to $r_\alpha: \beta X\to X_\alpha$. Define $r: \beta X\to X$ as the map whose $\alpha$th
coordinate is $r_\alpha$. Then $r$ is a retraction, hence $X$ is closed and dense, so $X=\beta X$
and $X$ is compact. This is (a case of) Tychonoff’s product theorem.


3. Use closed graph instead of continuous. We call $f: X\to Y$ a CG function if
it has closed graph, i.e., the graph of $f$ is a closed subset of $X\times Y$. Since a
continuous map between $T_2$ spaces is CG, any theorem in which CG is a hypothesis
generalizes the corresponding theorem for continuous maps between $T_2$ spaces.
Moreover, the substitution restores the truth of some such theorems.

Example 3.1. If $X$ is compact, $Y$ is $T_2$, and $f: X\to Y$ is continuous, then $f$ is
closed (i.e., $f$ preserves closed sets). This fails if $Y$ is not $T_2$, but we have the
generalization: If $X$ is compact and $f: X\to Y$ is CG, then $f$ is closed. (Proved in [8],
Section 11.1, Fact ii.) The first mentioned result is a special case.

The casualty in dropping $T_2$ is the uniqueness of limits of convergent
sequences, nets, filters. However, this uniqueness is recaptured by the CG
assumption, as shown in the next example.

Example 3.2. Let $X$ and $Y$ be real linear topological spaces, and $f: X\to Y$ a
CG additive map. Then $f$ is linear. Proof: As usual, $f(rx)=rf(x)$ for all $x$, and for
rational $r$. For any real $t$, let $r\to t$ through rational values. Then $rx\to tx$ and
$f(rx)=rf(x)\to tf(x)$. Since $[rx, f(rx)]\in G$, the graph of $f$, and $G$ is closed, we have
$[tx, tf(x)]\in G$, hence $tf(x)=f(tx)$, completing the proof. The corresponding
theorem for continuous maps and $T_2$ spaces (which is, of course, a special case) may
be proved by the same argument except that uniqueness of limits is used in the
last step instead of CG.


There are ways in which CG functions are better behaved than continuous
ones. The following two remarks fail for continuous functions: If $f$ is a CG
bijection, $f^{-1}$ is also CG. (Proof: It has the same graph as $f$.) If $f: X\to Y$ is CG, then $f$
remains CG when the topologies of $X$ and $Y$ are enlarged. (Proof: A closed set
remains closed when the topology is enlarged.) The latter remark is the Closed
Graph Lemma which has numerous applications in functional analysis.
Moreover, one of the most famous operators in all mathematics is CG and not
continuous. This is $D: C^1\to C$, the differentiation operator. The relevant facts are
that if (a) $f_n\to f$ uniformly and (b) $f_n'\to g$ uniformly, then $g=f'$, i.e., (c) $f_n'\to f'$
uniformly; and that (a) alone does not imply (c).

Using CG functions is sometimes a good way to prove $T_2$ theorems.

Example 3.3. Let $X$ and $Y$ be homeomorphic locally compact $T_2$ spaces, and
let $X_1$ and $Y_1$ be the compact $T_2$ spaces obtained by adjoining one point to $X$, $Y$
respectively. Then $X_1$ and $Y_1$ are homeomorphic, with the added points
corresponding to each other. This standard result on uniqueness of one-point
compactification may be proved thus: Extend the given homeomorphism to a
bijection $f: X_1\to Y_1$ in the obvious way. It is easy to check that $f$ is CG. Hence, by
the “Little closed graph theorem” ([8], Section 11.1, Fact iv), $f$ is continuous.
Similarly $f^{-1}$ is continuous.


We remark that a category in which the morphisms are CG functions must 
be restricted; for example the identity map is not CG unless its domain is Haus- 
dorff and the composition of CG functions need not be CG. (See [8], Section 
11.2, Problem 17.) 


4. Use locally compact instead of compact. Our intention here is that a 
locally compact space is one which has a local base of compact neighborhoods of 
each point. A compact space need not be locally compact; (see [9]) but this need 
disturb us no more than the fact that a connected space need not be locally 
connected. 

Example 4.1. Consulting [4], p. 223, Theorem 5, we find the standard the- 
orem which states that if F is a family of continuous maps from a regular or a 
Hausdorff space X to a topological space Y, and C is the compact open topology,
then C is the smallest topology which is jointly continuous on compact sets. 
There also we find that half of the theorem remains true without separation, 
namely, for arbitrary X, the topology C is smaller than every topology which is 
jointly continuous on compact sets. We add the remark that the other half of the 
theorem can be similarly reclaimed in the form: For arbitrary X, the topology C 
is jointly continuous on locally compact sets. (The proof given in [4] can be 
easily modified to cover this.) The standard theorem follows immediately for 
any space X in which compact subsets are locally compact. 


5. Assume regular instead of $T_2$. Students of [4] will be familiar with this
suggestion. For regular spaces compact implies locally compact, locally compact 
implies second category, compact implies normal and completely regular. All 
uniform spaces (in particular topological groups and vector spaces) are regular. 

Example 5.1. Let $X$ be regular, then $X^+$ is regular if and only if $X$ is locally
compact. ($X^+$ is the one point compactification of $X$.) The standard theorem
([4], p. 150), that if $X$ is $T_2$, then $X^+$ is $T_2$ if and only if $X$ is locally compact,
follows immediately. (For example, if $X$ is locally compact $T_2$, then $X^+$ is regular
and $T_1$, hence $T_3$, hence $T_2$.)

Example 5.2. A regular second countable space is semimetrizable. (A
semimetric is like a metric except that $d(x, y)=0$ for $x\ne y$ is allowed.) Also
(Nagata-Smirnov Theorem) a regular space is semimetrizable if and only if it has a
$\sigma$-locally finite base. The usual proofs may be easily modified by replacing
embedding by the weak topology of a countable family of maps into a metric space.
(These results fail for normal spaces, however, since a normal space need not be
regular.)


6. Assume KC instead of $T_2$. A KC space is one in which all compact sets
are closed. (See [9].)

Example 6.1. Let $T$ be a compact $T_2$ topology for a set $X$. Then $T$ is maximal
among compact topologies for $X$. This result (with precisely the same proof)
holds if $T_2$ is replaced by KC. But now we have the advantage that the converse
holds, i.e., a compact topology is maximal among compact topologies if and
only if it is KC. (See [6], Theorem 2.)

Example 6.2. In the absence of $T_2$ one may have compact sets whose closures
and intersections are not compact. See [3] for some extreme pathology. Those 
who are made uncomfortable by such examples may work in KC spaces since 
such spaces do not allow them. 

7. Assume symmetric instead of $T_2$. A symmetric space is one in which
$x\in\overline{\{y\}}$ implies $y\in\overline{\{x\}}$. For symmetric spaces $T_0$ implies $T_1$, normal implies
regular, normal $T_0$ implies $T_4$.


8. Modify conclusions of theorems. This is done in Examples 5.1, 5.2. 

Example 8.1. The result that a finite dimensional Hausdorff linear space has 
a unique topology is replaced by: Every finite dimensional linear topological 
space has its topology uniquely determined by the closure of {0}. (See [8],
Section 10.6, Theorem 1.) 

Example 8.2. The Mackey-Arens Theorem goes through unchanged without 
separation assumptions. (See [5], Theorem 18.8.) The crucial step that a certain 
compact set is closed ([5], p. 143, line 7) is carried out in an associated Haus- 
dorff space. 


9. Modify definitions. Neighborhoods of $\infty$ in the one point compactification
$X^+$ must be defined as complements of compact closed sets (rather than
just compact sets). Also, for certain purposes $\sigma$-compact and hemi-compact must
be replaced by $\sigma$-(compact closed) and hemi-(compact closed); for example,
$\{\infty\}$ is a $G_\delta$ in $X^+$ if and only if $X$ is $\sigma$-(compact closed), and $X^+$ is first
countable at $\infty$ if and only if $X$ is hemi-(compact closed). For KC spaces no such
modification is needed since compact sets are already closed. This remark applies
to the rest of this section also. Paracompactness may successfully ask for
regularity rather than $T_2$ in its definition. For example the famous theorem of A. H.
Stone says that a semimetric space is paracompact. (See [4] p. 160, Corollary 35.)

Example 9.1. Let $P$ be a topological property. A $P$-generated space $X$ is one
in which a subset $S$ is closed if $S\cap K$ is closed in $K$ for all sets $K$ which have
property $P$. For $P$ = compact, a $P$-generated $T_2$ space is called a k-space.
Steenrod [7] makes a compelling case for doing topology in the category of Hausdorff
k-spaces and continuous maps. He points out that a regular open subspace of a
k-space is a k-space; actually it turns out that every open subspace of a k-space
is a k-space. This is immediate from [1], Theorems 11.9.4 and 6.2.1.

The category of KC k-spaces and continuous maps is convenient for some
purposes; for example (see [9], Theorem 5), it admits one point compactification
(not as a functor, of course); but it is not offered as a serious alternative
since, for example, the product of a compact KC space with itself is not KC
unless the space is $T_2$. In [4], the definition of k-space for non-$T_2$ spaces is
obtained by taking $P$ = compact closed. The following advantages of retaining the
original definition, $P$ = compact, were pointed out to me by J. W. Taylor: a first
countable or locally compact space must be a k-space in this sense (compare
[4], p. 231, Theorem 13); a space in which each compact subset is locally
compact is a k-space if and only if it is a quotient of a locally compact space
(compare [1], Theorem 11.9.4); a quotient of any k-space is a k-space.


10. Specialize when necessary. In some situations $T_2$ may be needed or
convenient. In such cases one specializes; but this is no reason to use $T_2$ as a blanket
assumption from the beginning, any more than prospective use of the Čech
compactification justifies dealing only with completely regular spaces from the
beginning.


11. Examples. Some examples of useful non-Hausdorff topologies are: the 
topology induced by a non-separating family of seminorms on a vector space, 
the one point compactification (see [9]), the Zariski topology used by algebraic 
geometers. 
ON DIVIDING A SQUARE INTO TRIANGLES


Paul Monsky, Brandeis University and Kyoto University


Some time ago in this MONTHLY, Fred Richman and John Thomas [1] asked
the following puzzling question:

Can a square $S$ be divided into an odd number of nonoverlapping triangles $T_i$,
all of the same area?

In [2], the answer was shown to be no, provided $S=[0, 1]\times[0, 1]$ and the
coordinates of the vertices of the $T_i$ are rational numbers with odd denominators.
In this note we shall show that the answer is always no. In fact we shall prove
the following more general result.

Suppose that $S=[0, 1]\times[0, 1]$ is divided into $m$ nonoverlapping triangles $T_i$;
let $a_i$ = area $T_i$. Then there is a polynomial $f$ with integer coefficients such that
$f(a_1, \cdots, a_m)=1/2$.

There are two parts to the proof: one combinatorial, the other valuation 
theoretic. The combinatorial argument generalizes an argument made in [2]. 
By itself it may be made to prove the desired result when the vertices of the T; 
all have rational coordinates. But to handle the case of arbitrary vertices, it 
becomes necessary to argue with “congruences mod 2 in the reals.” This is where 
valuation theory comes in; we make use of absolute values on the reals extending 
the 2-adic absolute value of the rationals. In the course of the proof the the- 
orem of the extension of valuations plays a remarkable and unexpected role. 

We begin with the combinatorial argument. Let R be a region in the plane 
bounded by a simple closed polygon. Suppose R is divided into m nonoverlapping 
triangles T;. By a vertex we shall mean a vertex of some T;, by a face a face of 
some T; or of R. Two vertices are called adjacent if they are in the same face and 
the line segment joining them contains no other vertices. A basic segment is a 
line segment joining two adjacent vertices. Note that the boundary of each T; 
is a union of nonoverlapping basic segments; the same is true of the boundary 
of $R$. Suppose now that the vertices are divided into three disjoint sets, $A$, $B$,
and $C$. We shall say that a face or a basic segment is of type $AB$ if it has one
endpoint in $A$ and one in $B$.


LEMMA. Suppose that no face contains vertices of all three types and that $R$
has an odd number of faces of type $AB$. Then some $T_i$ has vertices of all three types.


To prove the lemma note the following. A face of type $AB$ contains an odd
number of basic segments of type $AB$, while a face not of type $AB$ contains an
even number of basic segments of type $AB$. (Use the fact that a face contains
vertices of at most two types to prove this.) Suppose that no $T_i$ has vertices of
all three types. Then each $T_i$ has either 0 or 2 faces of type $AB$. Hence the
boundary of $T_i$ contains an even number of basic segments of type $AB$. Similarly,
the boundary of $R$ contains an odd number of basic segments of type $AB$. But
this is impossible; in an obvious sense the boundary of $R$ is congruent to the
sum of the boundaries of the $T_i$ modulo 2.

We now come to the valuation theoretic part of the proof and need to
introduce some further terminology. Let $K$ be a field. By an ultranorm (sometimes
called a non-Archimedean absolute value) on $K$ we mean a function $\|\ \|$ from $K$
to the nonnegative real numbers satisfying:


(1) $\|xy\|=\|x\|\cdot\|y\|$
(2) $\|x+y\|\le\max(\|x\|, \|y\|)$
(3) $\|x\|=0\Leftrightarrow x=0$.
We can easily prove that $\|1\|=\|-1\|=1$, and that equality holds in
equation (2) unless $\|x\|=\|y\|$.

As an example let $K$ be the field of rational numbers. Any $x\ne 0$ in $K$ may be
written as $2^t(r/s)$, where $r$ and $s$ are odd integers and $t$ is an integer. Set
$\|x\|=(1/2)^t$. In this way we get an ultranorm on the rationals in which $\|2\|<1$. This
ultranorm is known as the 2-adic absolute value. The more general fact that we
shall need is this: There is an ultranorm on the field of real numbers (or more
generally on any extension of the rational numbers) such that $\|2\|<1$. This follows
from the theorem of the extension of valuations whose proof may be found in
many places; for example see [3].

Granting the above facts we may argue as follows. Choose an ultranorm on
the reals for which $\|2\|<1$. Divide the points of the plane into three sets in the
following way:


(1) $(x, y)$ is in $A$ if $\|x\|<1$ and $\|y\|<1$,
(2) $(x, y)$ is in $B$ if $\|x\|\ge 1$ and $\|x\|\ge\|y\|$,
(3) $(x, y)$ is in $C$ if $\|y\|\ge 1$ and $\|y\|>\|x\|$.


Suppose now that $P=(x, y)$ and $P'=(x', y')$ are points and that $P'$ is a
translate of $P$ by a point of type $A$; in other words, that both $\|x'-x\|<1$ and
$\|y'-y\|<1$. Then $P$ and $P'$ have the same type. If $P$ is of type $A$ this is obvious.
If $P$ is of type $B$, then $\|x'\|=\|x\|\ge 1$, while $\|y'\|\le\max(1, \|y\|)\le\|x\|=\|x'\|$; so
$P'$ is of type $B$ too. If $P$ is of type $C$ the argument is similar.

It is now easy to see that a line $L$ cannot contain points of all three types.
For by translating a point of type $A$ on $L$ to the origin we may assume that
$(0, 0)$ is on $L$. Let $(x, y)$ and $(x', y')$ be points of $L$ of types $B$ and $C$. Then
$\|x\|\ge\|y\|$, $\|y'\|>\|x'\|$, and $\|xy'\|>\|x'y\|$. This is absurd as $xy'=x'y$. Note also
that if a triangle $T$ has vertices of all three types then $\|\text{area } T\|>1$. For we may
assume that the vertex of $T$ of type $A$ is $(0, 0)$. Let $(x, y)$ and $(x', y')$ be the
vertices of types $B$ and $C$. Then area $T$, up to sign, is equal to $\frac{1}{2}(xy'-x'y)$. But
$\|xy'\|>\|x'y\|$. So $\|\text{area } T\|=\|\frac{1}{2}\|\cdot\|xy'\|=\|\frac{1}{2}\|\cdot\|x\|\cdot\|y'\|>1$.

Suppose now that $S=[0, 1]\times[0, 1]$ is divided into $m$ nonoverlapping
triangles $T_i$ each of area $1/m$. Obviously $S$ has exactly one face of type $AB$; by
the lemma, some $T_i$ has vertices of all three types. By the paragraph above,
$\|\text{area } T_i\|=\|1/m\|>1$. So $m$ is even. (Note that if all vertices have rational
coordinates, then we can argue directly with the 2-adic absolute value of the
rationals, and avoid the theorem of the extension of valuations; this is
essentially what was done in [2].)
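For rational coordinates the whole classification can be carried out by machine. The following is a minimal sketch, assuming rational input only, of the 2-adic absolute value and the three point types (called A, B, C here: both norms below 1; norm of x at least 1 and at least that of y; norm of y at least 1 and exceeding that of x). The function names `two_adic_norm`, `point_type`, and `area` are illustrative choices and do not come from the paper.

```python
from fractions import Fraction

def two_adic_norm(q):
    """2-adic absolute value: (1/2)**t when q = 2**t * (r/s) with r, s odd."""
    q = Fraction(q)
    if q == 0:
        return Fraction(0)
    t, num, den = 0, q.numerator, q.denominator
    while num % 2 == 0:   # factors of 2 in the numerator raise t
        num //= 2
        t += 1
    while den % 2 == 0:   # factors of 2 in the denominator lower t
        den //= 2
        t -= 1
    return Fraction(1, 2) ** t

def point_type(x, y):
    """Classify a rational point into one of the three types A, B, C."""
    nx, ny = two_adic_norm(x), two_adic_norm(y)
    if nx < 1 and ny < 1:
        return "A"
    if nx >= 1 and nx >= ny:
        return "B"
    return "C"   # here ny >= 1 and ny > nx

def area(p, q, r):
    """Unsigned area of the triangle with vertices p, q, r."""
    (x1, y1), (x2, y2), (x3, y3) = p, q, r
    return abs(Fraction((x2 - x1) * (y3 - y1) - (x3 - x1) * (y2 - y1)) / 2)

# Corners of the unit square, listed counterclockwise from the origin.
corners = [(0, 0), (1, 0), (1, 1), (0, 1)]
corner_types = [point_type(x, y) for x, y in corners]
print(corner_types)
```

Reading the corner types in order, the four sides of the square have types AB, BB, BC, and CA, so exactly one side is of type AB, as the argument requires. A triangle with one vertex of each type, such as the one with vertices (0, 0), (1, 0), (0, 1), has area 1/2, whose 2-adic absolute value is 2; by contrast 1/3 has absolute value 1, which is why an odd number of equal-area pieces is ruled out.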

Finally, we indicate the proof of the more general theorem mentioned at
the beginning of this paper. Let $A$ be the ring $Z[a_1, \cdots, a_m]$. If 2 generates the
unit ideal in $A$, then $1=2f(a_1, \cdots, a_m)$ and we are done. If $2A\ne A$, then 2 is
contained in a height 1 prime ideal $P$ of $A$. The integral closure of the local ring
of $P$ on $A$ is a discrete valuation ring. This ring gives rise to an ultranorm on the
quotient field of $A$ such that $\|a_i\|\le 1$, while $\|2\|<1$. Extend this ultranorm to
the reals, and use it to subdivide the plane into points of three types as above.
Then, as above, some $T_i$ has vertices of all three types, and $\|a_i\|=\|\text{area } T_i\|>1$,
a contradiction. (By using valuation rings instead of ultranorms we could
simplify the proof a little.)

The above proof is not so wildly nonconstructive as it first appears. For the 
entire argument is carried out in the field generated by the coordinates of the 
vertices. So it is only necessary to extend our ultranorm from Q to this finitely 
generated field, not to the entire field of real numbers. 
SYLVESTER’S PROBLEM ON COLLINEAR POINTS AND A RELATIVE


G. D. CHAKERIAN, University of California, Davis 


In 1893, J. J. Sylvester posed the following question in the Educational
Times: Given a finite set of points in the plane, not all lying on a straight line, must
there be a line containing exactly two of the points?

This question was not settled until after 1930, when T. Gallai proved the
affirmative answer. Many proofs have been given since that time, some sparked
by a restatement of the problem by P. Erdős [3] in this MONTHLY. V. C.
Williams [9] recently gave a proof in this MONTHLY. Extensive references to the
literature which has grown up around the problem can be found in [2], [4],
and [5].

If we settle upon the projective plane as an appropriate setting and state the 
problem in its dual form, we obtain: Given n lines in the projective plane, not all 
concurrent, must there be a point lying on exactly two of the lines? 

Thinking of the projective plane as a sphere with antipodes identified, and 
then immediately observing that the identification contributes nothing to the 
problem, we obtain the following equivalent formulation of Sylvester’s problem: 
Given n great circles on a sphere, not all concurrent, must there be a point lying on 
exactly two of the great circles? 

In his exquisite paper on zonohedra, H. S. M. Coxeter [1] observes in passing 
that in this context the positive answer to Sylvester’s question follows imme- 
diately from the fact that there exists no map on the sphere (each of whose 
countries has at least 3 sides) such that all vertices have valence 6 or more. 
For a set of $n$ great circles such that no point lies on exactly two circles, would
form a map on the sphere such that each vertex has valence at least 6. 

Here is a short proof of this fact about maps. Suppose we are given a map 
on the sphere such that each face (country) has 3 or more sides. Suppose each 
vertex had valence at least 6 (the valence of a vertex is the number of edges 
emanating from that vertex). If V denotes the total number of vertices, E the 
number of edges, and F the number of faces, then we have Euler’s relation, 


(1) $V-E+F=2$.

Let $F_k$ denote the number of $k$-sided faces of the map. Counting the number of
edges around each face and summing, we obtain


(2) $2E=3F_3+4F_4+\cdots\ge 3F$,


where we have used the assumption that $F_2=0$. Let $V_k$ be the number of
vertices of degree $k$. Counting the number of edges around each vertex and
summing, we get


(3) $2E=6V_6+7V_7+\cdots\ge 6V$,
where we use the assumption that $V_k=0$ for $k\le 5$. Now (1), (2), and (3) imply
$6(E+2)=6(V+F)\le 2E+4E=6E$,


a contradiction. Hence there must be vertices of valence less than 6.

The above solution of Sylvester’s problem, which does not appear to be as 
well known as some others, was offered by N. E. Steenrod in response to the 
problem of Erdős [3]. This approach is used in [4] to obtain a number of other
results on configurations of points and lines. 

The relative of Sylvester’s problem referred to in the title is the following: 
Given a finite set of points in the plane, each colored either red or blue and not all 
collinear, must there be a “monochromatic line”? That is, must there be a line, containing
at least two points of the given set, such that all points of the set on that line
are the same color? 
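Both questions are easy to explore by brute force on small configurations. The sketch below is an illustration under stated assumptions, not a method from the article: points are distinct pairs of integers, a line is represented by the set of given points lying on it, and the names `collinear`, `determined_lines`, `ordinary_lines`, and `monochromatic_lines` are hypothetical helpers introduced here.

```python
from itertools import combinations

def collinear(p, q, r):
    """True when r lies on the line through the distinct points p and q."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0]) == 0

def determined_lines(points):
    """All lines through at least two of the points, each as a frozenset of points."""
    lines = set()
    for p, q in combinations(points, 2):
        lines.add(frozenset(r for r in points if collinear(p, q, r)))
    return lines

def ordinary_lines(points):
    """Lines containing exactly two of the points (Sylvester's question)."""
    return [line for line in determined_lines(points) if len(line) == 2]

def monochromatic_lines(coloring):
    """Lines whose points all share one color; `coloring` maps point -> color."""
    return [line for line in determined_lines(list(coloring))
            if len({coloring[p] for p in line}) == 1]

# A 3 x 3 grid, colored like a checkerboard (an arbitrary test configuration).
grid = [(x, y) for x in range(3) for y in range(3)]
checkerboard = {p: "red" if (p[0] + p[1]) % 2 == 0 else "blue" for p in grid}
print(len(ordinary_lines(grid)), len(monochromatic_lines(checkerboard)))
```

For the grid, the line through (0, 0) and (1, 2) meets no third grid point, and the main diagonal is entirely red under the checkerboard coloring, so both lists are nonempty, in agreement with the two theorems. A set of collinear points, which both statements exclude, yields no ordinary line.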

T. S. Motzkin and M. Rabin ([6], [7]) have proved this is so. The problem 
is attributed to R. Graham and D. Newman. We shall give another proof, 
similar in spirit to Steenrod’s solution of Sylvester’s problem, using the follow- 
ing topological lemma of A. L. Cauchy. 


CAUCHY’S LEMMA. Given a map on a sphere such that each face has at least 3
sides, it is not possible to label the edges with +’s and −’s in such a way that the
number of sign changes around each vertex is at least 4.


To obtain “the number of sign changes around a vertex” we run once around 
that vertex, counting the number of times we have a sign change (from + to 
— or vice versa) as we traverse the edges emanating from that vertex. In other 
words, a “sign change” is simply a pair of adjacent edges labeled with opposite 
signs. Cauchy’s lemma is valid even if some edges are allowed to remain un- 
labeled. The lemma is needed in that more general form in order to prove 
Cauchy’s famous rigidity theorem: a convex polyhedron cannot be deformed into
another noncongruent convex polyhedron without stretching or tearing its surface.

For a proof of this theorem, and Cauchy’s lemma in its more general form, 
the reader may consult [8]. We now give a proof of Cauchy’s lemma as stated 
above. 

Suppose we have a map on a sphere such that each country has at least 
3 sides. Suppose the edges have been labeled in such a fashion that the number 
of sign changes around each vertex is at least 4. Let $N$ be the sum of all the sign
changes around all the vertices of our map. In other words, run around each
vertex once, counting the sign changes, and sum over all vertices; the result
is $N$. Since the number of sign changes around each vertex is assumed to be at
least 4, we have


(4) $N\ge 4V$.


A bit of reflection shows that if we run around the edges of each face, counting 
the number of sign changes as we go once around that face, and sum over all 
faces, then the result is also N. (Note that what we are doing here is quite 
analogous to what we did in deriving (2) and (3). But instead of counting edges, 
we are counting pairs of oppositely labeled adjacent edges.) Since the total 
number of sign changes, as we run around a k-sided face, is even and at most k, 
we have, 


(5) $N\le 2F_3+4F_4+4F_5+6F_6+6F_7+\cdots$.


Using (1) and (2) (which are still valid for our map) and then (5) and (4), we 
have, 


$4V-8=4E-4F=2(3F_3+4F_4+\cdots)-4(F_3+F_4+\cdots)$
$=2F_3+4F_4+6F_5+\cdots\ge N\ge 4V$,


a contradiction, and the proof is complete. 

In order to answer the question of Graham and Newman, we begin, as with 
Sylvester’s problem, by dualizing and transferring the problem onto a sphere. 
Then what we want to prove is: Given $n$ great circles on a sphere, each colored
either red or blue and not all concurrent, there is a “monochromatic point.” That is,
there is a point lying on at least two of the circles and such that all circles through
that point are the same color.

To prove this, label each edge of the resulting map on the sphere with a 
“+” or a “—,” depending on whether it lies on a red circle or a blue circle. If 
there were no monochromatic points, each vertex of the map would have both 
red circles and blue circles passing through it; hence there would clearly be at 
least 4 sign changes around each vertex. Since each face of our map has at least 
3 edges, Cauchy’s lemma shows this is impossible, so there exist monochromatic 
points. 
ON NORMALIZED SCHAUDER BASES


T. A. Cook, University of Massachusetts


In 1948, Karlin [1] proved that in Hilbert space a normalized Schauder 
basis is an orthonormal basis. His proof is somewhat involved and in this note 
an extremely simple proof is offered. 

A Schauder basis in a Banach space $E$ is a sequence $\{x_n\}$ of vectors in $E$
together with a sequence of continuous linear functionals $\{f_n\}$ such that
$f_k(x_j)=\delta_{kj}$ and such that each $x$ in $E$ has the representation


$x=\sum_{k=1}^{\infty}f_k(x)x_k$.


It is understood that the partial sums of the above series converge in norm
to $x$. (The reader is reminded that this is equivalent to saying that each $x\in E$
has a unique representation as $x=\sum_{i=1}^{\infty}a_ix_i$; see, e.g., [2, pp. 86-89].) A
Schauder basis is called normalized provided $\|x_n\|=\|f_n\|=1$ for each $n$.


THEOREM. Each normalized Schauder basis in Hilbert space is an orthonormal
basis.


Proof. If $k\ne j$, we must show that $x_k$ is orthogonal to $x_j$; i.e., $(x_k, x_j)=0$, where
$(x, y)$ is the inner product in the Hilbert space. Suppose for a particular $k$ and $j$
that $(x_k, x_j)\ne 0$. Then there is a unit vector $e$ in the span of $x_k$ and $x_j$ which is
orthogonal to $x_k$. Consequently, $|(x_j, e)|>0$. Now, $x_j=(x_j, e)e+(x_j, x_k)x_k$ and
$1=\|x_j\|^2=|(x_j, e)|^2+|(x_j, x_k)|^2$. Thus $1>|(x_j, e)|$. Hence, $1=|f_j(x_j)|=|(x_j, e)|\,|f_j(e)|$
and $|f_j(e)|=1/|(x_j, e)|>1$. But this denies that $\|f_j\|=1$. Consequently,
$x_k$ must be orthogonal to $x_j$.


This research was supported in part by the National Science Foundation Grant GP-5952. 
AN IDENTITY


J. SHEEHAN, University of Waterloo 


Let $n$ be a positive integer. Let $[j]=(1^{j_1}2^{j_2}\cdots n^{j_n})$ denote a partition
of $n$, where $1j_1+2j_2+\cdots+nj_n=n$.


THEOREM.


$\frac{1}{n}=\sum_{[j]}(-1)^{j_1+j_2+\cdots+j_n+1}\,\frac{(j_1+j_2+\cdots+j_n-1)!}{j_1!\,j_2!\cdots j_n!}$,


where the summation is over all partitions of $n$.


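Proof. Let $I(x^n)[\cdots]$ mean “the coefficient of $x^n$ in $[\cdots]$.” Then


$\frac{1}{n}=I(x^n)\left[x+\frac{x^2}{2}+\cdots+\frac{x^n}{n}\right]$
$=I(x^n)[\log(1-x)^{-1}]$
$=I(x^n)[\log\{1+(x+x^2+\cdots+x^n)\}]$
$=I(x^n)\left[(x+x^2+\cdots+x^n)-\frac{1}{2}(x+x^2+\cdots+x^n)^2+\cdots+(-1)^{n+1}\frac{1}{n}(x+x^2+\cdots+x^n)^n\right]$
$=\sum_{[j]}(-1)^{j_1+j_2+\cdots+j_n+1}\,\frac{(j_1+j_2+\cdots+j_n-1)!}{j_1!\,j_2!\cdots j_n!}$.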


(Note that 0! is defined to be equal to 1.) 
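The identity lends itself to a direct check with exact rational arithmetic. The sketch below is an illustration, assuming the reading of the theorem in which a partition with $m=j_1+\cdots+j_n$ parts contributes $(-1)^{m+1}(m-1)!/(j_1!\cdots j_n!)$; the names `partitions` and `partition_sum` are hypothetical helpers introduced for this check.

```python
from fractions import Fraction
from math import factorial

def partitions(n, largest=None):
    """Yield the partitions of n as nonincreasing lists of positive parts."""
    if largest is None:
        largest = n
    if n == 0:
        yield []
        return
    for k in range(min(n, largest), 0, -1):
        for rest in partitions(n - k, k):
            yield [k] + rest

def partition_sum(n):
    """Sum of (-1)**(m+1) * (m-1)! / (j_1! ... j_n!) over all partitions of n."""
    total = Fraction(0)
    for parts in partitions(n):
        m = len(parts)                      # m = j_1 + ... + j_n
        denominator = 1
        for k in set(parts):
            denominator *= factorial(parts.count(k))   # j_k! for each part size k
        total += Fraction((-1) ** (m + 1) * factorial(m - 1), denominator)
    return total

for n in range(1, 9):
    print(n, partition_sum(n))
```

For $n=4$, for instance, the five partitions contribute $1$, $-1$, $-\frac{1}{2}$, $1$, and $-\frac{1}{4}$, which sum to $\frac{1}{4}$.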


I am indebted to the referee for pointing out to me that the above identity is also a consequence 
of inverse relations for Bell polynomials [1, p. 174]. 


Reference 


1. J. Riordan, An introduction to combinatorial analysis, a Wiley publication in Mathematical 
Statistics, Chapman and Hall, Ltd., London, 1958. 


ON THE CIRCLE GROUP OF A NILPOTENT RING 


R. L. Kruse, Sandia Laboratories, Albuquerque 


It is well known [1] that the Jacobson radical of a ring forms a group under 
the circle composition 
$x\circ y=x+y+xy$.
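A small nilpotent ring makes the circle composition concrete. The following sketch is an illustration chosen here, not an example from the note: it takes the ring of strictly upper triangular 3 x 3 matrices over the integers mod $p$ (with $p=3$ as an arbitrary choice), stores each matrix as the triple of its entries in positions (1,2), (2,3), (1,3), and verifies by brute force that $x\circ y=x+y+xy$ gives a group. The names `RING`, `add`, `mul`, and `circle` are hypothetical.

```python
from itertools import product

P = 3  # an assumed small prime, chosen only for illustration

# Each ring element is (a, b, c): the entries in positions (1,2), (2,3), (1,3).
RING = list(product(range(P), repeat=3))
ZERO = (0, 0, 0)

def add(x, y):
    """Entrywise addition mod P."""
    return tuple((u + v) % P for u, v in zip(x, y))

def mul(x, y):
    """Matrix product; only the (1,3) entry can be nonzero, and it equals a * b'."""
    return (0, 0, (x[0] * y[1]) % P)

def circle(x, y):
    """Circle composition x o y = x + y + xy."""
    return add(add(x, y), mul(x, y))

is_associative = all(circle(circle(x, y), z) == circle(x, circle(y, z))
                     for x in RING for y in RING for z in RING)
has_identity = all(circle(ZERO, x) == x == circle(x, ZERO) for x in RING)
has_inverses = all(any(circle(x, y) == ZERO == circle(y, x) for y in RING)
                   for x in RING)
center = [z for z in RING if all(circle(z, g) == circle(g, z) for g in RING)]
print(is_associative, has_identity, has_inverses, len(center))
```

The map $x\mapsto 1+x$ turns $\circ$ into ordinary matrix multiplication, since $(1+x)(1+y)=1+(x\circ y)$, so this circle group is the group of unitriangular matrices: it has order $p^3$, is non-Abelian, and has a center of order $p$.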


It would be of interest to characterize the groups which can occur as circle 
groups. In the finite case it is sufficient to study p-groups. To see this, observe 
that the primary decomposition of the additive group of a finite ring induces a 
ring direct sum; hence the circle group of a finite radical ring is the direct
product of its Sylow subgroups.

A fundamental property of a finite p-group G is that G possesses a central 
series, that is, a series of normal subgroups 


\[ G = Z_0 \supseteq Z_1 \supseteq \cdots \supseteq Z_c = 1 \]


such that $Z_{i-1}/Z_i$ is in the center of $G/Z_i$ for $1 \le i \le c$. The length c of the
shortest such series is called the class of G. Clearly each group G of class 1
(that is, each Abelian group) occurs as the circle group of the ring with additive
group G and all products 0. In [2] L. Kaloujnine gives a construction which
shows that any p-group of class 2, where p is an odd prime, occurs as a circle
group. In the other direction, this note obtains a restriction on the central
series of p-groups which can occur as circle groups. A direct consequence of this
result is that a group of order $p^n$ which occurs as a circle group can have class at
most $(n+1)/2$. An example is presented which shows that this bound is the
best possible.

If S and T are subgroups of a group [subrings of a ring], with $T \subseteq S$, then
[S:T] will denote the index of T in S [the index of T in S considered as additive
subgroups of the ring]. The letter p will always denote a prime.


THEOREM. Let G be a p-group. If G occurs as the circle group of a nilpotent
ring, then G has a central series

\[ G = Z_0 \supseteq Z_1 \supseteq \cdots \supseteq Z_c = 1 \]

in which $[Z_{i-1} : Z_i] \ge p^2$ for $1 \le i < c$.


REMARK 1. Note that the theorem makes no assertion concerning
$[Z_{c-1} : Z_c] = |Z_{c-1}|$. To see that $|Z_{c-1}| = p$ can occur consider the ring of p
elements in which all products are 0.


REMARK 2. To show that the bound on indices in this result cannot be
improved, consider the following example. Let R be the ring generated by elements
a, b with $p^n a = p^n b = 0$, $a^2 = ab = pa$, $ba = b^2 = pb$. The terms of the upper central
series of the circle group of R coincide with the multiples of R of the form
$p^k R$, $0 \le k \le n$, so all terms of the upper central series ascend by steps of $p^2$.


REMARK 3. Combining the above theorem with the construction of L.
Kaloujnine [2] for $p \ne 2$, or with a direct construction for $p = 2$, one
immediately obtains:


COROLLARY. All groups of orders dividing $p^3$ occur as circle groups of nilpotent
rings. A group of order $p^4$ occurs as a circle group if and only if it is Abelian or
has class 2.


To prove the theorem we shall first establish the following 


LEMMA. Let R be a nilpotent ring such that $[R^{n-2} : R^{n-1}]$ is a prime, where
$R^n = 0$, $R^{n-1} \ne 0$, and $n \ge 3$. Then $R^{n-2}$ is in the center of R.


Proof. Let $b = r_1 r_2 \cdots r_{n-2}$ be a product of length $n-2$ in R, with $b \notin R^{n-1}$.
Since $[R^{n-2} : R^{n-1}] = p$, for some prime p, it follows that $pb \in R^{n-1}$, and b
together with $R^{n-1}$ spans $R^{n-2}$. Since $R^{n-1}$ annihilates R, the center of R contains
$R^{n-1}$. Assume $R^{n-2}$ is not in the center. Then there is an element $a \in R$ with
$ab \ne ba$. Hence either $ab \ne 0$ or $ba \ne 0$. To be definite suppose $ab \ne 0$.

We shall now show by a finite induction that $b \equiv x a^{n-2} \pmod{R^{n-1}}$ for some
integer $x \not\equiv 0 \pmod{p}$. Suppose, for some integer k, $0 \le k \le n-3$, that


(1) $b \equiv x_k a^k r_1 r_2 \cdots r_{n-k-2} \pmod{R^{n-1}}$ for some integer $x_k \not\equiv 0 \pmod{p}$.


Since $R^{n-1}$ annihilates R, (1) implies $ab = (x_k a^{k+1} r_1 \cdots r_{n-k-3})\, r_{n-k-2}$. Since
$0 \ne ab \in R^{n-1}$, it follows that $s = x_k a^{k+1} r_1 \cdots r_{n-k-3} \notin R^{n-1}$. But $s \in R^{n-2}$, so
$b \equiv ys \pmod{R^{n-1}}$ for some integer $y \not\equiv 0 \pmod{p}$. Setting $x_{k+1} = y x_k$, we obtain
(1) with k replaced by $k+1$.

Condition (1) trivially holds for $k = 0$, so by induction we obtain $b \equiv x_{n-2} a^{n-2}
\pmod{R^{n-1}}$ for some integer $x_{n-2} \not\equiv 0 \pmod{p}$. Since, finally, $R^{n-1}$ annihilates R,
we have $ab = x_{n-2} a^{n-1} = ba$, which contradicts the assumption that $ab \ne ba$. Hence
$R^{n-2}$ is central in R.


Proof of Theorem. Let R be a nilpotent ring whose circle group G is a p-group.
Since R is nilpotent, the powers of R form a finite descending central series for
G. We shall select a subset,

\[ R = Z_0 \supseteq Z_1 \supseteq \cdots \supseteq Z_c = 0, \]

which forms a central series for G, and for which $[Z_{i-1} : Z_i] \ge p^2$, $1 \le i < c$. To do
so, let us define $Z_0 = R$ and proceed by induction. Suppose $Z_j = R^i \ne 0$. If
$[R^i : R^{i+1}] \ge p^2$, or if $R^{i+1} = 0$, define $Z_{j+1} = R^{i+1}$. If $[R^i : R^{i+1}] = p$ and $R^{i+1} \ne 0$,
then application of the lemma to the ring $R/R^{i+2}$ shows that $R^i/R^{i+2}$ is central
in $R/R^{i+2}$. In this case define $Z_{j+1} = R^{i+2}$. This process terminates when $Z_j = 0$,
and the description of the desired central series is then complete.


Acknowledgment. The author wishes to thank D. T. Price of the University of Chicago for 
helpful comments and the United States Atomic Energy Commission for financial support. 
A MEAN VALUE THEOREM
R. J. EASTON, Indiana State University, and S. G. WAYMENT, Utah State University


I. Introduction: Let $\mu$ be Lebesgue measure on $E^n$ and let T be a finitely
additive set function defined on the intervals $R_0$ in $E^n$. A sequence $\{I_i\}$ of
intervals in $E^n$ is said to converge to $x \in E^n$, denoted by $I_i \to x$, provided $x \in I_i$ for
each i and $\lim_{i \to \infty} d(I_i) = 0$, where $d(I)$ is the diameter of I. A sequence of
intervals is said to be regular and $\alpha$ is the parameter of regularity provided that for
each $I_i$ there exists a cube $J_i \supseteq I_i$ so that $\mu(I_i)/\mu(J_i) \ge \alpha$. The upper and lower
regular derivatives of T are defined as

\[
\overline{T}^{\prime *}(x) = \sup \limsup_{I_i \to x} \frac{T(I_i)}{\mu(I_i)}
\quad\text{and}\quad
\underline{T}^{\prime *}(x) = \inf \liminf_{I_i \to x} \frac{T(I_i)}{\mu(I_i)}
\]

respectively, where the sup and inf are taken over regular sequences which
converge to x. If $\overline{T}^{\prime *}(x) = \underline{T}^{\prime *}(x)$, then the regular derivative $T^{\prime *}$ is said to exist
[2]. If $T^{\prime *}$ exists at each point in an interval $R_0 \subset E^n$, then L. Misik has shown
[1] a mean value theorem holds.

We give here a simpler and more direct proof of his result using the
additional hypothesis that T is absolutely continuous with respect to $\mu$. For
notational advantages we restrict our proof to $E^2$.

Let $R = [a, b; c, d] = \{(x, y) : x \in (a, b),\ y \in (c, d)\}$. Denote the area of R
by $A(R)$, and define $g_R(x, y) = T(R')$, where $(x, y)$ is the mid-point of $R'$ and $R'$
has its edges parallel to the edges of R and is congruent to R. Since T is
absolutely continuous with respect to $\mu$, $g_R$ is a continuous function of x and y.


II. The Mean Value Theorem. If $T^{\prime *}(p)$ exists at each point p of a closed
rectangle R and T is absolutely continuous with respect to $\mu$, then there exists a point
$q \in R$ such that $T^{\prime *}(q) = T(R)/A(R)$.


Proof. Let $R_1 = R = [a, b; c, d]$ and divide $R_1$ into four rectangles using the
lines $x = a + h/2$, $y = c + k/2$ where $h = b - a$ and $k = d - c$. Denote the rectangles
beginning in the lower left hand corner and proceeding counterclockwise by
$R_{11}, R_{12}, R_{13}, R_{14}$, and observe that each of the four rectangles is similar to R.
Since $\sum_{j=1}^{4} T(R_{1j}) = T(R_1)$, there must exist a $j_1$ and a $j_2$ such that

\[ T(R_{1 j_1}) \le \tfrac{1}{4}\, T(R_1), \qquad T(R_{1 j_2}) \ge \tfrac{1}{4}\, T(R_1). \]

We consider the case $j_1 = 1$ and $j_2 = 3$. The other cases follow in a similar manner.
Let $\alpha = k/h$ and define the function $f(t) = g_{R_{11}}(a + t + h/4,\ c + \alpha t + k/4)$. Then
$f(0) = T(R_{11})$, $f(h/2) = T(R_{13})$, and f is a continuous function of t for $0 \le t \le h/2$.
Hence there exists a $t_0 \in [0, h/2]$ such that $f(t_0) = \tfrac{1}{4} T(R_1)$. The number $t_0$
determines a rectangle $R_2$. We proceed inductively to define a nested sequence of
closed rectangles $\{R_i\}$, geometrically similar to $R_1$, and with sides parallel to
$R_1$, such that

(i) $T(R_{i+1}) = \tfrac{1}{4}\, T(R_i)$ and (ii) $A(R_{i+1}) = \tfrac{1}{4}\, A(R_i)$.

By the nested interval theorem there exists exactly one point $q \in \bigcap_i R_i$, and

\[ T^{\prime *}(q) = \lim_{i \to \infty} T(R_i)/A(R_i) = T(R_1)/A(R_1). \]
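
The quartering construction in the proof can be traced numerically. The sketch below is only an illustration under stated assumptions: T is taken to be the integral of the hypothetical density $\rho(x, y) = x + y$ (so that $T^{\prime *} = \rho$), the sliding rectangle moves along the segment joining the quarter with the smallest T-value to the one with the largest, and the intermediate value is located by bisection. The names `T`, `next_rectangle` and `mean_value_point` are not from the paper.

```python
def T(rect):
    """T(R) = integral of rho(x, y) = x + y over R = (x0, x1, y0, y1).

    For a linear density this equals the area times the value at the centre.
    """
    x0, x1, y0, y1 = rect
    return (x1 - x0) * (y1 - y0) * ((x0 + x1) / 2 + (y0 + y1) / 2)


def next_rectangle(rect, steps=60):
    """Return a rectangle with half the side lengths and a quarter of T(rect)."""
    x0, x1, y0, y1 = rect
    w, h = (x1 - x0) / 2, (y1 - y0) / 2
    target = T(rect) / 4

    def quarter(corner):
        cx, cy = corner
        return (cx, cx + w, cy, cy + h)

    # Lower-left corners of the four quarters, counterclockwise.
    corners = [(x0, y0), (x0 + w, y0), (x0 + w, y0 + h), (x0, y0 + h)]
    lo = min(corners, key=lambda c: T(quarter(c)))   # T <= target here
    hi = max(corners, key=lambda c: T(quarter(c)))   # T >= target here

    def slide(s):
        return (lo[0] + s * (hi[0] - lo[0]), lo[1] + s * (hi[1] - lo[1]))

    # Bisection on the continuous function s -> T(quarter(slide(s))).
    a, b = 0.0, 1.0
    for _ in range(steps):
        m = (a + b) / 2
        if T(quarter(slide(m))) <= target:
            a = m
        else:
            b = m
    return quarter(slide(a))


def mean_value_point(rect, depth=20):
    """Approximate the point q of the theorem by the centre of R_depth."""
    for _ in range(depth):
        rect = next_rectangle(rect)
    x0, x1, y0, y1 = rect
    return ((x0 + x1) / 2, (y0 + y1) / 2)


if __name__ == "__main__":
    R = (0.0, 2.0, 1.0, 4.0)
    qx, qy = mean_value_point(R)
    print(qx + qy, T(R) / 6.0)   # density at q versus T(R)/A(R)
```

For the rectangle $[0, 2; 1, 4]$ the ratio $T(R)/A(R)$ is $21/6 = 3.5$, and the limit point should satisfy $\rho(q) = 3.5$ up to rounding.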


REMARK 1. By changing the selection process slightly for $R_i$, $i = 3, 4$ one can
insure $q \in \operatorname{Int}(R)$.
REMARK 2. The derivative $T^{\prime *}$ can be shown to have an interesting Darboux
property. Suppose G is an open set such that $T^{\prime *}(p)$ exists for each $p \in G$.
Suppose $p, q \in G$ and $T^{\prime *}(p) = \alpha < T^{\prime *}(q) = \beta$ and let $\gamma$ be between $\alpha$ and $\beta$. If
$pq$ is an arc in G and $\epsilon > 0$, then there exists a point x within $\epsilon$ of $pq$ such that
$T^{\prime *}(x) = \gamma$. This is easily seen to be true by obtaining two congruent rectangles
$R_p$ and $R_q$ with diameter less than $\epsilon$ and centered at p and q respectively and
such that $T(R_p)/\mu(R_p)$ and $T(R_q)/\mu(R_q)$ are respectively less than and greater
than $\gamma$. By further restricting the size of $R_p$ and $R_q$ one can ensure that for each
$w \in pq$ there exists a rectangle $R_w$ congruent to $R_p$ and $R_q$ such that $R_w \subset G$ and
has edges parallel to the coordinate axes. Now since $g_{R_p}$ is continuous on $pq$ there
exists a point $s \in pq$ and a rectangle $R_s$ centered at s and contained in G such
that $T(R_s)/\mu(R_s) = \gamma$. Finally, an application of the mean value theorem yields
the desired x.

REMARK 3. The requirement that T be absolutely continuous with respect
to $\mu$ was needed only to insure that $g_{R'}$ be a continuous function of x and y for
each $R' \subset R$. This leads one to the following conjecture:


CONJECTURE. If $T^{\prime *}$ exists for each $q \in R$, then $g_{R'}$ is a continuous function
of x and y for each $R' \subset R$.


If this conjecture were true, then the theorem given here is equivalent to
Misik's theorem [1]. In fact if $g_{R'}$ could be shown to have an intermediate value
property, then the theorems would be equivalent.
A COMPACT NON-METRIZABLE SPACE SUCH THAT EVERY
CLOSED SUBSET IS A G-DELTA


Roy A. JOHNSON, Washington State University 


1. Introduction. By the Borel sets and Baire sets of a locally compact space
we mean the $\sigma$-rings generated by compact sets and compact $G_\delta$ sets, respectively
[1, pp. 219-220]. If X and Y are locally compact spaces, it is well known that
the Baire sets of $X \times Y$ form precisely the product $\sigma$-ring of the Baire sets in X
with the Baire sets in Y. See [1, 51.E]. It is therefore very tempting to believe
that the Borel sets in $X \times Y$ form precisely the product $\sigma$-ring of the Borel sets
in X with the Borel sets in Y, but unfortunately this is false. Worse yet, it is
possible that the Borel sets and Baire sets of a space X may coincide (i.e., each
compact set may be a $G_\delta$), but that the Borel sets of $X \times X$ may form a strictly
larger class than the Baire sets of $X \times X$. Such is the case for a compact
non-metrizable space in which each closed subset is a $G_\delta$. Let us see why.

We notice that if X is a compact Hausdorff space, then the Borel sets and
Baire sets of $X \times X$ coincide only if X is metrizable. For, if the diagonal D of
$X \times X$ is a Baire set, then D is a compact $G_\delta$. See [1, 51.D]. Hence, there exists
a decreasing sequence of compact neighborhoods of D whose intersection is D.
This sequence of decreasing neighborhoods is a base for the neighborhood system
of D [2, Exercise 5.F(a)], and hence for the uniformity of X, so that X is
metrizable [2, 6.13, 6.30].


2. The example. The space we construct can be viewed in three ways: as a 
subset of the reals with a suitable topology, as a subspace of a product of discrete 
spaces, or as a subspace of an ordered set having the order topology. We con- 
struct three homeomorphic spaces having the promised properties. 

Let $X = [-1, 1) = \{x : -1 \le x < 1\}$. Let $\mathcal{S}$ be the family of subsets A such
that A or $X - A$ has the form $[-b, b)$ for some b such that $0 \le b \le 1$. Then $\mathcal{S}$ is a
subbase for a topology on X. If $\mathcal{B}$ is the base derived from $\mathcal{S}$, then

\[ \mathcal{B} = \{[-b, -a) \cup [a, b) : 0 \le a \le b \le 1\}. \]

Since members of $\mathcal{S}$ are both open and closed, so are base members.

THEOREM 2.1. The space X is Hausdorff.


Proof. Given two distinct points of X, then one has smaller absolute value
than the other or the points are negatives of each other. In the first case, suppose
$|a| < |b|$. Choosing c such that $|a| < c < |b|$, we have $a \in [-c, c)$ and
$b \in X - [-c, c)$. In the second case, suppose $b = -a$, where $a > 0$. Then $b \in [-a, a)$
and $a \in X - [-a, a)$.

To construct the second example, let G be the class of all functions from 
[0, 1] into the discrete space of two elements, and let G have the product topol- 
ogy. Then G is a compact Hausdorff space. The desired example is a subspace 
Y, defined as follows: Let Y be the class of all nonconstant increasing functions 
with the relative topology. That is, 


\[ Y = \{g \in G : g(0) = 0,\ g(1) = 1, \text{ and } g(i) \le g(j) \text{ if } 0 \le i \le j \le 1\}. \]


Then Y consists of those functions which are 0 to the left of some point j, called
a jump point, and 1 to the right of that point. We shall call a function g lower
jump or upper jump at its jump point j, depending on whether g(j) is 0 or 1. A
function g€Y is clearly characterized by its jump point and the knowledge of 
whether it is lower jump or upper jump. 


THEOREM 2.2. The subspace Y is closed, and hence compact, in G.


Proof. Suppose $f \in G - Y$. If $f(0) = 1$ [$f(1) = 0$], then $\{g \in G : g(0) = 1\}$ [resp.,
$\{g \in G : g(1) = 0\}$] is a neighborhood of f which is disjoint from Y. In the
remaining case, there exist i and j such that $0 < i < j < 1$ and such that $f(i) > f(j)$.

Necessarily, $f(i) = 1$ and $f(j) = 0$. Let

\[ U = \{g \in G : g(i) = 1\} \cap \{g \in G : g(j) = 0\}. \]

Then U is a neighborhood of f which is disjoint from Y.

THEOREM 2.3. The spaces X and Y are homeomorphic, and hence X is compact.


Proof. Define $S: Y \to X$ as follows: If $g \in Y$ is lower jump at j, let $S(g) = j$. If
$g \in Y$ is upper jump at j, let $S(g) = -j$. Then given $S(g)$, the jump point of g is
$|S(g)|$, and g is lower or upper jump, depending on whether $S(g) \ge 0$ or $S(g) < 0$.
Thus, S is one-to-one, and it is clear that S is onto.

To show that S is continuous, we show that the inverses of subbase members
of X are open sets in Y. See [2, 3.1]. If U is a subbase member of X having the
form $[-b, b)$, then

\[ S^{-1}[U] = \{g \in Y : g(b) = 1\}, \]

an open set in Y. On the other hand, if $X - U = [-b, b)$, then

\[ S^{-1}[U] = \{g \in Y : g(b) = 0\}, \]


an open set in Y. Hence S is a continuous one-to-one mapping of the compact 
space Y onto the Hausdorff space X, which shows that S is a homeomorphism. 

Incidentally, X could be shown directly to be compact by showing that every 
covering by subbase members can be reduced to a covering by two such mem- 
bers. 

To construct the third example, let H be the unit square with dictionary
order, denoted by $\prec$, and let H have the order topology [2, Exercise 5.J]. Then
H is a compact Hausdorff space. The desired example is a subspace Z, defined
as follows: Let


\[ Z = \{(x, y) \in H : y = 0 \text{ or } 1\} - \{(0, 0), (1, 1)\}. \]


That is, Z consists of the lower and upper edges of H, without the first and last 
points of H. Let Z have the relative topology. Since Z is clearly closed in H, we 
see that Z is compact. 


THEOREM 2.4. The spaces X and Z are homeomorphic.


Proof. Define $T: Z \to X$ by $T(x, 0) = -x$ and $T(x, 1) = x$. Clearly T is
one-to-one and onto. We show that T is continuous by showing $T^{-1}[U]$ is open for each
subbase member U of X. If U has the form $[-b, b)$, then

\begin{align*}
T^{-1}[U] &= \{(x, 0) : -b \le -x < 0\} \cup \{(x, 1) : 0 \le x < b\} \\
&= \{(x, y) \in Z : (x, y) \prec (b, 1)\},
\end{align*}

an open set in Z. Similarly, if $X - U = [-b, b)$, then

\[ T^{-1}[U] = \{(x, y) \in Z : (b, 0) \prec (x, y)\}, \]

an open set in Z. Since T is a one-to-one continuous mapping of the compact
space Z onto X, we see that T is a homeomorphism.
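
The two preimage formulas in this proof can be spot-checked on a finite grid. The sketch below is an illustration only, not a proof: it samples points of Z with rational coordinates and compares membership of $T(z)$ in $[-b, b)$ with the dictionary-order conditions. It relies on the fact that Python compares tuples lexicographically, which is the dictionary order. The helper names `sample_Z`, `in_subbase_set` and `check_preimages` are hypothetical.

```python
from fractions import Fraction


def T(z):
    """The map T: Z -> X of Theorem 2.4, with T(x, 0) = -x and T(x, 1) = x."""
    x, y = z
    return -x if y == 0 else x


def sample_Z(m):
    """Grid points of Z: both edges of the unit square minus (0,0) and (1,1)."""
    xs = [Fraction(k, m) for k in range(m + 1)]
    return [(x, y) for x in xs for y in (0, 1)
            if (x, y) not in {(0, 0), (1, 1)}]


def in_subbase_set(t, b):
    """Membership of t in the subbase member [-b, b) of X."""
    return -b <= t < b


def check_preimages(m=12):
    """Compare both preimage formulas on the sample grid."""
    pts = sample_Z(m)
    for k in range(m + 1):
        b = Fraction(k, m)
        for z in pts:
            # U = [-b, b): preimage is {z : z precedes (b, 1)}.
            assert in_subbase_set(T(z), b) == (z < (b, 1))
            # X - U = [-b, b): preimage of U is {z : (b, 0) precedes z}.
            assert (not in_subbase_set(T(z), b)) == ((b, 0) < z)
    return True


if __name__ == "__main__":
    print(check_preimages())
```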


THEOREM 2.5. The space X is not metrizable.

Proof. Since X is compact, it would have a countable base if it were
metrizable. But each $x \in [-1, 0)$ is a member of the open set $[x, -x)$, and each such
pair of x and $[x, -x)$ would require a distinct member of the proposed countable
base, which is impossible.


THEOREM 2.6. The space X is a compact non-metrizable space such that each
closed subset is a $G_\delta$.


Proof. We show that if U is open, then U is a countable union of base
members and is therefore an $F_\sigma$. It follows that each closed subset is a $G_\delta$, and this is
all that needs to be shown.

For each $x \in U$, let $V_x$ be a base member such that $x \in V_x \subset U$. Then
$U = \bigcup \{V_x : x \in U\}$. We show that U can be covered by a countable subfamily of
$\{V_x : x \in U\}$. Now by a Euclidean interval we shall mean an open interval in the
usual sense. Let

\[ W = \{y \in U : y \in W_y \subset V_x \text{ for some } x \in U \text{ and some Euclidean interval } W_y\}. \]

Clearly $W = \bigcup \{W_y : y \in W\}$. Then there exists a countable subfamily of
$\{W_y : y \in W\}$, say $W_n$, such that $W = \bigcup W_n$. For each n, we choose $V_n \in \{V_x : x \in U\}$
such that $W_n \subset V_n$. Necessarily, $W \subset \bigcup V_n$. Hence, to show that U can be
covered by a countable subfamily of $\{V_x : x \in U\}$, we need only show that $U - W$
is countable. To do this, we construct a one-to-one mapping from $U - W$ into
the rational numbers. For each $x \in U - W$, choose a rational number $r_x$ such that
$x < r_x$ and such that $[x, r_x) \subset V_x$. The mapping $x \to r_x$ is one-to-one. Otherwise
there would exist x and y in $U - W$ such that $x < y$ and such that $r_x = r_y$. But then
$y \in (x, r_x) \subset V_x$, so that $y \in W$, a contradiction. Hence, $U - W$ is countable, and
the proof is complete.


3. An observation. A Borel measure $\mu$ is called completion regular if each
Borel set lies in the domain of the completion of the Baire contraction of $\mu$ [1,
p. 230]. Equivalently, $\mu$ is completion regular if

\[ \mu(E) = \sup \{\mu(C) : C \text{ is a compact } G_\delta \text{ and } C \subset E\}. \]

Letting X be the space in this note, we notice that each Borel measure on X is
completion regular since the Borel sets and Baire sets coincide. But,
interestingly, there exist Borel measures on $X \times X$ which fail to be completion regular.

For, let $\mu$ be any Borel measure on $X \times X$ such that the diagonal D has
positive measure and such that each countable set has zero measure. (If $T(x, x) = x$
for each $x \in X$ and if m is Lebesgue measure on the line, then $\mu(E) = m(T[E \cap D])$
is such a measure.) Now if C is a compact subset in D, we let

\[ A = \{(x, -x) : (x, x) \in C \text{ or } (-x, -x) \in C\}. \]

Then every open set containing C contains all but finitely many members of A,
so that each $G_\delta$ containing C contains all but countably many members of A.
Hence, if C is a compact $G_\delta$, then C must be countable. Hence, $\mu$ fails to be
completion regular.
A GROUP EPIMORPHISM IS SURJECTIVE


C. E. LINDERHOLM, Reading University


1. Introduction. The reader should be familiar with the most elementary
part of a first course on group theory. Such a course may not contain the
following definitions. We write $\phi: F \to G$ to mean that F and G are groups and $\phi$ is a
homomorphism defined on F and taking values in G. If $\psi: G \to H$ the composite
homomorphism from F to H is written $\phi\psi$, according to the usage of some
algebraists. If $x \in F$ the image under $\phi$ is written $(x)\phi$. A homomorphism $\psi$ that
cancels on the right in every equation $\phi_1\psi = \phi_2\psi$ is a monomorphism. More
precisely, if $\psi: G \to H$ then $\psi$ is a monomorphism if for each group F and for each
pair $\phi_1, \phi_2$ of homomorphisms from F to G such that $\phi_1\psi = \phi_2\psi$, the equation
$\phi_1 = \phi_2$ holds. A homomorphism $\phi$ that cancels on the left in every equation
$\phi\psi_1 = \phi\psi_2$ is an epimorphism. If $\phi: F \to G$ and if an inverse homomorphism $\psi$ from
G to F exists (i.e., one such that $(x)\phi\psi = x$ for each element x of F and $(y)\psi\phi = y$
for each element y of G) then $\phi$ is an isomorphism.

Suppose $\phi: F \to G$. Then $\phi$ is a monomorphism if and only if $\phi$ is injective; an
epimorphism if and only if surjective; and an isomorphism if and only if bijective.

(Injective, surjective, and bijective are the Norman terms for the Anglo- 
Saxon one-to-one, onto, one-to-one and onto.) The three assertions just made 
become six if they are analyzed into their “if” and “only if” parts. Of the six, 
five are very easy to prove; the hardest is the assertion of the title. 


2. Proof. The present proof closely resembles one due to S. Eilenberg and
J. Moore [3]; moreover, it seems to be well known to those who work with
categories. A proof using the slightly sophisticated concept of free products with
amalgamation is easy once one knows that concept. Such a proof, however, since
it involves constructing a possibly infinite group, fails to yield the interesting
corollary that an epimorphism in the category of finite groups is surjective.

Let $\phi$ be an epimorphism from a group G to a group H, and let A be the image
subgroup. We must show that $A = H$. To do this we construct two
homomorphisms $\psi_1, \psi_2$ from H to a group K such that $\phi\psi_1 = \phi\psi_2$, and we use the resulting
equation $\psi_1 = \psi_2$ to prove that $A = H$.

Let $H/A$ be the set of all right cosets of A; let $A'$ be something that is not a
right coset of A, and let S be the set $H/A \cup \{A'\}$. Let K be the group of all
permutations (bijections) of S. If $h, h_1, h_2 \in H$ and if $Ah_1$ and $Ah_2$ are the same
coset, then $A(h_1 h)$ and $A(h_2 h)$ are the same coset, and hence the function from
$H/A$ to $H/A$ that sends $Ah'$ to $A(h'h)$ is well defined. It is easily seen to be
bijective. (In fact its inverse is given by $Ah' \to Ah'h^{-1}$.) If $A'$ is sent to itself, this
defines a bijection of S, which we write $(h)\psi_1$. The function $\psi_1$ from H to K so
defined is easily seen to be a homomorphism. Let $\sigma$ be the permutation of S that
interchanges A and $A'$ and leaves everything else fixed, and if $h \in H$ write $(h)\psi_2$
for the composite permutation $\sigma^{-1}((h)\psi_1)\sigma$. Then the function $\psi_2$ is the
composition of $\psi_1$ with an inner automorphism of K, and hence is a homomorphism
from H to K.

If $a \in A$, then $(a)\psi_1$ leaves both A and $A'$ fixed, and $\sigma$ leaves every other
element of S fixed. Hence $\sigma$ and $(a)\psi_1$ commute, and $(a)\psi_2 = \sigma^{-1}((a)\psi_1)\sigma = (a)\psi_1$.
Since $\psi_1$ and $\psi_2$ agree on the range of $\phi$, the equation $\phi\psi_1 = \phi\psi_2$ holds. Since $\phi$ is an
epimorphism, $\psi_1 = \psi_2$. Hence $(h)\psi_1$ commutes with $\sigma$ for each element h of H.
Since $(h)\psi_1$ leaves $A'$ fixed and $\sigma$ interchanges $A'$ and A, it follows that $(h)\psi_1$
leaves A fixed. Since $(h)\psi_1$ sends A to $Ah$, it follows that $h \in A$. Since h was an
arbitrary element of the group H, it follows that $A = H$ and $\phi$ is surjective.
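
The construction of $\psi_1$ and $\psi_2$ can be carried out explicitly in a small case. The sketch below is an illustration under assumptions not in the note: H is taken to be the symmetric group on three letters and A a subgroup of order 2 standing in for a proper image subgroup. Permutations act on the right, so `mul(p, q)` means "first p, then q". All function names are hypothetical.

```python
from itertools import permutations


def mul(p, q):
    """Product pq in right-action notation: first p, then q."""
    return tuple(q[p[i]] for i in range(len(p)))


H = list(permutations(range(3)))      # the symmetric group on 3 letters
A = [(0, 1, 2), (1, 0, 2)]            # a subgroup of order 2 (assumed image)


def coset(h):
    """The right coset Ah."""
    return frozenset(mul(a, h) for a in A)


A_COSET = coset((0, 1, 2))
A_PRIME = "A'"                        # a point that is not a right coset
S = list({coset(h) for h in H}) + [A_PRIME]


def psi1(h):
    """(h)psi_1: right multiplication by h on cosets, with A' fixed."""
    return {s: (A_PRIME if s == A_PRIME
                else frozenset(mul(c, h) for c in s)) for s in S}


def sigma(s):
    """The transposition of S that interchanges A and A'."""
    if s == A_COSET:
        return A_PRIME
    if s == A_PRIME:
        return A_COSET
    return s


def psi2(h):
    """(h)psi_2 = sigma^{-1} (h)psi_1 sigma; sigma is its own inverse."""
    p = psi1(h)
    return {s: sigma(p[sigma(s)]) for s in S}


def agreement_set():
    """Elements of H on which psi_1 and psi_2 coincide."""
    return {h for h in H if psi1(h) == psi2(h)}


if __name__ == "__main__":
    print(sorted(agreement_set()))    # expected to be exactly A
```

In this example the two homomorphisms agree exactly on A, so any homomorphism with image A fails to be an epimorphism unless $A = H$, which is the content of the proof.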
RESEARCH PROBLEMS
EDITED BY VICTOR KLEE


In this Department the Monthly presents easily stated research problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied 
by relevant references (if any are known to the author) and by a brief description of known 
partial results. Manuscripts should be sent to Victor Klee, Department of Mathematics, 
University of Washington, Seattle, WA 98105. 


DOES THERE EXIST A "FOUR NORMALS TRIANGLE"?


H. GUGGENHEIMER, Polytechnic Institute of Brooklyn 


There are always at least two normals from an interior point of an oval 
(a closed, smooth, convex curve) to the curve: the segments of minimum and 
maximum length from the point to the oval. There exists quite a body of litera- 
ture connected with the determination of points from which more than two nor- 
mals can be drawn to a curve. It is easy to see that for any oval that is not a 
circle there exists a domain of positive area all of whose points are “four- 
normals points,” i.e., from which at least four normals can be drawn to the oval. 
The only four-normals point of the circle is its center. 

Of particular interest in this respect are three centers of mass distributions 
connected with an oval: the centroid G, the perimeter centroid P (center of 
mass for a density 1 on the boundary curve), and the curvature centroid or 
Steiner point C (center of mass for a density proportional to the radius of
curvature on the boundary curve). Hayashi [1] has shown that C is a four-normals
point, and Bose [2] did the same for G. These proofs use Fourier analysis or some
equivalent tool [3]. Chakerian and Stein [4] proved that P is a four-normals
point; their proof is a simple continuity argument. Bose and Roy [5] noted that
the normals to a curve are identical with the normals to the outer parallel curves
for all positive distances (the boundary of the union of all circles of a constant
radius and centers in or on the oval). If G(c), P(c), C(c) are the corresponding
points for the parallel curve in distance $c \ge 0$ and A, L are, respectively, surface
area and perimeter of the given oval, we have by formulas of Nicliborc [6]

\begin{align*}
G(c) &= (A + Lc + \pi c^2)^{-1} \{AG + cLP + c^2 \pi C\}, \\
P(c) &= (L + 2\pi c)^{-1} \{LP + 2\pi c\, C\}, \\
C(c) &= C.
\end{align*}

Hence, $C = \lim_{c \to \infty} G(c) = \lim_{c \to \infty} P(c)$. The points G(c) are on a parabolic arc
joining G and C; the points P(c) fill a segment CP of the tangent to the arc at C;
and the endpoint P is the intersection of that tangent with the tangent to the
parabola at G. In this way, one obtains a "triangle" CPG formed by the arc of
parabola GC and its tangents at the endpoints. It is known, then, that all points
of the arc CG and the segment CP are four-normals points. Question: Is it true
that all points of CPG and its interior are four-normals points? The known
methods do not seem to work, probably because the methods needed for CG and
for CP have nothing in common.
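
The formulas for G(c), P(c), C(c) are easy to evaluate. The sketch below simply transcribes them; the numerical values of A, L, G, P, C used in the example are hypothetical placeholders and are not computed from any particular oval. The function name `parallel_centroids` is likewise an assumption.

```python
import math


def parallel_centroids(A, L, G, P, C, c):
    """Return (G(c), P(c), C(c)) for the outer parallel curve at distance c.

    A and L are the area and perimeter of the oval; G, P, C are its centroid,
    perimeter centroid and curvature centroid, given as (x, y) tuples.
    """
    denom_g = A + L * c + math.pi * c ** 2
    Gc = tuple((A * g + c * L * p + c ** 2 * math.pi * s) / denom_g
               for g, p, s in zip(G, P, C))
    denom_p = L + 2 * math.pi * c
    Pc = tuple((L * p + 2 * math.pi * c * s) / denom_p
               for p, s in zip(P, C))
    return Gc, Pc, C


if __name__ == "__main__":
    # Hypothetical data, for illustration only.
    A, L = 3.0, 7.0
    G, P, C = (0.1, 0.0), (0.2, 0.1), (0.3, 0.3)
    for c in (0.0, 1.0, 10.0, 1e6):
        print(c, parallel_centroids(A, L, G, P, C, c))
```

At $c = 0$ the function returns the original points, and for large c both G(c) and P(c) approach C, in agreement with the limits stated above.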
CLASSROOM NOTES


EDITED BY DAVID DRASIN


Manuscripts for this Department should be sent to David Drasin, Division of Mathematical 
Sciences, Purdue University, Lafayette, IN 47907. 


SATURATION OF MEASURES 


W. G. DOTSON, JR., North Carolina State University


Suppose $(X, \mathcal{B}, \mu)$ is a measure space. A set $E \subset X$ is said to be locally
measurable provided that if $F \in \mathcal{B}$ and $\mu F < \infty$ then $F \cap E \in \mathcal{B}$. The measure $\mu$
is said to be saturated [1, p. 221] provided every locally measurable set is
measurable. It is well known that a measure $\mu$ induced by an outer measure $\mu^*$ is
complete. It has recently been suggested [1, p. 253, Problem 2] that a measure
$\mu$ induced by an outer measure $\mu^*$ must also be saturated. That this is not true
can be seen by the following example:

Let X be any uncountable set, and define $\mu^*$ by $\mu^*\emptyset = 0$, $\mu^*E = 1$ if E is any
nonempty countable subset of X, and $\mu^*E = \infty$ if E is any uncountable subset
of X. It is easily seen that $\mu^*$ is an outer measure and that the only
$\mu^*$-measurable sets are $\emptyset$ and X. Indeed, if $E \subset X$ and $\emptyset \ne E \ne X$ then letting $A = \{p, q\}$
where $p \in E$, $q \in X \setminus E$, we have $\mu^*A = 1 < 1 + 1 = \mu^*(A \cap E) + \mu^*(A \setminus E)$. So the
measure $\mu$ induced by $\mu^*$ is given by $\mu = \mu^*|\mathcal{B}$ where $\mathcal{B} = \{\emptyset, X\}$. It is clear
that each set $E \subset X$ is locally measurable, since the only $F \in \mathcal{B}$ with $\mu F < \infty$ is
$F = \emptyset$. Thus $\mu$ is not saturated.
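
The Carathéodory test used in this example can be run by brute force on a finite stand-in. The sketch below is an assumption-laden illustration: X is a three-point set and the outer measure is 0 on the empty set and 1 on every nonempty set. A finite set cannot reproduce the value $\infty$ on uncountable sets, so this stand-in does not exhibit the failure of saturation; it only shows the splitting argument with $A = \{p, q\}$ that rules out every set other than $\emptyset$ and X. The names `outer`, `subsets` and `is_measurable` are hypothetical.

```python
from itertools import combinations


def subsets(xs):
    """All subsets of the finite set xs, as frozensets."""
    xs = list(xs)
    return [frozenset(c) for r in range(len(xs) + 1)
            for c in combinations(xs, r)]


def outer(E):
    """Finite stand-in for mu*: 0 on the empty set, 1 on nonempty sets."""
    return 0 if not E else 1


def is_measurable(E, X):
    """Caratheodory criterion: mu*(A) >= mu*(A & E) + mu*(A - E) for all A."""
    return all(outer(A) >= outer(A & E) + outer(A - E) for A in subsets(X))


if __name__ == "__main__":
    X = frozenset({0, 1, 2})
    print([set(E) for E in subsets(X) if is_measurable(E, X)])
```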

On the other hand, it is well known that if $\mu_0$ is a measure on an algebra $\mathcal{B}_0$,
then the outer measure $\mu^*$ induced by $\mu_0$ is regular, e.g., see [1, p. 260]. The outer
measure $\mu^*$ then induces a measure $\mu$ on a $\sigma$-algebra $\mathcal{B}$ containing $\mathcal{B}_0$, and
$\mu|\mathcal{B}_0 = \mu_0$. It is in this context of the Carathéodory extension theorem that measures
induced by outer measures $\mu^*$ are most frequently encountered.


THEOREM. If $\mu^*$ is a regular outer measure, then the measure $\mu$ induced by
$\mu^*$ is saturated.


Proof. Let $\mathcal{B}$ denote the $\sigma$-algebra of all $\mu^*$-measurable subsets of X, so that
$\mu = \mu^*|\mathcal{B}$. Suppose E is a locally measurable subset of X. For any $A \subset X$ we must
show $\mu^*A \ge \mu^*(A \cap E) + \mu^*(A \setminus E)$. Since this is obviously true if $\mu^*A = \infty$, we
suppose $\mu^*A < \infty$. Since $\mu^*$ is regular there exists $C \in \mathcal{B}$ such that $C \supset A$ and
$\mu C = \mu^*C = \mu^*A < \infty$. It now follows at once from the local measurability of E
that $C \cap E \in \mathcal{B}$, and so $C \setminus E = C \setminus (C \cap E) \in \mathcal{B}$ also. Hence

\begin{align*}
\mu^*A = \mu C &= \mu(C \cap E) + \mu(C \setminus E) \\
&= \mu^*(C \cap E) + \mu^*(C \setminus E) \\
&\ge \mu^*(A \cap E) + \mu^*(A \setminus E),
\end{align*}

since $C \supset A$ and $\mu^*$ is monotone. Q.E.D.


COROLLARY 1. If the outer measure $\mu^*$ is induced by a measure on an algebra,
then the measure $\mu$ induced by $\mu^*$ is saturated.


COROLLARY 2. The outer measure in the above example is not regular, and is
therefore not induced by any measure on an algebra.
AN INEQUALITY ABOUT COMPLEX NUMBERS
W. W. BLEDSOE, University of Texas


In this note we give a simple proof of the theorem listed below about
complex numbers. This is a slightly stronger result than that given by Rudin
(Walter Rudin, Real and Complex Analysis, Lemma 6.3, p. 119). His constant
of 1/6 has been replaced by the best value of $1/\pi$.


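THEOREM. Every finite set $\{z_1, z_2, \ldots, z_n\}$ of complex numbers has a subset
S such that

\[ \Bigl|\sum_{z \in S} z\Bigr| \ge \frac{1}{\pi} \sum_{j=1}^{n} |z_j|. \]


Proof. For each $\theta$, $0 \le \theta < 2\pi$, let $S_\theta$ be the set of those $z_j$ such that
$-\pi/2 \le \theta - \arg(z_j) \le \pi/2$, and let $f(\theta) = \bigl|\sum_{z \in S_\theta} z\bigr|$.
We complete the proof by showing that

\[ f(\theta') \ge \frac{1}{\pi} \sum_{j=1}^{n} |z_j| \]

for some $\theta'$.
Note that $f(\theta)$ is piecewise constant in $\theta$,

\begin{align*}
\Bigl|\sum_{z \in S_\theta} z\Bigr|
&= \Bigl|e^{-i\theta} \sum_{z \in S_\theta} z\Bigr|
= \Bigl|\sum_{z \in S_\theta} z e^{-i\theta}\Bigr| \\
&\ge \Bigl|\operatorname{Re}\Bigl(\sum_{z \in S_\theta} z e^{-i\theta}\Bigr)\Bigr|
= \Bigl|\sum_{z \in S_\theta} \operatorname{Re}(z e^{-i\theta})\Bigr| \\
&= \Bigl|\sum_{z \in S_\theta} |z| \cos(\theta - \arg z)\Bigr| \\
&= \sum_{z \in S_\theta} |z| \cos(\theta - \arg z),
\end{align*}

and hence

\begin{align*}
\int_0^{2\pi} f(\theta)\, d\theta
&= \int_0^{2\pi} \Bigl|\sum_{z \in S_\theta} z\Bigr|\, d\theta \\
&\ge \int_0^{2\pi} \sum_{z \in S_\theta} |z| \cos(\theta - \arg(z))\, d\theta \\
&= \sum_{j=1}^{n} \int_{\arg(z_j) - \pi/2}^{\arg(z_j) + \pi/2} |z_j| \cos(\theta - \arg(z_j))\, d\theta \\
&= \sum_{j=1}^{n} 2 |z_j|.
\end{align*}

Thus, by the extreme value theorem, there is a $\theta'$ between 0 and $2\pi$ for which

\[ f(\theta') \ge \frac{1}{2\pi} \sum_{j=1}^{n} 2 |z_j|. \]

The desired conclusion is at hand.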
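
The proof is constructive enough to turn into a search for a good subset. The sketch below is one possible reading, not the author's procedure: since $f$ is piecewise constant with breakpoints at $\arg(z_j) \pm \pi/2$, it evaluates $f$ at the midpoint of each interval between consecutive breakpoints and keeps the best half-plane subset. The function name `best_half_plane_subset` and the sample points are hypothetical.

```python
import cmath
import math


def best_half_plane_subset(zs):
    """Return a subset S_theta maximizing |sum| over the intervals of theta."""
    zs = [z for z in zs if z != 0]
    if not zs:
        return []
    # Breakpoints of the piecewise constant function f, reduced mod 2*pi.
    cuts = sorted({(cmath.phase(z) + s * math.pi / 2) % (2 * math.pi)
                   for z in zs for s in (-1, 1)})
    best = []
    for i, lo in enumerate(cuts):
        hi = cuts[i + 1] if i + 1 < len(cuts) else cuts[0] + 2 * math.pi
        theta = (lo + hi) / 2          # a point strictly inside the interval
        S = [z for z in zs if math.cos(theta - cmath.phase(z)) > 0]
        if abs(sum(S)) > abs(sum(best)):
            best = S
    return best


def ratio(zs):
    """|sum over the chosen subset| divided by the sum of the moduli."""
    return abs(sum(best_half_plane_subset(zs))) / sum(abs(z) for z in zs)


if __name__ == "__main__":
    sample = [1 + 2j, -3 + 0.5j, 0.2 - 1j, -1 - 1j, 2 - 2j]
    print(ratio(sample), 1 / math.pi)
    roots = [cmath.exp(2j * math.pi * k / 200) for k in range(200)]
    print(ratio(roots), 1 / math.pi)
```

For the roots of unity the ratio is only slightly above $1/\pi$, which is the sharpness phenomenon discussed next in the note.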


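The constant $1/\pi$ is the best (largest) possible, as can be seen as follows.
Let $C_0$ be the supremum of numbers C such that every nonempty, finite set T of
complex numbers has a subset S with

\[ \Bigl|\sum_{z \in S} z\Bigr| \ge C \sum_{z \in T} |z|. \]

The above theorem shows that $C_0 \ge 1/\pi$.
Now consider $z_j = \exp[i(2\pi/n)j]$, $j = 1, \ldots, n$. For this choice of points we
have, for each $\theta$,

\begin{align*}
\lim_{n \to \infty} \Bigl(\Bigl|\sum_{z \in S_\theta} z\Bigr| \Big/ \sum_{j=1}^{n} |z_j|\Bigr)
&= \lim_{n \to \infty} \frac{1}{n} \Bigl|\sum_{z \in S_\theta} z\Bigr| \\
&= \lim_{n \to \infty} \frac{1}{2\pi} \Bigl|\frac{2\pi}{n} \sum_{z \in S_\theta} z\Bigr| \\
&= \frac{1}{2\pi} \Bigl|\int_{\theta - \pi/2}^{\theta + \pi/2} e^{it}\, dt\Bigr|
= \frac{1}{2\pi} \cdot 2 = \frac{1}{\pi}.
\end{align*}

Thus $C_0 \le 1/\pi$ and $C_0 = 1/\pi$, which shows that $1/\pi$ is the best such constant.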
This technique readily lends itself to higher dimensions. If $\sigma$ is the
rotation-invariant measure on the unit sphere $S^{k-1} \subset R^k$, so normalized that $\sigma(S^{k-1}) = 1$,
and

\[ (1) \qquad c(k) = \int_{S_u} (q \cdot u)\, \sigma(dq), \]

where u is any point on $S^{k-1}$, $q \cdot u$ is the usual scalar product in $R^k$, and $S_u$ is
the half-sphere

\[ S_u = \{q \in S^{k-1} : q \cdot u \ge 0\}, \]

then we obtain the following:


THEOREM. Every finite set $\{p_1, p_2, \ldots, p_n\} \subset R^k$ has a subset E such that

\[ \Bigl|\sum_{p \in E} p\Bigr| \ge c(k) \sum_{j=1}^{n} |p_j|. \]

The c(k), defined by (1), are the best such constants. Computation shows
that $c(1) = 1/2$, $c(2) = 1/\pi$, $c(3) = 1/4$.
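
The three quoted values can be reproduced from a closed form. The expression used below, $c(k) = \Gamma(k/2) / \bigl(2\sqrt{\pi}\, \Gamma((k+1)/2)\bigr)$, is not stated in the note; it is an assumption of this sketch, obtained by evaluating (1) as half the mean of $|q \cdot u|$ over the sphere. The function name `c` mirrors the notation of the text.

```python
import math


def c(k):
    """Assumed closed form for the constant c(k) defined by (1)."""
    return math.gamma(k / 2) / (2 * math.sqrt(math.pi) * math.gamma((k + 1) / 2))


if __name__ == "__main__":
    for k in (1, 2, 3, 4):
        print(k, c(k))
```

Under this formula the values for $k = 1, 2, 3$ agree with those given in the text.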


After this paper was accepted for publication the author became aware of an earlier paper, 
(Kaufman and Rickert, “An inequality concerning measures,” Bul. AMS, 72 (1966), 672-676), 
which generalizes most of the results given in this paper. The treatment here is different and, 
hopefully, more readable. 


RIEMANN INTEGRATION OF LIMIT FUNCTIONS 


H. KESTELMAN, University College, London 


1. The tendency to down-grade Riemann integration into ‘calculus’ gets 
some of its impetus from the fact that the more useful rules for integrating 
limit functions are hard to prove for Riemann integrals without introducing 
Lebesgue measure. It is frustrating to be confined to uniform convergence when 
we know from Lebesgue’s theorem on dominated convergence that 


\[ (1) \qquad (R)\int_0^1 \lim_{n \to \infty} f_n(x)\, dx = \lim_{n \to \infty} (R)\int_0^1 f_n(x)\, dx \]

whenever the $f_n$ are Riemann-integrable and converge boundedly to a
Riemann-integrable function.

The object of this note is to prove (1), in the conditions stated, without the 
apparatus of Lebesgue measure: this is done in Theorem 1. Applications to 
infinite and repeated integrals, and to differentiation under the integral sign, 
are also discussed; those who continue to preach Riemann integration may find 
the exercise interesting. 
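
A small numerical experiment shows the kind of situation (1) covers. The example is an assumption of this sketch, not taken from the note: $f_n(x) = n x (1 - x)^n$ on $[0, 1]$ is bounded by 1, converges to 0 at every point, but not uniformly, while its integral $n/((n+1)(n+2))$ tends to 0. The helper names `f`, `riemann_sum` and `exact_integral` are hypothetical.

```python
def f(n, x):
    """f_n(x) = n x (1 - x)^n: boundedly but not uniformly convergent to 0."""
    return n * x * (1 - x) ** n


def riemann_sum(n, m=20000):
    """Midpoint Riemann sum of f_n over [0, 1] with m equal subintervals."""
    return sum(f(n, (k + 0.5) / m) for k in range(m)) / m


def exact_integral(n):
    """Closed form of the integral of f_n over [0, 1]."""
    return n / ((n + 1) * (n + 2))


def sup_on_grid(n, m=20000):
    """Largest sampled value of f_n; the true supremum stays near 1/e."""
    return max(f(n, (k + 0.5) / m) for k in range(m))


if __name__ == "__main__":
    for n in (1, 10, 100):
        print(n, riemann_sum(n), exact_integral(n), sup_on_grid(n))
```

The integrals shrink toward 0, the integral of the limit function, even though the maximum of $f_n$ does not.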

A referee, to whom I am grateful, points out that W. F. Osgood gave a 
discussion of the problem many years ago. It can be found in Amer. J. Math., 
19 (1897) 155-190, with the title “Non-uniform convergence and the integration 
of series term by term.” Osgood restricts himself to continuous functions, not 
necessarily uniformly bounded. His paper is interesting as a pioneer effort in 
set theory, but it could hardly be used for undergraduate instruction. 


2. We take for granted in $R_1$ that

(a) a decreasing sequence of bounded closed nonempty sets has a nonempty
intersection,

(b) every bounded open set is, in a unique way, the union of countably
many disjoint open intervals.

If I is the interval (a, b) we write $|I|$ for $b - a$, and if $\{I_n\}$ are the constituent
intervals of a bounded open set S we write $|S|$ for $\sum_n |I_n|$ and refer to it
as the 'measure' of S. If T is then an open subset of S it is easy to show that


(2) $|S| \le |T| + |U|$ if U is any open cover of $S - T$.

This is all the 'measure theory' we use.


3. We shall need the following 


Lemma: Let $\{E_n\}$ be a decreasing sequence of open subsets of $(0, 1)$ with $\inf_n |E_n| > 0$; then $\bigcap_n E_n \ne \emptyset$.


Proof. What we have to prove is the weakest possible dilution of Lebesgue’s theorem that $m(\lim_n E_n) \ge \lim(mE_n)$, but without using measure theory.

By (a) it is enough to find a decreasing sequence of open sets $\{U_n\}$ with $E_n \supset \bar{U}_n \ne \emptyset$. The idea is to select from each $E_n$ a finite set of intervals; these are then ‘shrunk’ to provide a closed subset of $E_n$. The details ensuring that the closed sets decrease are a little tiresome, but the method is intuitive.

We choose a number $\theta$, $0 < \theta < 1$, to be further restricted presently. If $I = (a - \delta, a + \delta)$ we write $I^*$ for the ‘shrunk’ interval $(a - \theta\delta, a + \theta\delta)$. Clearly $\bar{I^*} \subset I$ and $I - I^*$ is covered by an open set with measure less than $2(1 - \theta)|I|$.

Set $p = \inf_n |E_n|$. $\{I_n\}$ being the intervals of $E_1$, choose $N$ so that $|I_{N+1}| + |I_{N+2}| + \cdots < \tfrac12 p$ and define $U_1 = I_1^* \cup I_2^* \cup \cdots \cup I_N^*$. Then $E_1 \supset \bar{U}_1 \ne \emptyset$ and $E_1 - U_1$ is covered by an open set with measure less than $\tfrac12 p + 2(1 - \theta)|E_1|$. For each $r \ge 2$, $E_r \cap U_1$ is open; since $E_r \cap cU_1 \subset E_1 - U_1$, it follows from (2) that

$|E_r \cap U_1| \ge |E_r| - \left(\tfrac12 p + 2(1 - \theta)|E_1|\right) > \tfrac14 p$

provided $2(1 - \theta)|E_1| < \tfrac14 p$. If $\theta$ is so chosen then $\inf_{r \ge 2} |E_r \cap U_1| > 0$, and we may repeat the argument above on the decreasing sequence of open sets $E_r \cap U_1$ $(r \ge 2)$ to define an open set $U_2$ with $\emptyset \ne \bar{U}_2 \subset E_2 \cap U_1$ and $\inf_{r \ge 3} |E_r \cap U_2| > 0$. The process can be continued inductively to define a sequence $\{U_n\}$ as required.


4. We come now to the main 


THEOREM 1. Suppose that $f_0, f_1, f_2, \ldots$ are integrable-R over $[0, 1]$ and that $\{f_n(x)\}$ converges boundedly to $f_0(x)$ throughout $[0, 1]$. Then (1) is valid.


Proof. We have only to prove that $\lim_{n\to\infty} (R)\int_0^1 \{f_n(x) - f_0(x)\}\,dx = 0$, and so we may without loss of generality assume that $f_0 = 0$. Now assume the theorem false.

It follows (multiply the $f_n$ by a suitable constant) that a sequence $\{g_n\}$ exists with the following properties:

($\alpha$)    $-1 \le g_n(x) \le 1$ for all $n$ and all $0 \le x \le 1$, and $\lim_{n\to\infty} g_n(x) = 0$,

($\beta$)    $\inf_n (R)\int_0^1 g_n(x)\,dx = p > 0$.

If we show that there is, for each $n$, an open set $A_n$ in which $g_n(x) \ge \tfrac13 p$, and $\inf_n |A_n| > 0$, we may deduce from the lemma, on setting $E_n = A_n \cup A_{n+1} \cup \cdots$, that for some $\xi$ in $[0, 1]$, $g_n(\xi) \ge \tfrac13 p$ for infinitely many $n$. This contradicts $\lim_{n\to\infty} g_n(\xi) = 0$ and so establishes the theorem.

To define $A_n$, let $s_n$ be a lower Riemann sum of $g_n$ with $s_n > \tfrac23 p$; let $q_n$ be the sum of the lengths of those cells in which $\inf g_n \ge \tfrac13 p$; the contribution to $s_n$ from these cells is $\le q_n$ (by ($\alpha$)), and that from the other cells is $\le \tfrac13 p(1 - q_n)$. This means $\tfrac23 p \le s_n \le \tfrac13 p + q_n(1 - \tfrac13 p)$, and so $q_n(1 - \tfrac13 p) \ge \tfrac13 p$. Thus, if $A_n$ is the union of the interiors of those cells throughout which $g_n(x) \ge \tfrac13 p$, we have $(3 - p)|A_n| \ge p$, and this completes the proof.


5. Theorem 1 may be extended to Cauchy-Riemann integrals. If $\lim_{X\to\infty} (R)\int_0^X f(x)\,dx$ exists and is finite we denote it by $(CR)\int_0^\infty f(x)\,dx$.


THEOREM 2. Let $f_0, f_1, \ldots$ be nonnegative and defined on $[0, \infty)$ so that

$f_0(x) = \sum_{n=1}^{\infty} f_n(x)$ and $(R)\int_0^X f_n(x)\,dx$

exists for all $0 < X < \infty$ and $n \ge 0$. Then

(3)    $(CR)\int_0^\infty f_0(x)\,dx = \sum_{n=1}^{\infty} (CR)\int_0^\infty f_n(x)\,dx$

if the right-hand side is known to be finite or if the left-hand side is known to exist.

Proof. Choose $X$, $0 < X < \infty$; then for $0 \le x \le X$ and $N \ge 1$, $\sum_{n=1}^{N} f_n(x) \le \sup_{0 \le x \le X} f_0(x)$ and by Theorem 1

$(R)\int_0^X f_0(x)\,dx = \sum_{n=1}^{\infty} (R)\int_0^X f_n(x)\,dx.$

If R.H.S. of (3) is finite then

$(R)\int_0^X f_0(x)\,dx \le \sum_{n=1}^{\infty} (CR)\int_0^\infty f_n(x)\,dx$ for all $0 < X < \infty$.

Since $\int_0^X f_0(x)\,dx$ increases with $X$, this implies that L.H.S. $\le$ R.H.S. in (3), but

$(CR)\int_0^\infty f_0(x)\,dx \ge \sum_{n=1}^{N} (CR)\int_0^\infty f_n(x)\,dx$ $(N = 1, 2, \ldots)$,

which justifies (3).

If L.H.S. of (3) exists, then $(CR)\int_0^\infty f_n(x)\,dx$ exists for all $n$ and

$\sum_{n=1}^{N} (CR)\int_0^\infty f_n(x)\,dx \le (CR)\int_0^\infty f_0(x)\,dx$ $(N = 1, 2, \ldots)$;

this implies that R.H.S. of (3) is finite and again (3) is justified.
When the functions are not of constant sign we have 


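THEOREM 3. Let $f_0, f_1, \ldots$ be defined on $[0, \infty)$ so that

(i) $f_0(x) = \sum_{n=1}^{\infty} f_n(x)$,
(ii) $(R)\int_0^X f_n(x)\,dx$ exists for all $0 < X < \infty$ and $n \ge 0$, and
(iii) $(CR)\int_0^\infty \sum_{n=1}^{\infty} |f_n(x)|\,dx$ exists.

Then (3) is valid.


Proof. For $0 < X < X' < \infty$ and $r \ge 0$ we have (using (iii))

$\left| (R)\int_X^{X'} f_r(x)\,dx \right| \le (CR)\int_X^\infty \sum_{n=1}^{\infty} |f_n(x)|\,dx \to 0$ as $X \to \infty$;

this means that $(CR)\int_0^\infty f_r(x)\,dx$ exists for all $r \ge 0$, and so by (iii) and Theorem 2, both sides of (3) exist. For any real $X$ the partial sums of $\sum_{n=1}^{\infty} f_n(x)$ are dominated in $[0, X]$ by $\sup_{0 \le x \le X} \sum_{n=1}^{\infty} |f_n(x)|$, and so it follows from Theorem 1 that

$(R)\int_0^X f_0(x)\,dx = \sum_{n=1}^{\infty} (R)\int_0^X f_n(x)\,dx = \sum_{n=1}^{\infty} \left\{ (CR)\int_0^\infty f_n(x)\,dx - (CR)\int_X^\infty f_n(x)\,dx \right\},$

but

$\left| \sum_{n=1}^{\infty} (CR)\int_X^\infty f_n(x)\,dx \right| \le \sum_{n=1}^{\infty} (CR)\int_X^\infty |f_n(x)|\,dx = (CR)\int_X^\infty \sum_{n=1}^{\infty} |f_n(x)|\,dx$

(using Theorem 2), and the last integral $\to 0$ as $X \to \infty$ by (iii). This proves (3).

6. As an application to infinite repeated integrals we prove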


THEOREM 4. Let $f(x, y)$ be continuous and $f(x, y) \ge 0$ when $x, y \ge 0$. Then

(4)    $\int_0^\infty dy \int_0^\infty f(x, y)\,dx = \int_0^\infty dx \int_0^\infty f(x, y)\,dy,$

assuming all the integrals exist in the Cauchy-Riemann sense.


Proof. If $X$ and $Y$ are natural numbers we have

(5)    $\int_0^Y dy \int_0^X f(x, y)\,dx = \int_0^X dx \int_0^Y f(x, y)\,dy.$

For $0 \le y \le Y$, $\int_0^X f(x, y)\,dx$ is dominated by $\sup_{0 \le y \le Y} \int_0^\infty f(x, y)\,dx$. So, by (5) and Theorem 1 ($X$ being integral)

$\int_0^Y dy \lim_{X\to\infty} \int_0^X f(x, y)\,dx = \lim_{X\to\infty} \int_0^X dx \int_0^Y f(x, y)\,dy;$

hence

$\int_0^Y dy \int_0^\infty f(x, y)\,dx \le \int_0^\infty dx \int_0^\infty f(x, y)\,dy,$

which implies that L.H.S. $\le$ R.H.S. in (4). The assumptions being symmetric in $x$ and $y$, this proves (4).

Finally, the theorem on differentiation under the integral sign.


THEOREM 5. Suppose $g(x, y)$ is defined in $I$, the finite interval of $R_2$ given by $a < x < b$, $|y - y_0| < \delta$, so that

(i) $(\partial/\partial y)\,g(x, y)$ is bounded in $I$ and $(R)\int_a^b g_y(x, y_0)\,dx$ exists,
(ii) $(R)\int_a^b g(x, y)\,dx$ exists for $|y - y_0| < \delta$.

Then for $y = y_0$

$\dfrac{d}{dy} \int_a^b g(x, y)\,dx = \int_a^b g_y(x, y_0)\,dx.$


Proof. By (i),

$\dfrac{g(x, y_0 + h) - g(x, y_0)}{h} - g_y(x, y_0)$

is bounded for $a < x < b$ and $0 < |h| < \delta$, and for each $x$ this function tends to zero with $h$. Hence, by the analogue of Theorem 1 for continuous parameters, we have, by (ii),

$\lim_{h\to 0} \int_a^b \left\{ \dfrac{g(x, y_0 + h) - g(x, y_0)}{h} - g_y(x, y_0) \right\} dx = 0$

which, by (i), gives the result required.

of Mathematics, University of Wisconsin, Madison, WI 53706; M. W. Pownall, Department
of Mathematics, Colgate University, Hamilton, NY 13346. 


AN INTRODUCTION TO DIFFERENTIAL CALCULUS 
D. G. Herr, Duke University 


How do you begin a first course in calculus on a note of excitement and an- 
ticipation? How do you convey the essence of what the course is about in two 
or three introductory lectures? These are some of the questions which faced two 
of us this spring in planning a revamped calculus sequence. We decided to give 
three introductory lectures: 1. An Introduction to Differential Calculus, 2. An 
Introduction to Integral Calculus, and 3. An Introduction to the Real Numbers. 
The order of the first two is rather arbitrary, but we felt that since a lecture on 
the real numbers was potentially the driest of the three we would build up some 
interest in the properties of the real numbers (especially inequalities) by having 
the introduction to calculus precede the lecture on real numbers. So far our ideas 
paralleled those in several texts. 

The next problem we had to face was the content of the three lectures. Tak- 
ing the last first, we decided on an axiomatic treatment of the real numbers with 
some historical notes such as the discovery of the irrationality of π in 1761 by
Lambert [3] and the solution of the problem of “squaring the circle” as a result
of C. L. F. Lindemann’s proving in 1882 that π is not algebraic [3]. The lecture
on the integral was not difficult to construct in view of T. M. Apostol’s excellent 
introduction to integration in his calculus text [1]. However, the lecture on dif- 
ferential calculus was a problem. 

The usual introduction to differential calculus makes use of physical concepts 
such as velocity and thus immediately the focus of the discussion is on the limit 
of the difference quotient. This seems to us unfortunate for two reasons. First 
this attention to the derivative does not generalize satisfactorily to functions 
of several variables in that the existence of partial derivatives is not a generalization
of the existence of a derivative. In addition this immediate obsession with
the difference quotient obscures the actual problem being attacked and as a 
consequence such concepts as differentials often remain a mystery. 

We take the position that the principal concern of differential calculus is the 
linearizing of functions, or more completely the identification of linearizable 
(differentiable) functions and their subsequent linearization. To be more explicit 
suppose f is a real valued function defined on the real line, R. In geometric terms 
linearizing f will mean finding a straight line which best approximates f. A 
student can readily visualize such a linearization and furthermore simple ex- 
amples quickly demonstrate the futility of trying to obtain one straight line to 
approximate f over all of R in any reasonable way. In this way the point that
we are concerned with local properties of functions in differential calculus is 
clearly and forcefully made. The problem now is to move from the geometric 
argument to an analytic one which will be explicit enough to answer two ques- 
tions: “Which functions are linearizable at a point?” and “If a function is linear- 
izable, what is the ‘best’ straight line approximation?” To effect this transition 
we ask the student to help formulate a definition of “best straight line approxi- 
mation.” 

Since a nonvertical straight line in the plane has the general equation 
$y = ax + b$ and functions of the form $A = \{(x, y) : y = ax + b\}$ are called affine
functions, we want the “best affine approximation to f at a point.” 

It is not difficult to get the students to agree that one condition that a best affine approximation to $f$ at $x_0$ should satisfy is

(I)    $A(x_0) = f(x_0)$,

where $A$ is the approximating affine function. This establishes that $b = f(x_0) - ax_0$ so that we need only specify $a$, the slope of the line, to have $A$ completely determined. A second reasonable condition to impose on $A$ is that

(II)    $|A(x) - f(x)| \to 0$ as $|x - x_0| \to 0$,


i.e., the error made by approximating $f$ by $A$ at $x$ becomes arbitrarily small as $x$ tends to $x_0$. Examples easily demonstrate that (II) may pose no restriction whatever on $A$ and is thus not a strong enough condition for our purposes. However, although the error $|A(x) - f(x)|$ gets small for any $A$ satisfying (I) ($f$ continuous at $x_0$) as $x \to x_0$, examples are again enough to demonstrate that the rate at which the error gets small is not the same for all $A$. If the relative error $|A(x) - f(x)| / |x - x_0|$ is used as a measure of the rate at which the error tends to zero, it is natural enough to require the best approximation to have the best rate, i.e.,

(III)    $|A(x) - f(x)| / |x - x_0| \to 0$ as $|x - x_0| \to 0$.

Note here that (III) implies (II). By noting that if $A$ satisfies (I), $|A(x) - f(x)| = |f(x) - f(x_0) - a(x - x_0)|$, the student can readily see that (III) is equivalent to

(IV)    $\left| \dfrac{f(x) - f(x_0)}{x - x_0} - a \right| \to 0$ as $x \to x_0$.


It can then be pointed out that if the “best affine approximation to $f$ at $x_0$” is defined to be any $A$ which satisfies (I) and (III), then the slope of the best fitting straight line, $a$, satisfies (IV) which means that $a$ and thus $A$ are uniquely determined. Furthermore (IV) then provides an answer to both original questions, i.e., the functions $f$ which are linearizable at $x_0$ are precisely those for which the limit as $x \to x_0$ of $[f(x) - f(x_0)]/(x - x_0)$ exists as a real number, say $f'(x_0)$, and the best affine approximation is $A = \{(x, y) : y = f'(x_0)(x - x_0) + f(x_0)\}$. Thus the best affine approximation to $f$ at $x_0$ is defined to be any $A$ which satisfies (I) and (III).
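
A short computation can make the role of condition (III) concrete for students. The sketch below is only an illustration under stated assumptions: the function $f(x) = x^2$, the point $x_0 = 1$, the candidate slopes, and the helper name `relative_error` are all choices made here, not part of the lecture described above. Every candidate line passes through $(x_0, f(x_0))$, so (I) and (II) hold for each slope, but the relative error tends to zero only for the slope $f'(x_0) = 2$.

```python
def relative_error(f, a, x0, h):
    """|A(x) - f(x)| / |x - x0| at x = x0 + h, where A(x) = f(x0) + a (x - x0)."""
    x = x0 + h
    affine = f(x0) + a * (x - x0)   # condition (I): A(x0) = f(x0)
    return abs(affine - f(x)) / abs(x - x0)

def f(x):
    return x * x                    # illustrative choice; f'(1) = 2

x0 = 1.0
for a in (1.0, 2.0, 3.0):           # candidate slopes
    print(a, [relative_error(f, a, x0, 10.0 ** -k) for k in (1, 2, 3, 4)])
```

For this $f$ the relative error equals $|a - 2 - h|$, so it shrinks with $h$ exactly when $a = 2$.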

With this foundation the student can see limits of difference quotients, 
derivatives, and differentiability in perspective. Also the differential of f at $x_0$
as the linear part of the best affine approximation to f at $x_0$ is a straightforward
concept without any of the mystery usually associated with a differential. Fi- 
nally these ideas generalize directly to vector-valued functions of several vari- 
ables with the understanding that linearizability and differentiability are 
synonymous [2].

GENESIS*
Chapter 1 


In the beginning Euclid created space. 
2. And space was without structure, and 
void; and darkness was upon the face of 
deep space. And the spirit of Euclid moved. 
3. And Euclid said let there be lines: and 
there were lines. 

4. And Euclid saw the lines, that they were 
good: and Euclid divided the lines with 
points. 

5. And Euclid called these lines rays. And 
the extent of the line was the first axiom. 
6. And Euclid said let there be planes, and 
let them divide the points from the points. 


7. And Euclid made planes and they 
divided the points which were under the 
planes from the points which were above 
the planes: and it was so. 

8. And Euclid called the planes flat. And 
the evening and the morning were the 
second axiom. 

9. And Euclid said let the points be 
gathered together unto one place, and let 
solids appear: and it was so. 

10. And Euclid called the solids real; and 
the gathering together of the points he 
called space; and Euclid saw that it was 
good. 


* This is a translation of an MS recently found in an ancient wine vessel in a cave in the 
mountains of Iraq. There are missing sections because of the fragmentary nature of the MS,

11. And Euclid said, let the space bring
forth segments, the segments yielding 
fruit after their kind whose seed is in 
itself: and it was so. 

12. And space brought forth segments, 
yielding segments upon segments to make 
number lines: and Euclid saw that it was 
good. 

13. And the evening and the morning were 
the third axiom. 

14. And Euclid said let there be orbs in the 
firmament of space to divide the inside 
from the outside and let them be signs for 
the seasons, and for days, and for years. 
15. And let them be for gems in the firma- 
ment of space to give delight upon the 
earth: and it was so. 

16, 17, 18. And Euclid made two great 
orbs the greater orb to rule the firmament 
and the lesser orb to rule the planes: he 
made the triangles also: and Euclid saw 
that it was good. 

19. And the evening and the morning were 
the fourth axiom. 

20, 21. And Euclid said let the points bring 
forth abundantly parallels so that alternate 
interiors are forever equal. And Euclid 
created great cubes, and squares, and 
every parallelogram that cometh, which 
the points brought forth abundantly, after 
their kind one upon another: and Euclid 
saw that it was good.

22. And Euclid blessed them, saying, be
fruitful and multiply and with parallelo- 
grams and parallelepipeds fill the planes 
and space. 

23. And the evening and the morning were 
the fifth axiom. 

26. And Euclid said let us make a Mathe- 
matician in our image, after our likeness; 
and let him have dominion over triangles 
and over circles, and over spheres and over 
cubes, and over all of space, and every con- 
structable subset that is constructed out 
of space. 

27. So Euclid created a Mathematician in 
his own image, in the image of Euclid 
created he him; great and small created 
he them. 

28. And Euclid blessed them, and Euclid 
said unto them, be fruitful, and multiply 
and prove theorems and subdue them: and 
have dominion over triangles, and over 
spheres, and over every set of points that 
cometh out of space. 

29. And Euclid said Behold, I have given 
you every set bearing proof, which is on the 
face of all of space, and every set which is 
the fruit of a set yielding proof; to you it 
shall be for theorems. 

31. And Euclid saw everything that he 
had made, and, behold, it was very good. 
And the evening and the morning were the 
sixth day. 


Chapter 2 


Thus space and all its subsets were 
finished, and all the uncountable host of 
them. 

2. And on the seventh day Euclid ended 
the theory which he had made; and he 
rested on the seventh day from all the sets 
which he had made. 

4. These are the generations of the sets and
points and of the space when they were
created, in the day that the Greek Euclid
made the space and its subsets.

8. And Euclid builded a library eastward 
in Erewhon; and there he put the Mathe- 
matician whom he had formed. 

9. And out of the points made the Greek 
Euclid to grow every set that is pleasant 
to the sight, and good for proof; the set of 


logic also in the midst of the library, and 
the set of knowledge of the excluded 
middle. 

15. And Euclid took the Mathematician, 
and put him in the library to read it and 
keep it. 

16. And Euclid commanded the Mathe- 
matician, saying, Of every set in the 
library thou may freely partake: 

17. But of the set of the knowledge of the 
excluded middle, thou shalt not partake 
of it: for the day that thou partake thereof 
thou shalt surely die. 

18. And Euclid said it is not good that the 
Mathematician should be alone; I will 
make a help meet for him.

21. And Euclid caused a deep sleep to fall
on the Mathematician, and he slept; and 
he took one of his brains, and closed up the 
flesh instead thereof; 


22. And the brain which the Greek Euclid 
had taken from Mathematician, made he a 
Subman and brought him unto the Mathematician.

23. And the Mathematician said, this is
now bone of my bones, and flesh of my 
flesh; he shall be called Subman because 
he was taken out of man. 

24. Therefore shall a man leave his men- 
tors, and shall cleave to his student: and 
they shall be one mind. 

25. And they were both stupid, the man 
and his student, and were not ashamed. 


Chapter 3 


Now the Uncountable was more subtle 
than any set of the space which the Greek 
Euclid had made. And he said unto the 
Subman, Yea, hath Euclid said, Ye shall 
not partake of every set in the library? 

2. And the student said to the Uncount- 
able, We may partake of all sets in the 
library: 

3. But of the fruit of the set which is in the 
midst of the library, Euclid hath said, Ye 
shall not partake of it, neither shall ye 
touch it, lest ye die. 

4. And the Uncountable said unto the 
student, Ye shall not surely die: 

5. For Euclid doth know that in the day 
thou partake thereof, then your eyes shall 
be opened, and ye shall be as gods, know- 
ing consistency and contradiction. 

6. And when the student saw that the set 
was good for thought, and that it was 
pleasant to the mind, and a set to be 
desired to make one wise, he partook of the 
set thereof, and did think, and gave also 
unto his master with him; and he did 
partake. 

7. And the eyes of both of them were 
opened, and they knew that they were 
stupid; and they invented classes and made 
systems. 

8. And they heard the voice of Euclid 
walking in the library; and the Mathe- 
matician and his student hid themselves 
from the presence of Euclid amongst the 
footnotes in the library. 

9. And Euclid called to the Mathematician 
and said unto him, Where art thou? 

10. And he said, I heard thy voice in the 
library and I was afraid because I was 
stupid; and I hid myself. 

11. And he said, Who told thee that thou 
wast stupid? Hast thou partaken of the 
set, whereof I forbade thee? 


12. And the Mathematician said, The 
Subman whom thou gavest to be with me, 
he gave me of the set, and I did think. 

13. And the Greek Euclid said unto the 
Subman, What is this that thou hast done? 
And the Subman said, The Uncountable 
beguiled me and I did think. 

14. And Euclid said unto the Uncountable, 
Because thou hast done this thou art cursed 
above all other sets. To the foundations 
must thou go for all the days of thy life. 
15. And I will put enmity between thee 
and the student, and between thy subsets 
and his subsets; it shall bruise thy pride, 
and thou shalt bruise his mind. 

16. Unto the student he said, I will greatly 
multiply thy sorrow; in sorrow shalt thou 
bring forth lemmas; and thy desire shall 
be to thy master, and he shall rule over 
thee. 

17. And unto the Mathematician he said, 
Because thou hast hearkened to the voice 
of thy student, and has thought of the set, 
of which I commanded thee, saying, Thou 
shalt not think of it: cursed is the structure 
for thy sake; in sorrow shalt thou think of 
it all the days of thy life. 

22. And the Greek Euclid said, Behold, the 
Mathematician is become as one of us, to 
know consistency and contradiction: and 
now lest he put forth his hand and partake 
of the set of logic, and think, and prove 
theorems forever: 

23. Therefore Euclid sent him forth from 
the library of Erewhon, to prove, without a 
library card, the theorems from whence 
he was taken. 

24. So he drove out the Mathematician; 
and he placed at the east of the library of 
Erewhon Cher-u-bims Kronecker, and 
Hilbert, and Brouwer, with flaming words 
which turn every way, to keep the way of 
the set of logic.

E 2215. Proposed by Michael Goldberg, Washington, D. C.


Find the shapes of the quadrilaterals of least area K which enclose three 
given circles of radii 1, 0, 0. 


E 2216. Proposed by M. S. Klamkin, Ford Scientific Laboratory

Which of the two integrals

$\int_0^1 x^x\,dx, \qquad \int_0^1 \int_0^1 (xy)^{xy}\,dx\,dy$

is larger?


E 2217.* Proposed by Robert Spira, Michigan State University

$A$, $B$, $C$, $D$ are complex numbers with $AC \ne 0$. Show that

$\dfrac{\max(|AC|, |AD + BC|, |BD|)}{\max(|A|, |B|) \cdot \max(|C|, |D|)} \ge \dfrac{-1 + \sqrt{5}}{2}.$
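
As a numerical illustration only (E 2217 is starred, and nothing here is a proof), the ratio in that problem can be evaluated for particular values. The choice $A = C = 1$, $B = (1 + \sqrt{5})/2$, $D = -1/B$ below is an assumption made for this sketch, as is the helper name `ratio`; for this choice the ratio works out to $(-1 + \sqrt{5})/2$.

```python
import math

def ratio(A, B, C, D):
    """max(|AC|, |AD + BC|, |BD|) / (max(|A|, |B|) * max(|C|, |D|))."""
    top = max(abs(A * C), abs(A * D + B * C), abs(B * D))
    bottom = max(abs(A), abs(B)) * max(abs(C), abs(D))
    return top / bottom

phi = (1 + math.sqrt(5)) / 2          # illustrative value of B
print(ratio(1, phi, 1, -1 / phi))     # ratio for the chosen A, B, C, D
print((math.sqrt(5) - 1) / 2)         # the constant (-1 + sqrt 5) / 2
```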
E 2218. Proposed by CHED, Stratford-on-Penobscot


An interesting geometric example of a noncommutative, nonassociative hoop is given by the binary operation $*$ that maps points $A$ and $B$ in the plane into point $C$ in the plane where $ABC$ is a counterclockwise-oriented equilateral triangle. Prove the medial property for a hoop, that $(A * B) * (C * D) = (A * C) * (B * D)$ for all points $A$, $B$, $C$, $D$ in the plane.
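
The medial property in E 2218 can be checked numerically before it is proved. The sketch below rests on an assumption made here: points of the plane are modelled as complex numbers, so the third vertex of a counterclockwise equilateral triangle on $A$, $B$ is $A + (B - A)e^{i\pi/3}$. The names `star` and `medial_gap` are hypothetical helpers.

```python
import cmath
import random

OMEGA = cmath.exp(1j * cmath.pi / 3)   # rotation by 60 degrees counterclockwise

def star(a, b):
    """Third vertex c such that a, b, c is a counterclockwise equilateral triangle."""
    return a + (b - a) * OMEGA

def medial_gap(a, b, c, d):
    """Distance between (a*b)*(c*d) and (a*c)*(b*d); zero up to rounding."""
    return abs(star(star(a, b), star(c, d)) - star(star(a, c), star(b, d)))

random.seed(0)
points = [complex(random.uniform(-5, 5), random.uniform(-5, 5)) for _ in range(4)]
print(medial_gap(*points))
```

A random test of this kind is evidence, not a proof; the problem still asks for an argument valid for all points.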


E 2219. Proposed by Dan Marcus, York University, Toronto

Let $X$ be an unbounded set of real numbers. Show that

$\{t \mid tX \text{ is dense modulo } 1\}$

is dense in the reals.


E 2220. Proposed by Otto Morphy

Attach an integer $\theta(G)$ to each finite group $G$ by means of the equations

$\theta(I) = 1, \qquad \sum_{H \subseteq G} \theta(H) = \operatorname{ord}(G),$

where $I$ is the trivial group and the sum ranges over all subgroups $H$ of $G$, including $I$ and $G$ itself. This may be regarded as a recursion formula for $\theta$ so that there is precisely one such function. Show that $\theta(G) = 0$ if $G$ is nonabelian.


SOLUTIONS OF ELEMENTARY PROBLEMS 
Unions of Subsets of a Finite Set 


E 2106 [1968, 779; 1969, 697]. Proposed by Bernt Lindstrém, University of 
Stockholm, Sweden. 


Let $S$ be a set with $n$ elements and $M_1, M_2, \ldots, M_{n+1}$ be nonempty subsets of $S$. Prove that one can find $r$, $s$, and $r + s$ distinct indices $i_1, i_2, \ldots, i_r, j_1, j_2, \ldots, j_s$ such that

$M_{i_1} \cup \cdots \cup M_{i_r} = M_{j_1} \cup \cdots \cup M_{j_s}.$

Note. The published solution is incorrect, as noted by E. F. Schmeichel, E. J. Zirkel, and others: if $M \cup M_1 = M \cup M_2$, it does not follow that $M_1 = M_2$.


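II. Solution by Eddy Smet, University of Western Ontario. Let $S = \{a_1, a_2, \ldots, a_n\}$ and let $M_1, \ldots, M_{n+1}$ be nonempty subsets of $S$. Corresponding to each set $M_r$ is a nonzero vector

$m_r = (m_{r1}, m_{r2}, \ldots, m_{rn})$, where $m_{ri} = \begin{cases} 1 & \text{if } a_i \in M_r, \\ 0 & \text{if } a_i \notin M_r. \end{cases}$

The $n + 1$ vectors $m_1, \ldots, m_{n+1}$ are linearly dependent, and so, since $m_{ri} \ge 0$, there are $r + s$ distinct indices $i_1, i_2, \ldots, i_r, j_1, j_2, \ldots, j_s$ ($r \ge 1$, $s \ge 1$, $n + 1 \ge r + s$), and positive numbers $\lambda_{i_1}, \ldots, \lambda_{i_r}, \lambda_{j_1}, \ldots, \lambda_{j_s}$ such that

$\lambda_{i_1} m_{i_1} + \cdots + \lambda_{i_r} m_{i_r} = \lambda_{j_1} m_{j_1} + \cdots + \lambda_{j_s} m_{j_s},$

which implies

$M_{i_1} \cup \cdots \cup M_{i_r} = M_{j_1} \cup \cdots \cup M_{j_s}.$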
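
For small cases the conclusion of E 2106 can be confirmed by exhaustive search. The sketch below is not the method of the solution above, which uses linear dependence; it is a brute-force search swapped in for illustration, and the function name `equal_unions` and the sample family are assumptions made here.

```python
from itertools import combinations

def equal_unions(sets):
    """Search for disjoint index families I, J whose unions coincide (brute force)."""
    m = len(sets)
    indices = range(m)
    for r in range(1, m):
        for I in combinations(indices, r):
            union_I = set().union(*(sets[i] for i in I))
            rest = [j for j in indices if j not in I]
            for s in range(1, len(rest) + 1):
                for J in combinations(rest, s):
                    if set().union(*(sets[j] for j in J)) == union_I:
                        return I, J
    return None

# n = 3 elements and n + 1 = 4 nonempty subsets (an arbitrary illustrative family)
family = [{1}, {1, 2}, {2, 3}, {3}]
print(equal_unions(family))
```

The search is exponential in the number of sets, so it is useful only as a check on small families.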
A Convergent Sequence


E 2115 [1968, 897; 1969, 828]. Proposed by K. M. Brown, Cornell University

Let a sequence $S_n$ be defined by

$S_n = \dfrac{n+1}{2^{n+1}} \sum_{i=1}^{n} \dfrac{2^i}{i}, \qquad n = 1, 2, \ldots.$

Show that $\lim_{n\to\infty} S_n$ exists and find the value of this limit.
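
The behaviour of the sequence in E 2115 can be explored numerically. The sketch below assumes the sequence reads $S_n = \frac{n+1}{2^{n+1}} \sum_{i=1}^{n} 2^i/i$; that reading of the formula, and the function name `S`, are assumptions made here. Exact rational arithmetic avoids overflow for large $n$.

```python
from fractions import Fraction

def S(n):
    """S_n = (n + 1) / 2^(n + 1) * sum_{i=1}^{n} 2^i / i, computed exactly."""
    total = sum(Fraction(2 ** i, i) for i in range(1, n + 1))
    return Fraction(n + 1, 2 ** (n + 1)) * total

for n in (1, 10, 100, 1000):
    print(n, float(S(n)))
```

Under this reading the printed values settle near 1, which suggests the value of the limit but does not prove that it exists.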


Editorial Note. It has been brought to our attention that the proposed
formula is an immediate consequence of 


nm 2 tl 1 


given on p. 450 of Randolph, Basic Real and Abstract Analysis.


Integers All of Whose Divisors are of the Same Form 


E 2167 [1960, 413]. Proposed by Steven Feigelstock, Polytechnic Institute of Brooklyn.

Which positive integers of the form $p^n - 1$, $p$ a prime, have all their divisors of the same form?


Solution by D. C. B. Marsh, Colorado School of Mines. Let $p^n - 1$ ($p$ a prime) be an integer all of whose divisors have the same form.

1) If $2^a$ divides $p^n - 1$, then $a \le 4$ since $2^5 + 1$ is not a prime power.

2) If $q^b$ divides $p^n - 1$ ($q$ an odd prime), then $q = 2^a - 1$, a Mersenne prime; furthermore, $b = 1$ since $q^2 + 1$ is even with at least one nontrivial odd factor (and thus not a prime power).

3) Should $q$ and $r$ be two Mersenne primes, not both can divide $p^n - 1$ since $qr + 1$ is even with at least one nontrivial odd factor (and thus not a prime power).

4) If 2 and $q$ (a Mersenne prime) both divide $p^n - 1$, then $2q + 1$ must also be a Mersenne prime, and thus $q = 3$.

Thus, numbers of the desired form are limited to Mersenne primes and the divisors of 48 ($= 2^4 \cdot 3$), each of which does possess the desired property.
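
The final claim can be checked by machine for small integers. The sketch below is an illustration only; the helper names `is_prime_power` and `all_divisors_of_form` and the search bound 200 are assumptions made here. An integer $d$ has the form $p^n - 1$ exactly when $d + 1$ is a prime power.

```python
def is_prime_power(m):
    """True if m = p^k for a prime p and an integer k >= 1."""
    if m < 2:
        return False
    p = 2
    while p * p <= m:
        if m % p == 0:
            while m % p == 0:
                m //= p
            return m == 1          # no other prime factor may remain
        p += 1
    return True                    # m itself is prime

def all_divisors_of_form(N):
    """True if every positive divisor d of N has d + 1 equal to a prime power."""
    return all(is_prime_power(d + 1) for d in range(1, N + 1) if N % d == 0)

print([N for N in range(1, 200) if all_divisors_of_form(N)])
```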


Also solved by Orin Chein, W. F. Fox, D. E. Frohardt, J. R. Kuttler, C. B. A. Peck, Bob 
Prielipp, E. F. Schmeichel, Philip Trauber, C. S. Venkataraman (India), and the proposer. 


A Correct Conjecture 


E 2172 [1969, 553]. Proposed by S. I. Drobnies, San Diego College

Prove or disprove: The complex number $z$ belongs to the set $\{w : |w| - \operatorname{Re} w \le \tfrac12\}$ if and only if $z$ is a product $ac$ such that $|\bar{c} - a| \le 1$.


Solution by J. R. Kuttler, The Johns Hopkins University Applied Physics Laboratory. The statement is correct and is a consequence of the identity

$|ac| - \operatorname{Re} ac = \tfrac12 |\bar{c} - a|^2 - \tfrac12 (|c| - |a|)^2.$

The “if” part is immediate. For the “only if” part choose $a$ and $c$ such that $|a| = |c| = |z|^{1/2}$ and the second term on the right vanishes.
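
The identity in the solution is easy to spot-check numerically. The sketch below assumes the identity reads $|ac| - \operatorname{Re} ac = \tfrac12|\bar{c} - a|^2 - \tfrac12(|c| - |a|)^2$; the helper names `lhs` and `rhs`, the sampling range, and the seed are choices made here for illustration.

```python
import random

def lhs(a, c):
    """|ac| - Re(ac)."""
    return abs(a * c) - (a * c).real

def rhs(a, c):
    """(1/2)|conj(c) - a|^2 - (1/2)(|c| - |a|)^2."""
    return 0.5 * abs(c.conjugate() - a) ** 2 - 0.5 * (abs(c) - abs(a)) ** 2

random.seed(1)
worst = 0.0
for _ in range(1000):
    a = complex(random.uniform(-3, 3), random.uniform(-3, 3))
    c = complex(random.uniform(-3, 3), random.uniform(-3, 3))
    worst = max(worst, abs(lhs(a, c) - rhs(a, c)))
print(worst)   # largest discrepancy found; rounding error only
```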


Also solved by L. W. Beineke, Stephen Berman, J.C. Binz (Switzerland), M. T. Bird, Douglas 
Campbell, W. O. Egerland, E. N. Fischman, Michael Goldberg, D. L. Grant, Robert Heller, 
C. V. Heuer & G. A. Heuer, E. F. Knapp, D. C. B. Marsh, K. A. Ribet, Steve Rohde, Perry 
Scheinok, E. F. Schmeichel, J. H. Simester, Charles Snygg, John Swetits, Charles Wexler, Mark Yu, 
D. A. Zave, E. J. Zirkel, and P. J. Zwier. 


The Ballot Problem Revisited 


E 2174 [1969, 553]. Proposed by Klaus Steffen, Johannes Gutenberg Univer- 
sity, Mainz, Germany 


Consider the set $M$ of arrays in a row of the $2n$ symbols $a_1, \ldots, a_n$, $b_1, \ldots, b_n$ such that $a_i$ precedes $a_{i+1}$ and $b_i$ precedes $b_{i+1}$ for $1 \le i \le n - 1$. An inversion is a pair $(a_i, b_i)$ such that $a_i$ does not precede $b_i$. For $0 \le k \le n$, let $v_k$ be the number of elements of $M$ with $k$ inversions. Prove or disprove that $v_0 = v_1 = \cdots = v_n$.


Solution by Harry Lass, Jet Propulsion Laboratory, California Institute of Technology. The set $M$ consists of $\binom{2n}{n}$ arrays and is equivalent to the set of paths of the $n \times n$ grid problem wherein one can only move horizontally and vertically to reach an opposite vertex (the successive horizontal paths are denoted by $a_1, \ldots, a_n$, and the successive vertical paths are denoted by $b_1, \ldots, b_n$).

Clearly, the number of inversions is simply the number of vertical paths above the principal diagonal. Thus

$v_0(n) = v_n(n) = \binom{2n}{n} \Big/ (n + 1),$

obtained from the solution of the ballot problem, allowing ties. (See, e.g., Feller, An Introduction to Probability Theory and Its Applications, vol. 1.)

Now let $v_k(n)$ be the number of paths with $k$ vertical paths above the principal diagonal ($k$ inversions). If the first move is horizontal ($a_1$ recorded first), the diagonal will be reached after $2r$ moves in

$\binom{2r-1}{r} \Big/ (2r - 1)$ ways,

and $k$ inversions will be needed in the remaining $(2n - 2r)$ moves accomplished in $v_k(n - r)$ ways. If the first move is vertical, the diagonal will be reached after $2s$ moves yielding $s$ inversions, and $k - s$ further inversions are required. Thus

$v_k(n) = \sum_{r=1}^{n-k} \dfrac{1}{2r-1} \binom{2r-1}{r} v_k(n - r) + \sum_{s=1}^{k} \dfrac{1}{2s-1} \binom{2s-1}{s} v_{k-s}(n - s).$

By direct enumeration it is seen that $v_k(1)$, $v_k(2)$ are independent of $k$. Assume that $v_k(m)$ is independent of $k$ for $m = 1, 2, \ldots, n - 1$, with

$v_k(m) = \binom{2m}{m} \Big/ (m + 1).$

From above we obtain

$v_{k+1}(n) - v_k(n) = \dfrac{1}{2k+1} \binom{2k+1}{k+1} \cdot \dfrac{1}{n-k} \binom{2n-2k-2}{n-k-1} - \dfrac{1}{2n-2k-1} \binom{2n-2k-1}{n-k} \cdot \dfrac{1}{k+1} \binom{2k}{k} = 0.$

Thus, by mathematical induction, we have shown that

$v_0(n) = v_1(n) = v_2(n) = \cdots = v_n(n) = \binom{2n}{n} \Big/ (n + 1).$

Also solved by Michael Goldberg, M. G. Greening (Australia), and E. F. Schmeichel. 


A Maximization Problem 
E 2175 [1969, 554]. Proposed by Harry Pollard, Purdue University 


Find the maximum value of the function

$\exp\left(-\dfrac{1}{x} e^{-x}\right) + \exp\left(-x e^{-1/x}\right), \qquad 0 < x < \infty.$


Solution by Simeon Reich, The Technion, Israel Institute of Technology. The maximum of our function $f(x)$ is attained at $x = 1$, and is equal to $2e^{-1/e} = 1.38440$ approximately.

Proof. Since $f(x) = f(1/x)$, we may consider only the interval $[1, \infty)$. Clearly it is sufficient to prove that $f'(x) < 0$ for $x > 1$.

After some computation it is seen that we have to show that for $x > 1$, $g(x) > 0$, where $g(x) = \log x + x + (1/x)e^{-x} - 1/x - xe^{-1/x}$. We have $g(1) = 0$. We shall prove that $g'(x) > 0$ for $x > 1$.

In order to show this we have to establish that

$(1/x^2)e^{-x} + (1/x)e^{-x} + e^{-1/x} + (1/x)e^{-1/x} < 1/x + 1 + 1/x^2.$

This is true because for $1 < x < \infty$, $e^{-1/x} < 1$, and $(1/x)e^{-1/x} < 1/x$. Also $(1/x^2)e^{-x} + (1/x)e^{-x} < 1/x^2$ because this is equivalent to $e^x > 1 + x$ which holds for all $x \ne 0$.
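
A coarse grid search gives an independent check on the stated maximum. The sketch below assumes the function reads $f(x) = \exp(-e^{-x}/x) + \exp(-xe^{-1/x})$; the grid spacing, the range $0.01 \le x \le 20$, and the variable names are choices made here for illustration.

```python
import math

def f(x):
    """f(x) = exp(-(1/x) e^{-x}) + exp(-x e^{-1/x}) for x > 0."""
    return math.exp(-math.exp(-x) / x) + math.exp(-x * math.exp(-1.0 / x))

grid = [0.01 * k for k in range(1, 2001)]      # x from 0.01 to 20
best = max(grid, key=f)
print(best, f(best), 2 * math.exp(-1 / math.e))
```

A grid search cannot replace the proof, but it agrees with the symmetry $f(x) = f(1/x)$ and with the value $2e^{-1/e}$ at $x = 1$.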


Also solved by M. J. Brown, W. O. Egerland, Gretchen Ehman, M. A. Ettrick, R. D. Gee, 
A. F. Gentzel, Jr., Michael Goldberg, Emil Grosswald, Carl Hammer, Stephen Hoffman, Dennis 
Kump, P. A. Lindstrom, Beatriz Margolis (Argentina), Gus Mavrigian, J. V. Michalowicz, M. H. 
Moore, C. B. A. Peck, Marlow Sholander, E. F. Schmeichel, J. R. Ventura, Jr., L. E. Ward, Sr., 
M. R. Wise, and P. H. Young.

ADVANCED PROBLEMS


All solutions of Advanced Problems should be sent to J. Barlaz, Rutgers—The State Uni- 
versity, New Brunswick, N. J. 08903. Solutions of Advanced Problems in this issue should 
be typed (with double spacing) on separate, signed sheets and should be mailed before May 31, 
1970. Contributors (in the United States) who desire acknowledgement of receipt of their 
solutions are asked to enclose self-addressed, stamped postcards. An asterisk (*) means 
neither the proposer nor the editors supplied a solution. 


5714. Proposed by Marlow Sholander, Case Western Reserve University 


Let O be the origin, let ABCD be a line segment with endpoints on the coordinate
axes, and let OBC be equilateral. Let r = AB, s = BC, and t = CD be positive
integers. A triple r, s, t has a dual t, s, r and a sum r + s + t. It is called
primitive if (r, s, t) = 1. Two triples are called dependent if one is proportional
to the other or to its dual. Pairs of independent primitive triples are called pips. 
Two such triples may have an element in common or they may have a common 
sum. 

(i) What primes are found as elements shared by pips? 

(ii) What is the minimum sum shared by pips? 


5715.* Proposed by Anon, Erewhon-upon-Wabash 


Let $R = Z[a_1, a_2, a_3, b_1, b_2, b_3]$ with the defining relations $b_1b_2 = b_2b_3 = b_3b_1 = 0$, $a_1b_1 + a_2b_2 + a_3b_3 = 1$. Let $M$ be the $R$-module generated by $x_1, x_2, x_3$ with relations $b_1x_1 = b_2x_2 = b_3x_3 = 0$. Show that $M$ is free of rank two. Generalize.


5716. Proposed by Harry Kesten, Cornell University 


Let $X_1, X_2, \ldots$ be a sequence of independent identically distributed random
variables and $S_n=\sum_{i=1}^{n}X_i$. Show that

\[ \limsup_{n\to\infty}\frac{|X_n|}{|S_{n-1}|}=\infty \quad\text{with probability 1,} \tag{1} \]

whenever $E|X_1|$, the expectation of $|X_1|$, is infinite. Use (1) to derive the
following theorem of Chow and Robbins (Proc. Nat. Acad. Sci., 47 (1961)
330-335): When $E|X_1|=\infty$ then for any sequence $\{b_n\}$ of positive numbers
either

\[ \limsup\left|\frac{S_n}{b_n}\right|=0 \quad\text{with probability 1} \]

or

\[ \limsup\left|\frac{S_n}{b_n}\right|=\infty \quad\text{with probability 1.} \]


5717. Proposed by W. H. Ruckle, Lehigh University 


Without using the axiom of choice (i.e., no Hamel basis) construct a continuum
$\{X_r : 0<r<1\}$ of linear subspaces of Hilbert space $H$ which has the




following properties: (a) $X_r$ is dense in $H$ for each $r$; (b) if $r<s$, $X_r\subset X_s$ and
$X_s/X_r$ has uncountable dimension; (c) $\bigcup_r X_r\neq H$.


5718.* Proposed by M. T. Boswell and G. P. Patil, Pennsylvania State 
University 
Find all functions g(x) which are complex-valued functions of a real variable
satisfying

\[ \frac{dg(cx)}{dx}=g(x+1)-g(x) \]

for x>0 and fixed c>0. The linear functions satisfy this equation.


5719. Proposed by Roger Lyndon, University of Michigan


For $k\ge 0$, let S be the set of all numbers of the form

\[ s=\sqrt{k\pm\sqrt{k\pm\cdots\pm\sqrt{k}}} \]

with arbitrary finite sequence of signs. If $k\ge 2$, then all s in S are real. Prove:
(1) if k=2, then S is dense in the interval (0, +2); (2) if k>2, then S is dense
in no real interval.


SOLUTIONS OF ADVANCED PROBLEMS 


Correction. As one of the solvers of Problem 5614 [1969, 708], read Sidney
Glusman, instead of Sidney Glicksman.


Continuous Functions with Zero 
5657 [1969, 200]. Proposed by Erwin Just, Bronx (N. Y.) Community College 


Prove that there exists a real valued function f, which is defined on $(0,\infty)$,
is continuous at an infinite number of points and has the property that for each
x, f(x)=0 if and only if $f(2x)\neq 0$.


I. Solution by M. S. Demos, Villanova University. Set f(x)=0 when $2^{2k}<x\le 2^{2k+1}$
and f(x)=1 when $2^{2k-1}<x\le 2^{2k}$, where k is any integer (positive or negative).
This f satisfies the conditions of the problem.
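
The function of Solution I can be checked numerically. The following is only a sketch: the name `f`, the use of `math.frexp` to locate the dyadic interval containing x, and the sample points are choices made for this illustration and are not part of the printed solution.

```python
import math

def f(x: float) -> int:
    """Solution I: f = 0 on (2^(2k), 2^(2k+1)] and f = 1 on (2^(2k-1), 2^(2k)]."""
    if x <= 0:
        raise ValueError("f is only defined on (0, infinity)")
    # frexp gives x = mant * 2**e with 0.5 <= mant < 1, i.e. 2**(e-1) <= x < 2**e.
    mant, e = math.frexp(x)
    # m is the unique integer with 2**(m-1) < x <= 2**m.
    m = e - 1 if mant == 0.5 else e
    # m even: x lies in (2^(2k-1), 2^(2k)], so f = 1; m odd: f = 0.
    return 1 if m % 2 == 0 else 0

# Sample points chosen for illustration only.
samples = [0.001, 0.3, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 7.25, 1024.0]
# The defining property: f(x) = 0 exactly when f(2x) != 0.
checks = [(f(x) == 0) == (f(2 * x) != 0) for x in samples]
print(all(checks))
```

Doubling x raises the exponent m by one, which flips its parity; this is why the property holds. The function is constant, hence continuous, in the interior of each dyadic interval.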


II. Comment by Frank Meyer, Hamline University. A related question is
whether there exists a continuous function f defined on $(0,\infty)$ such that f(x)=0
if and only if $f(2x)\neq 0$. Actually f cannot be continuous everywhere. For suppose
f is a function such that f(x)=0 if and only if $f(2x)\neq 0$. Then in any interval
$(a,b)\subset(0,\infty)$ which contains points $\alpha$, $\beta$ with $f(\alpha)=0$ and $f(\beta)\neq 0$, there is a point
$\gamma$ (not necessarily distinct from $\alpha$) and a sequence $\{x_n\}$ such that $f(\gamma)=0$,
$f(x_n)\neq 0$ for all n, and $\lim_{n\to\infty}x_n=\gamma$. But then by hypothesis, $f(2\gamma)\neq 0$, $f(2x_n)=0$
for all positive integers n and so

\[ \lim_{n\to\infty} f(2x_n)\neq f(2\gamma). \]

Hence f is discontinuous at $2\gamma$, a contradiction.


Also solved by forty-eight other readers. 
The question answered in Solution II above was also raised and resolved by the proposer. 


Duals of Nonmetrizable Topological Spaces 
5658 [1969, 309]. Proposed by Stephen Gelbart, Princeton University 


Let E be a locally convex topological vector space, and E' its topological
dual equipped with the weak topology $\sigma(E',E)$. Is it true that E nonmetrizable
implies E' nonmetrizable?


Solution by Donald Hartig, University of California at Santa Barbara. Not
necessarily. $E'[\sigma(E',E)]$ is metrizable if and only if there exists a countable
subset F of E having the property that for each finite subset S of E there exists
a finite subset S' of F such that the balanced, convex hull of S is contained in the
balanced, convex hull of S'. This condition is clearly satisfied for any locally
convex space of countable dimension. Hence, for example, if E denotes the
locally convex direct sum of a countable number of copies of the reals, E is
not metrizable but $E'[\sigma(E',E)]$ is metrizable.


Also solved by Joel Anderson, Stephen Berman, Joe Cross, P. J. Holewijn & P. van der Steen 
(Netherlands), D. R. Horner, D. R. Kerr, Jr., H. Pétard, Barry Simon, D. R. Sherbert, Alan 
Shuchat, and the proposer. 


Iterates and Fixed Points of a Continuous Function 


5660 [1969, 309]. Proposed by W.-G. Dotson, Jr., North Carolina State 
University 


(1) Show that if T is a continuous function from the reals to the reals and
there is a number x such that the sequence

\[ \frac{x+T(x)+T^{2}(x)+\cdots+T^{n-1}(x)}{n} \tag{*} \]

of Cesaro means of the Picard iterates is bounded, then T has a fixed point.

(2) Find a continuous function T from the reals to the reals which has a
unique fixed point p and such that if $x\neq p$ then the sequence (*) is bounded but
does not converge to p.


Solution by Douglas Lind, Stanford University. (1) Suppose T has no fixed
points. Then by the intermediate value theorem, either Tx>x for all x, or
Tx<x for all x. If Tx>x, then $\{T^{n}x\}$ is an increasing sequence. If $T^{n}x\to y$,
then $T^{n+1}x\to Ty$, which implies Ty=y, a contradiction. Hence $T^{n}x\to\infty$, so the
Cesaro means (*) are unbounded. The case Tx<x is handled similarly.

(2) Let $Tx=-x+1$ $(x\le -1)$, $Tx=-2x$ $(|x|\le 1)$, $Tx=-x-1$ $(x\ge 1)$. Then
if $x\ge 1$,

\[ \frac{1}{n}\left(x+Tx+\cdots+T^{n-1}x\right)=\begin{cases} -\dfrac{1}{2} & (n \text{ even}),\\[4pt] \dfrac{1}{2}+\dfrac{1}{n}\left(x-\dfrac{1}{2}\right) & (n \text{ odd}),\end{cases} \]

while if $x\le -1$,

\[ \frac{1}{n}\left(x+Tx+\cdots+T^{n-1}x\right)=\begin{cases} \dfrac{1}{2} & (n \text{ even}),\\[4pt] -\dfrac{1}{2}+\dfrac{1}{n}\left(x+\dfrac{1}{2}\right) & (n \text{ odd}).\end{cases} \]

Since for $x\neq 0$ we have $|T^{n}x|\ge 1$ for all sufficiently large n, we see that for $x\neq 0$
the sequence (*) is bounded but does not converge.
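
The behaviour of the Cesaro means in part (2) can be reproduced with exact rational arithmetic. This is a sketch only; the helper name `cesaro_means` and the starting value 3 are assumptions made for the illustration.

```python
from fractions import Fraction

def T(x):
    """The piecewise linear map from part (2) of the solution."""
    if x <= -1:
        return -x + 1
    if x >= 1:
        return -x - 1
    return -2 * x

def cesaro_means(x, n_max):
    """Return [(x + Tx + ... + T^(n-1) x) / n for n = 1, ..., n_max]."""
    means, total, y = [], Fraction(0), Fraction(x)
    for n in range(1, n_max + 1):
        total += y            # total is now x + Tx + ... + T^(n-1) x
        means.append(total / n)
        y = T(y)
    return means

# Starting value chosen for illustration; any x >= 1 shows the same pattern.
means = cesaro_means(3, 9)
for n, m in enumerate(means, start=1):
    print(n, m)
```

For a starting value $x\ge 1$ the even-indexed means equal $-1/2$ exactly, while the odd-indexed means tend to $1/2$, so the sequence is bounded but has two limit points.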


Also solved by David Boyd, Ted Cullen, R. V. Fuller, D. A. Hejhal, D. R. Horner, Emmett
Keeler & Joel Spencer, B. G. Klein, O. P. Lossers (Netherlands), Dan Marcus, D. E. Myers, P. J.
Owens (England), Nicholas Passell, H. Pétard, Joel Spencer, Alberto Torchinsky, D. A. Zave,
P. J. Zwier, and the proposer.

AN INTRODUCTION TO DISTRIBUTIONS
JOHN HORVATH, University of Maryland 


1. In the late 1920's the British physicist P. A. M. Dirac introduced the so-called
delta-function which has the following properties [6, section 15]: $\delta$ is
defined and continuous on the whole real line R, $\delta(x)=0$ for $x\neq 0$, $\int_{-\infty}^{\infty}\delta(x)\,dx=1$,
and if f is a continuous function defined on R, the relation

\[ f(a)=\int_{-\infty}^{\infty} f(x)\,\delta(a-x)\,dx \tag{1} \]

holds for $a\in R$. Furthermore $\delta$ is not only continuous but infinitely differentiable,
and if f is a k times continuously differentiable function defined on R, then

\[ f^{(k)}(a)=\int_{-\infty}^{\infty} f(x)\,\delta^{(k)}(a-x)\,dx \tag{2} \]

for any $a\in R$. If we define the Heaviside function Y by Y(x)=0 if x<0 and
Y(x)=1 if x>0, then

\[ \delta(x)=Y'(x). \tag{3} \]

Finally Dirac lists the remarkable equation

\[ \frac{d}{dx}\log x=\frac{1}{x}-i\pi\,\delta(x) \tag{4} \]

which plays a role in the quantum theory of collision processes.


2. It was quickly pointed out by mathematicians that from the point of 
view of “rigorous” mathematics all this is nonsense. Of course it was perfectly 
clear to Dirac himself that $\delta$ is not a function in the classical sense of the word,
and what is important are not the values assumed by $\delta$ at the points x, but rather
the way $\delta$ and the $\delta^{(k)}$ act as operators on the functions f, as in formulas (1) and
(2). It took about 30 years to discover the mathematical foundations of a correct 
formulation of the definition and properties of the delta-function, and it turned 
out that Dirac’s brilliant intuition was right in every instance. 


3. It is relatively easy to give a precise meaning to equation (1). It suffices
indeed to consider $\delta$ not as a point function but as a set function. We call a collection
$\mathfrak{X}$ of subsets of R, or more generally of the n-dimensional euclidean space
$R^n$, a tribe if it satisfies the following two conditions:


Professor Horvath received his Ph.D. in 1947 at the University of Budapest, working under
L. Fejér and F. Riesz. He has held positions at the C.N.R.S. Paris, the Universidad de los Andes,
Colombia, and the University of Maryland, and visiting positions at the University of Madrid and
the Université de Nancy. His research in analysis includes the books Aproximación y funciones
casi-analíticas, Madrid 1956, Topological Vector Spaces and Distributions I, Addison-Wesley, 1966, and
Introducción a la topología general, Pan American Union 1969. Editor.




228 JOHN HORVATH [March 


(a) If $(A_k)_{k\in N}$ is a sequence of sets belonging to $\mathfrak{X}$, then their union
belongs to $\mathfrak{X}$.

(b) If $A\in\mathfrak{X}$, then its complement $\complement A$, i.e., the set of those points of $R^n$
which do not belong to A, also belongs to $\mathfrak{X}$.

A real-valued function f defined on $R^n$ is said to be measurable with respect
to the tribe $\mathfrak{X}$, or simply $\mathfrak{X}$-measurable, if for any pair a, b of real numbers such
that a<b the set $\{x \mid a\le f(x)<b\}$ belongs to $\mathfrak{X}$.

A positive measure defined on the tribe $\mathfrak{X}$ is a function $\mu$ which associates with
each set $A\in\mathfrak{X}$ a number $0\le\mu(A)\le\infty$, and is such that if $(A_k)$ is a sequence of
pairwise disjoint sets belonging to $\mathfrak{X}$, then

\[ \mu\Big(\bigcup_{k=0}^{\infty}A_k\Big)=\sum_{k=0}^{\infty}\mu(A_k). \]

Let now f be a bounded, $\mathfrak{X}$-measurable function. For every finite increasing
sequence $\alpha$ of real numbers $a_1<a_2<\cdots<a_l$ such that $a_1\le f(x)<a_l$ write
$A_k=\{x \mid a_k\le f(x)<a_{k+1}\}$. Then the Lebesgue-Stieltjes integral $\int f(x)\,d\mu(x)$ of f
with respect to the positive measure $\mu$ is defined as the least upper bound of the
values $\sum_{k=1}^{l-1}a_k\cdot\mu(A_k)$, where $\alpha$ runs through all possible sequences of the kind
described above.

For any $a\in R^n$ we denote by $\tau_a$ the translation map $x\mapsto x+a$ from $R^n$ onto
itself. For any subset A of $R^n$ the translate by a of A is the set $\tau_a(A)=A+a
=\{x+a \mid x\in A\}$. The translate by a of a positive measure $\mu$ defined on the tribe
$\mathfrak{X}$ is given by

\[ \tau_a(\mu)(A)=\mu(\tau_{-a}A) \]

for every $A\subset R^n$ such that $\tau_{-a}A\in\mathfrak{X}$.

If A is any subset of $R^n$, we denote by $-A$ its reflection with respect to the
origin, i.e., the set $-A=\{-x \mid x\in A\}$. The reflection $\check\mu$ of the positive measure
$\mu$ is then defined by

\[ \check\mu(A)=\mu(-A) \]

for every $A\subset R^n$ such that $-A\in\mathfrak{X}$.

The Dirac measure $\delta$ is defined on the tribe of all subsets of $R^n$ by the following
conditions:

\[ \delta(A)=1 \ \text{if } 0\in A, \qquad \delta(A)=0 \ \text{if } 0\notin A. \]

Clearly $\check\delta=\delta$, and for any $a\in R^n$ the measure $\tau_a(\delta)$ is given by

\[ \tau_a(\delta)(A)=1 \ \text{if } a\in A, \qquad \tau_a(\delta)(A)=0 \ \text{if } a\notin A. \]


If f is a continuous function defined on $R^n$, then the relation

\[ f(a)=\int f(x)\,d(\tau_a(\check\delta))(x) \tag{5} \]

holds for $a\in R^n$. This is the rigorous form of Dirac's formula (1). Of course (5)
looks much clumsier than (1), but with the concept of convolution we shall be
able to rewrite (5) in a form whose elegance matches that of (1). It is to keep
in line with the general definition of a convolution that we have written $\check\delta$
instead of $\delta$ in (5), where it is in fact superfluous.
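
Formula (5) can be illustrated on the real line by treating sets as membership predicates. The sketch below is an illustration under stated assumptions, not a general integration routine: the names `dirac`, `translate`, and `lower_sum` are hypothetical, the dimension is taken to be n=1, and a single fine partition of the range of f stands in for the least upper bound over all partitions.

```python
import math

def dirac(A):
    """Dirac measure on R: A is a subset of R given as a membership predicate."""
    return 1 if A(0.0) else 0

def translate(mu, a):
    """tau_a(mu)(A) = mu(tau_{-a} A), where tau_{-a} A = {x - a : x in A}."""
    # y belongs to tau_{-a} A exactly when y + a belongs to A.
    return lambda A: mu(lambda y: A(y + a))

def lower_sum(f, mu, levels):
    """Sum of a_k * mu(A_k) with A_k = {x : a_k <= f(x) < a_{k+1}}."""
    total = 0.0
    for lo, hi in zip(levels, levels[1:]):
        total += lo * mu(lambda x, lo=lo, hi=hi: lo <= f(x) < hi)
    return total

a = 0.5
# Levels -1.000, -0.999, ..., 1.500 cover the range of cos with mesh 0.001.
levels = [-1.0 + 0.001 * k for k in range(2501)]
approx = lower_sum(math.cos, translate(dirac, a), levels)
print(approx, math.cos(a))
```

Only the set $A_k$ containing the point a has measure 1, so the sum reduces to the largest level not exceeding f(a); refining the partition makes it approach f(a) from below.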


4. In the early 1940s Nicolas Bourbaki and Henri Cartan singled out a certain
class of measures and characterized them not as set functions but as operations
which associate with certain functions $\varphi$ the number $\mu(\varphi)=\int\varphi(x)\,d\mu(x)$.

If $\varphi$ is any real- or complex-valued function defined on $R^n$, the support Supp $\varphi$
of $\varphi$ is the closure of the set $\{x \mid \varphi(x)\neq 0\}$. We denote by $\mathcal{K}(R^n)$, or simply by $\mathcal{K}$,
the vector space consisting of all real-valued continuous functions $\varphi$ on $R^n$ such
that Supp $\varphi$ is a compact subset of $R^n$. A (real) Radon measure on $R^n$ is a map $\mu$
from $\mathcal{K}$ into R which satisfies the following conditions:

(a) $\mu$ is linear, i.e., for $\alpha,\beta\in R$ and $\varphi,\psi\in\mathcal{K}$ we have

\[ \mu(\alpha\varphi+\beta\psi)=\alpha\mu(\varphi)+\beta\mu(\psi). \]

(b) For every compact subset K of $R^n$ there exists a number $M_K>0$ such that

\[ |\mu(\varphi)|\le M_K\max_{x}|\varphi(x)| \]

whenever Supp $\varphi\subset K$.

A complex Radon measure is defined similarly by taking for $\mathcal{K}$ the vector
space of complex-valued continuous functions with compact support and maps
$\mu:\mathcal{K}\to C$ which are C-linear, i.e., satisfy condition (a) for $\alpha,\beta\in C$.

This suggested to Laurent Schwartz in 1944 the idea to replace $\mathcal{K}$ by the
smaller space of all infinitely differentiable functions with compact support, and
condition (b) by a less stringent one. He thus obtained a new class of objects
which contains not only the Dirac measure $\delta$ but also its derivatives $\delta^{(k)}$, which
are not measures any more.


5. It was in fact not formula (2) which led Schwartz to the discovery of distributions
but a problem posed by G. Choquet and J. Deny [3]. If $\sigma$ is a similitude
of $R^n$ and f any function defined on $R^n$, we define the function $\sigma^*(f)$ by
$\sigma^*(f)(x)=f(\sigma^{-1}x)$. Let $\Omega$ be a nonempty open subset of $R^n$ and f a continuous
function defined on $R^n$. Denote by $\mathfrak{F}$ the set of restrictions to $\Omega$ of all functions
$\sigma^*(f)$, where $\sigma$ runs through the similitudes of $R^n$. For the case n=2 Choquet
and Deny proved that every continuous function defined on $\Omega$ can be approximated
uniformly on every compact subset of $\Omega$ by linear combinations of elements
of $\mathfrak{F}$ if and only if f is not polyharmonic, i.e., does not satisfy an equation
$\Delta^p f=0$, where $\Delta$ is the Laplace operator and p an integer $\ge 1$. The necessity of
the condition is obvious. Indeed, if f is polyharmonic, then so is every function
in $\mathfrak{F}$ and also their linear combinations. Since the uniform limit of polyharmonic
functions is polyharmonic, not every continuous function on $\Omega$ can be approximated
by linear combinations of elements of $\mathfrak{F}$.

In their proof Choquet and Deny used the expansion of f into a series of
harmonic polynomials. This tool was not powerful enough to yield the result for
n>2. In a four-page-long paper [19] Schwartz gave a very elegant proof of the
general theorem for any $n\ge 2$, and, more importantly, discovered the theory of
distributions practically overnight.

Besides the new way of looking at Radon measures, Schwartz was
also helped in his discovery by his familiarity with topological vector spaces.
Indeed, in 1942 Dieudonné published a very important paper on the duality of
normed vector spaces [4], and just as an exercise Schwartz generalized its results
to Fréchet spaces, i.e., locally convex, metrizable, and complete vector spaces.
He never intended to publish these generalizations but they came in extremely
handy when the need for them arose.


6. We denote by $\mathcal{D}(R^n)$, or simply by $\mathcal{D}$, the vector space of all real- or
complex-valued functions $\varphi$ defined on $R^n$ which have continuous partial derivatives
of all orders and are such that Supp $\varphi$ is a compact subset of $R^n$. It is not obvious
that such functions exist which are not identically zero, but they can be constructed
easily from Cauchy's infinitely differentiable function which assumes
the value $\exp(-1/t^2)$ for t>0, and the value zero for $t\le 0$.
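
One standard way to pass from Cauchy's function to a nonzero element of the space of test functions on the real line is to multiply two shifted copies of it. The sketch below assumes this particular construction; the names `cauchy` and `bump` and the default interval [-1, 1] are hypothetical choices for the illustration.

```python
import math

def cauchy(t: float) -> float:
    """Cauchy's function: exp(-1/t^2) for t > 0 and 0 for t <= 0."""
    if t <= 0.0:
        return 0.0
    # Written as (1/t)/t so that a very small t cannot cause a division by zero.
    return math.exp(-(1.0 / t) / t)

def bump(x: float, a: float = -1.0, b: float = 1.0) -> float:
    """Infinitely differentiable function with support [a, b] (assumed construction)."""
    # The first factor vanishes for x <= a, the second for x >= b.
    return cauchy(x - a) * cauchy(b - x)

for x in (-1.5, -1.0, -0.5, 0.0, 0.5, 1.0, 1.5):
    print(x, bump(x))
```

In floating point the values very close to the endpoints underflow to zero, although the mathematical function is strictly positive on the open interval (a, b).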


If $\alpha=(\alpha_1,\ldots,\alpha_n)$ is any n-tuple of positive integers, we shall denote by
$|\alpha|$ the integer $\alpha_1+\cdots+\alpha_n$ and by $\partial^\alpha$ the partial differential operator

\[ \frac{\partial^{|\alpha|}}{\partial x_1^{\alpha_1}\,\partial x_2^{\alpha_2}\cdots\partial x_n^{\alpha_n}}. \]

A real (or complex) distribution on $R^n$ is a map T from $\mathcal{D}$ into R (or into C,
respectively), which satisfies the following two conditions:

(a) T is linear.

(b) For every compact subset K of $R^n$ there exists a number $M_K>0$ and an
integer $m(K)\ge 0$ such that

\[ |T(\varphi)|\le M_K\max_{|\alpha|\le m(K)}\ \max_{x\in R^n}|\partial^\alpha\varphi(x)|, \]

for every $\varphi\in\mathcal{D}$ which verifies Supp $\varphi\subset K$.

Instead of $T(\varphi)$ we shall mostly write $\langle T,\varphi\rangle$.

Examples. (i) Let f be a continuous (or just locally Lebesgue-integrable)
function defined on $R^n$. We associate with it a distribution $T_f$ defined by

\[ \langle T_f,\varphi\rangle=\int_{R^n} f(x)\varphi(x)\,dx. \]

Here we can choose $M_K=\int_K|f(x)|\,dx$ and m(K)=0 for every compact subset
K of $R^n$.
(ii) The Dirac measure $\delta$ is the distribution defined by

\[ \langle\delta,\varphi\rangle=\varphi(0) \]

for every $\varphi\in\mathcal{D}$.

(iii) More generally, according to our definitions, every Radon measure defines
a distribution for which m(K)=0 for any compact $K\subset R^n$. The Radon
measures can be considered as the mathematical formulation of the physical
concepts of mass or charge distributions.

(iv) Let $\Sigma$ be a smooth surface, $d\sigma$ the surface element of $\Sigma$, denote by d/dn
the derivation in the direction of the normal of $\Sigma$, and let f be a continuous function
defined on $\Sigma$. Then

\[ \langle T,\varphi\rangle=\int_{\Sigma} f(x)\,\frac{d\varphi(x)}{dn}\,d\sigma(x) \]

defines a distribution T which can be considered as a magnetic dipole layer on $\Sigma$,
where f is the density of the magnetic moment. In this example we have to take
m(K)=1.


7. For any compact subset K of $R^n$ denote by $\mathcal{D}(K)$ the vector space of all
functions which have continuous partial derivatives of all orders, and whose
support is contained in K. Clearly $\mathcal{D}$ is the union of all the spaces $\mathcal{D}(K)$.

For any n-tuple of positive integers $\alpha$ we have a semi-norm on $\mathcal{D}(K)$, i.e., a
map $\varphi\mapsto p_\alpha(\varphi)$ defined by

\[ p_\alpha(\varphi)=\max_{x\in R^n}|\partial^\alpha\varphi(x)|. \]

The family $(p_\alpha)$ of all these semi-norms defines a topology on $\mathcal{D}(K)$: a fundamental
system of neighborhoods of $\varphi\in\mathcal{D}(K)$ is given by all finite intersections
of sets of the form $\{\psi \mid p_\alpha(\psi-\varphi)\le\epsilon\}$, as $\alpha$ runs through all n-tuples and $\epsilon$
through all strictly positive numbers. This topology is compatible with the vector
space structure of $\mathcal{D}(K)$, i.e., the two algebraic operations on $\mathcal{D}(K)$, addition
and multiplication by a scalar, are continuous. In particular, the topology of
$\mathcal{D}(K)$ is completely characterized by the system of neighborhoods of the zero
element of $\mathcal{D}(K)$: the neighborhoods of any $\varphi\in\mathcal{D}(K)$ are all the sets $V+\varphi
=\{\psi+\varphi \mid \psi\in V\}$, where V is an arbitrary neighborhood of $0\in\mathcal{D}(K)$.

For any compact subset K of $R^n$ we have a canonical map $j_K$ from $\mathcal{D}(K)$
into $\mathcal{D}$, which associates with $\varphi\in\mathcal{D}(K)$ the same $\varphi$ considered as an element of $\mathcal{D}$.
One then makes a topological vector space out of $\mathcal{D}$ by putting on it the finest
locally convex topology for which all the maps $j_K$ are continuous. As we have
observed above, to describe this topology it is sufficient to give the neighborhoods
of the zero element of $\mathcal{D}$. We say that a subset V of $\mathcal{D}$ is balanced if for
every $\varphi\in V$ and every scalar $\lambda$ such that $|\lambda|\le 1$, the element $\lambda\varphi$ also belongs to V;
we say that V is convex if for $\varphi\in V$, $\psi\in V$ and $0\le t\le 1$ we have $(1-t)\varphi+t\psi\in V$.
A fundamental system of neighborhoods of $0\in\mathcal{D}$ is given by all balanced, convex
sets V such that for every compact subset K of $R^n$ the set $j_K^{-1}(V)=V\cap\mathcal{D}(K)$
is a neighborhood of 0 in $\mathcal{D}(K)$.

Once this topology is put on the space $\mathcal{D}$, the distributions are simply the
continuous linear maps from $\mathcal{D}$ into the field of scalars, i.e., the continuous
linear forms (or functionals). The space of all distributions on $R^n$ is therefore the
topological dual $\mathcal{D}'$ of $\mathcal{D}$.

We can define various topologies and modes of convergence on $\mathcal{D}'$. Fortunately,
for sequences all these definitions coincide, and we can say that a sequence
$(T_k)$ of distributions converges to the distribution T if for every $\varphi\in\mathcal{D}$
the sequence $(\langle T_k,\varphi\rangle)$ converges to $\langle T,\varphi\rangle$.

Understandably, the discovery of distributions has aroused great interest 
in the theory of topological vector spaces. Before 1950 functional analysts 
studied mainly normed spaces, and only a handful of papers were written on 
spaces which are not normable. In 1950 the basic paper of Dieudonné and 
Schwartz [5] was published, whose avowed purpose was to prove those results 
which are necessary for the theory of distributions. This was followed the next 
year by an equally basic paper of Bourbaki [1], and a few years later Grothen- 
dieck published his very deep results [9, 10]. Since then the number of articles 
concerned with topological vector spaces has gone into the thousands. An excellent
account of these developments can be found in Köthe's book [14].

On the other hand, several approaches to the theory of distributions have 
been devised, which avoid the explicit use of topological vector spaces. These 
so-called elementary approaches are, however, not very powerful and in par- 
ticular they do not yield the applications to the theory of partial differential 
equations, some of which we shall mention below, which are the strongest justifi- 
cations of the theory of distributions. 


8. The classical results concerning normed spaces, when generalized to topo- 
logical vector spaces, can be used to prove results concerning distributions. We 
shall illustrate this by two examples. 

(a) By example (i) above, $\mathcal{D}$ can be considered as a linear subspace of the
space $\mathcal{D}'$ of all distributions, if we identify $\varphi\in\mathcal{D}$ with the distribution $T_\varphi$. Then
$\mathcal{D}$ is a dense subspace of $\mathcal{D}'$. Proof. If $\mathcal{D}$ were not dense in $\mathcal{D}'$, then by the Hahn-Banach
theorem there would exist a continuous linear form L on $\mathcal{D}'$ which is not
identically zero but $L(\varphi)=\langle L,\varphi\rangle=0$ for every $\varphi\in\mathcal{D}$ [13, prop. 1.6.2, p. 49].
Now it is easy to see that the space $\mathcal{D}$ is reflexive, i.e., for every continuous
linear form L on $\mathcal{D}'$ there exists $\psi\in\mathcal{D}$ such that $\langle L,T\rangle=\langle T,\psi\rangle$ for every $T\in\mathcal{D}'$
[13, example 3.9.6, p. 241]. If $\langle L,\varphi\rangle=\langle T_\varphi,\psi\rangle=0$ for every $\varphi\in\mathcal{D}$, then in particular
$\langle L,\bar\psi\rangle=\int|\psi(x)|^2\,dx=0$, hence $\psi=0$ and therefore L=0. Thus $\mathcal{D}$ must be
dense in $\mathcal{D}'$.

(b) The classical Banach-Steinhaus theorem [13, prop. 1.8.1, p. 63] can be
generalized to the space $\mathcal{D}$ to yield the following result: Let $(T_k)$ be a sequence
of distributions such that for every $\varphi\in\mathcal{D}$ the sequence of scalars $(\langle T_k,\varphi\rangle)$ converges
to some number $T(\varphi)$. Then the map $T:\varphi\mapsto T(\varphi)$ is a distribution, i.e.,
a continuous linear form on $\mathcal{D}$. It is fairly obvious that T is linear; the emphasis
is on the continuity of T. Let us also observe that for Banach spaces the proof
of the Banach-Steinhaus theorem uses Baire's theorem, according to which if a
complete metric space is the union of a sequence of closed subsets $(F_k)$, then at
least one set $F_k$ has an interior point. Now in $\mathcal{D}$ Baire's theorem does not hold,
but the Banach-Steinhaus theorem is still true because $\mathcal{D}$ is a so-called barrelled
space. It was Bourbaki [1] who characterized those spaces for which the Banach-Steinhaus
theorem holds by the property of barrelledness, which is usually easy
to establish.
Example (v). The distribution v.p.(1/x) on R is defined by

\[ \Big\langle \mathrm{v.p.}\,\frac{1}{x},\varphi\Big\rangle=\lim_{\epsilon\to 0}\int_{|x|>\epsilon}\frac{\varphi(x)}{x}\,dx \tag{6} \]

for $\varphi\in\mathcal{D}(R)$. That it is a distribution follows from the fact that the right-hand
side of (6) is also the limit of the sequence $\big(\int_{|x|>1/k}x^{-1}\varphi(x)\,dx\big)$. The letters v.p.
are used because the right-hand side of (6) is the Cauchy principal value (in
French: valeur principale) of the integral $\int_{-\infty}^{\infty}x^{-1}\varphi(x)\,dx$. The symbol 1/x in
Dirac's formula (4) is meant to designate the distribution v.p.(1/x).
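
The truncated integrals in (6) can be evaluated numerically. The sketch below is an illustration only: the test functions `bump` and `phi`, the midpoint rule, and the step count are assumptions, and the symmetric form of the integrand is used because the contributions of x and -x combine into $(\varphi(x)-\varphi(-x))/x$.

```python
import math

def bump(x):
    """Even test function supported in [-1, 1]."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def phi(x):
    """A test function that is not even, chosen for illustration."""
    return (1.0 + x) * bump(x)

def truncated_pv(test, eps, upper=1.0, steps=20000):
    """Midpoint-rule value of the integral of test(x)/x over eps < |x| < upper."""
    h = (upper - eps) / steps
    total = 0.0
    for i in range(steps):
        x = eps + (i + 0.5) * h
        # Contributions of x and -x combined into one integrand.
        total += (test(x) - test(-x)) / x
    return total * h

for eps in (0.1, 0.01, 0.001):
    print(eps, truncated_pv(phi, eps), truncated_pv(bump, eps))
```

For the even function the truncated integrals vanish, and for `phi` they settle down as the cut-off shrinks, which is the convergence that the definition (6) relies on.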


9. Let us now examine how the various operations, which are classically per- 
formed on functions, generalize to distributions. 

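The first of these operations is differentiation. To motivate the definition,
let f be a continuous function defined on the real line, and which has a continuous
first derivative $\partial f$. It is natural to require that the derivative $\partial T_f$ of the distribution
$T_f$ associated with f shall be the distribution $T_{\partial f}$ associated with $\partial f$.
If this is so, then for every $\varphi\in\mathcal{D}(R)$ we have

\[ \langle\partial T_f,\varphi\rangle=\langle T_{\partial f},\varphi\rangle=\int_{-\infty}^{\infty}\partial f(x)\,\varphi(x)\,dx
=\big[f(x)\varphi(x)\big]_{-\infty}^{\infty}-\int_{-\infty}^{\infty}f(x)\,\partial\varphi(x)\,dx=-\langle T_f,\partial\varphi\rangle. \]

Let us write briefly $\partial_j$ for the partial differential operator $\partial/\partial x_j$ $(1\le j\le n)$ on
$R^n$. The above consideration leads us to define the partial derivative $\partial_jT$ of the
distribution T on $R^n$, by requiring that the formula

\[ \langle\partial_jT,\varphi\rangle=-\langle T,\partial_j\varphi\rangle \]

be verified for every $\varphi\in\mathcal{D}(R^n)$. We have to prove, of course, that $\partial_jT$ is a distribution.
But the topology of $\mathcal{D}$ has been so cleverly devised that $\varphi\mapsto\partial_j\varphi$ is a
continuous linear map from $\mathcal{D}$ into itself, and therefore the linear form $\varphi\mapsto\langle T,\partial_j\varphi\rangle$
is continuous.

More generally, if $\alpha$ is any n-tuple of positive integers, we have

\[ \langle\partial^\alpha T,\varphi\rangle=(-1)^{|\alpha|}\langle T,\partial^\alpha\varphi\rangle \]

for every $\varphi\in\mathcal{D}$.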
Example (vi). Let us show that the derivative of the distribution associated
with the Heaviside function Y is the Dirac measure $\delta$. This justifies Dirac's
formula (3). For any $\varphi\in\mathcal{D}(R)$ we have indeed

\[ \langle\partial T_Y,\varphi\rangle=-\langle T_Y,\partial\varphi\rangle=-\int_{0}^{\infty}\partial\varphi(x)\,dx=-\big[\varphi(x)\big]_{0}^{\infty}=\varphi(0)=\langle\delta,\varphi\rangle. \]
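
The computation of Example (vi) can be checked numerically for one particular test function. The sketch below assumes a bump centred at 0.3 (so that the test function is not symmetric about the origin) and a midpoint rule; the names `phi`, `dphi`, and `pair_dY` are hypothetical.

```python
import math

def phi(x):
    """Test function supported in [-0.7, 1.3]: a bump centred at 0.3."""
    u = x - 0.3
    return math.exp(-1.0 / (1.0 - u * u)) if abs(u) < 1.0 else 0.0

def dphi(x):
    """Exact derivative of phi."""
    u = x - 0.3
    if abs(u) >= 1.0:
        return 0.0
    w = 1.0 - u * u
    return math.exp(-1.0 / w) * (-2.0 * u) / (w * w)

def pair_dY(steps=20000, upper=1.3):
    """Minus the integral of dphi over [0, upper], by the midpoint rule."""
    h = upper / steps
    return -h * sum(dphi((i + 0.5) * h) for i in range(steps))

value = pair_dY()
print(value, phi(0.0))
```

The two printed numbers agree up to the quadrature error, in line with the identity derived in Example (vi).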


10. The second operation we can perform on distributions is the multiplication
by an infinitely differentiable function. Again, if $T_f$ is the distribution associated
with the continuous function f, and $\psi$ is an infinitely differentiable
function defined on $R^n$, it is natural to require that $\psi T_f$ shall be the distribution
associated with $\psi f$. If this is so, then for every $\varphi\in\mathcal{D}$ we have

\[ \langle\psi T_f,\varphi\rangle=\langle T_{\psi f},\varphi\rangle=\int\psi(x)f(x)\varphi(x)\,dx=\int f(x)\,\psi(x)\varphi(x)\,dx=\langle T_f,\psi\varphi\rangle. \]

Thus if $\psi$ is an infinitely differentiable function and T any distribution on
$R^n$, we define $\psi T$ by asking that

\[ \langle\psi T,\varphi\rangle=\langle T,\psi\varphi\rangle \]

be verified for every $\varphi\in\mathcal{D}$. This definition is justified by the fact that $\psi\varphi$ has
compact support, and $\varphi\mapsto\psi\varphi$ is a continuous map from $\mathcal{D}$ into itself.


11. Perhaps the most important of all operations is the convolution of two
distributions. First let f and g be two continuous functions on $R^n$, one of which
has compact support. Their convolution $h=f*g$ is classically defined by

\[ h(x)=\int_{R^n}f(x-y)\,g(y)\,dy. \tag{7} \]

If f is any function defined on $R^n$, the function $\check f$ is defined by $x\mapsto f(-x)$,
and for any $a\in R^n$ the function $\tau_a(f)$ is defined by $x\mapsto f(x-a)$. With these notations
formula (7) can be rewritten as

\[ h(x)=\int_{R^n}(\tau_x\check f)(y)\,g(y)\,dy. \]

It is therefore justified to define the convolution $T*\varphi$ of a distribution T
and a function $\varphi\in\mathcal{D}$ as the function given by

\[ (T*\varphi)(x)=\langle T,\tau_x\check\varphi\rangle \]

for $x\in R^n$. One then proves that $T*\varphi$ is an infinitely differentiable function
[13, prop. 4.10.1, p. 402; 22, Theorem 27.2, p. 287].


12. It will not be possible to define the convolution of two arbitrary distributions,
hence we have to introduce a certain subclass of $\mathcal{D}'$. We shall say that the
distribution S has compact support if there exists a compact subset K of $R^n$ such
that $\langle S,\varphi\rangle=0$ whenever $\varphi\in\mathcal{D}$ is such that $K\cap$ Supp $\varphi=\emptyset$.

If S is a distribution with compact support, then we can give a meaning to
the symbol $\langle S,\psi\rangle$ for any infinitely differentiable function $\psi$. Indeed, let $\chi$ be a
function belonging to $\mathcal{D}$, which is equal to 1 on the compact set K which figured
above, when we explained what it meant for S to have a compact support. Then
$\chi\psi$ belongs to $\mathcal{D}$ and the value of $\langle S,\chi\psi\rangle$ is independent of the choice of $\chi$, since
if $\chi_1$ is a second function which has the same properties as $\chi$, then the support
of $(\chi-\chi_1)\psi$ does not meet K. Therefore we can set, by definition,

\[ \langle S,\psi\rangle=\langle S,\chi\psi\rangle. \]
13. If T is an arbitrary distribution, we define $\check T$ by requiring that the relation

\[ \langle\check T,\varphi\rangle=\langle T,\check\varphi\rangle \]

be satisfied for $\varphi\in\mathcal{D}$. This definition is justified by the fact that, if $T=T_f$ and
$\check T=T_{\check f}$, then

\[ \langle\check T,\varphi\rangle=\int f(-x)\varphi(x)\,dx=\int f(x)\varphi(-x)\,dx=\langle T,\check\varphi\rangle. \]

Let now S be a distribution with compact support, and T an arbitrary distribution.
Their convolution $S*T$ is the distribution which verifies the relation

\[ \langle S*T,\varphi\rangle=\langle S,\check T*\varphi\rangle \]

for every $\varphi\in\mathcal{D}$. It has to be proved, of course, that $S*T$ is indeed a distribution,
i.e., that the map $\varphi\mapsto\langle S,\check T*\varphi\rangle$ is continuous [22, Theorem 27.3, p. 289].

To justify this definition, consider the case $S=T_f$, $T=T_g$, where f and g are
continuous functions and f has compact support. Then we have

\[ \langle S*T,\varphi\rangle=\langle S,\check T*\varphi\rangle
=\int_{R^n}f(x)\int_{R^n}g(-y)\,\varphi(x-y)\,dy\,dx
=\int_{R^n}f(x)\int_{R^n}g(z-x)\,\varphi(z)\,dz\,dx
=\int_{R^n}h(z)\varphi(z)\,dz=\langle T_h,\varphi\rangle, \]

where $h=f*g$. Thus our definition yields $T_f*T_g=T_{f*g}$, which is reasonable to
require.


14. We shall now list some properties of convolution.

(a) For any distribution T we have $\delta*T=T$. In other words, the Dirac
measure is the unit element for the convolution considered as an algebraic operation.
If T is the distribution associated with a continuous function f, then this
is the promised elegant form of Dirac's relations (1) and (5).

(b) If S is a distribution with compact support, and T an arbitrary distribution,
then

\[ \partial_j(S*T)=(\partial_jS)*T=S*(\partial_jT) \]

for $1\le j\le n$.

(c) Combining the preceding two relations, we obtain for any n-tuple of
positive integers $\alpha$, and for any distribution T, the relation

\[ (\partial^\alpha\delta)*T=\partial^\alpha T, \]

which is the equivalent of Dirac's formula (2). Thus derivation has become a
convolution with a fixed distribution.

(d) More generally, if $\xi=(\xi_1,\ldots,\xi_n)$ is a point of $R^n$ and $\alpha=(\alpha_1,\ldots,\alpha_n)$
an n-tuple of positive integers, let us write $\xi^\alpha$ for $\xi_1^{\alpha_1}\xi_2^{\alpha_2}\cdots\xi_n^{\alpha_n}$. A polynomial of
degree m in the n variables $\xi_1,\ldots,\xi_n$ can then be written as $P(\xi)=\sum_{|\alpha|\le m}a_\alpha\xi^\alpha$,
where the coefficients $a_\alpha$ are scalars. If for each j $(1\le j\le n)$ we replace $\xi_j$ by the
differential operator $\partial_j$, we obtain the partial differential operator with constant
coefficients $P(\partial)=\sum_{|\alpha|\le m}a_\alpha\partial^\alpha$. For any such operator $P(\partial)$, and for any distribution
T we have

\[ P(\partial)\delta*T=P(\partial)T. \]


15. Let $P(\partial)$ be a partial differential operator with constant coefficients.
Then

\[ P(\partial)T=S, \tag{8} \]

where S is a given distribution and T an unknown distribution, is called a linear
partial differential equation with constant coefficients. Though the theory of
these equations belongs to the most classical parts of analysis, it has made very
significant progress since the discovery of distributions. An account of these developments
can be found in the books of Hörmander [12] and Trèves [23].

In the first place, distribution theory helped to clear up the concept of a fundamental
(or elementary) solution of a partial differential operator with constant
coefficients. It was always felt that, for instance, if $|x|$ denotes the length
of the vector $x=(x_1,\ldots,x_n)\in R^n$ given by $|x|^2=x_1^2+\cdots+x_n^2$, then the
function E defined by

\[ E(x)=-\frac{\Gamma((n-2)/2)}{4\pi^{n/2}}\,\frac{1}{|x|^{n-2}} \]

is the fundamental solution of the Laplace operator

\[ \Delta=\partial_1^2+\partial_2^2+\cdots+\partial_n^2 \]
for $n\ge 3$. The reasons for this belief were the following:

(a) E has a singularity at the origin, and outside $\{0\}$ it satisfies $\Delta E=0$;

(b) a solution u of the Poisson equation $\Delta u=f$ is given by the Newtonian
potential

\[ u(x)=(f*E)(x)=-\frac{\Gamma((n-2)/2)}{4\pi^{n/2}}\int_{R^n}\frac{f(y)}{|x-y|^{n-2}}\,dy. \]
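
Property (a) can be checked numerically away from the origin with a second-order finite-difference Laplacian. This is a sketch under stated assumptions: the names `E` and `laplacian`, the step h, and the sample points are chosen for the illustration, and the check covers only $\Delta E=0$ outside the origin, not the singular behaviour at 0.

```python
import math

def E(x):
    """E(x) = -Gamma((n-2)/2) / (4 pi^(n/2) |x|^(n-2)) for x in R^n, n >= 3."""
    n = len(x)
    r = math.sqrt(sum(c * c for c in x))
    return -math.gamma((n - 2) / 2) / (4 * math.pi ** (n / 2)) / r ** (n - 2)

def laplacian(fn, x, h=1e-3):
    """Central second differences summed over all coordinates."""
    total = 0.0
    for j in range(len(x)):
        xp, xm = list(x), list(x)
        xp[j] += h
        xm[j] -= h
        total += (fn(xp) - 2.0 * fn(x) + fn(xm)) / (h * h)
    return total

p3 = [0.7, -0.4, 1.1]          # a point of R^3 away from the origin
p4 = [1.0, 0.5, -0.3, 0.8]     # a point of R^4 away from the origin
print(laplacian(E, p3), laplacian(E, p4))

# For n = 3 the constant reduces to the familiar -1/(4 pi |x|).
r3 = math.sqrt(sum(c * c for c in p3))
print(E(p3), -1.0 / (4.0 * math.pi * r3))
```

Both finite-difference Laplacians are zero up to discretization error, and for n=3 the formula agrees with the classical Newtonian kernel because $\Gamma(1/2)=\sqrt{\pi}$.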


With the help of distributions, a very simple and satisfactory definition of the
fundamental solution of the operator $P(\partial)$ can be given: it is a distribution E
which verifies $P(\partial)E=\delta$.


If E is a fundamental solution of $P(\partial)$, and if S is a distribution with compact
support, then $T=S*E$ is a solution of (8).


Proof. $P(\partial)(S*E)=S*(P(\partial)E)=S*\delta=S$.

This theorem already shows the importance of the result, obtained in 1953
independently by Malgrange and Ehrenpreis, according to which every partial
differential operator with constant coefficients has a fundamental solution.


16. We shall say that the distribution T is equal to the function f (or simply
that T is f) in the open subset $\Omega$ of $R^n$, if $\langle T,\varphi\rangle=\int_\Omega f(x)\varphi(x)\,dx$ for every $\varphi\in\mathcal{D}$
such that Supp $\varphi\subset\Omega$. We shall say that the partial differential operator $P(\partial)$ is
hypoelliptic, if whenever $P(\partial)T$ is an infinitely differentiable function in the
open set $\Omega\subset R^n$, then T itself is an infinitely differentiable function in $\Omega$. It has
been known for a long time that $\Delta$, and, more generally, that every elliptic differential
operator is hypoelliptic, and the problem arose to characterize the
hypoelliptic operators.

It is fairly easy to prove that $P(\partial)$ is hypoelliptic if and only if it has a fundamental
solution which is an infinitely differentiable function outside $\{0\}$. Actually,
if $P(\partial)$ is hypoelliptic, then every fundamental solution is trivially
infinitely differentiable outside $\{0\}$ since $\delta$ is equal to 0 and therefore infinitely
differentiable outside $\{0\}$. But the interesting problem was to give a condition
which uses only algebraic properties of the polynomial $P(\xi)$. This was done most
brilliantly by Hörmander in his thesis in 1957.


If in $P(\xi)$ we replace the real n-tuple $\xi=(\xi_1,\ldots,\xi_n)$ by the complex n-tuple
$\zeta=(\zeta_1,\ldots,\zeta_n)\in C^n$, where $\zeta_j=\xi_j+i\eta_j$, $\xi_j\in R$, $\eta_j\in R$ $(1\le j\le n)$, then we obtain
the polynomial $P(\zeta)$ in n complex variables. We also set $\zeta=\xi+i\eta$, with
$\xi=(\xi_1,\ldots,\xi_n)\in R^n$ and $\eta=(\eta_1,\ldots,\eta_n)\in R^n$. Hörmander's necessary and
sufficient condition for the hypoellipticity of $P(\partial)$ is as follows: if $P(\zeta)=0$ and
$|\zeta|\to\infty$, then necessarily $|\eta|\to\infty$.


17. The last operation we want to introduce is the Fourier transformation
of distributions. If x = (x_1, ..., x_n) and ξ = (ξ_1, ..., ξ_n) are two vectors
in R^n, we shall denote by x·ξ their inner product x_1ξ_1 + ... + x_nξ_n. The
Fourier transform f̂ = ℱf of an integrable function f on R^n is classically
defined by

\[ \hat f(\xi) = \int_{\mathbf{R}^n} f(x)\, e^{-2\pi i\, x\cdot\xi}\, dx . \]


If T = T_f, and if we require that ℱT = T̂ = T_{f̂}, then we have
(9) ⟨ℱT, φ⟩ = ⟨T, ℱφ⟩
for every φ ∈ 𝒟, since both sides of (9) are equal to


\[ \int\!\!\int f(x)\, \varphi(\xi)\, e^{-2\pi i\, x\cdot\xi}\, dx\, d\xi . \]


One is therefore tempted to define the Fourier transform ℱT of an arbitrary
distribution T by (9). Unfortunately this cannot be done, since ℱφ is not in 𝒟,
unless φ is identically zero. Indeed, by a theorem of Paley and Wiener, if φ has
compact support, then ℱφ is the restriction to R^n of an entire function of
exponential type on C^n, and can therefore not have a compact support unless it
vanishes identically.

There are two ways out of this dilemma. 


18. The first of them has been used already by Schwartz himself in his book
[20]. He introduces a vector space 𝒮 which is larger than 𝒟, and which consists
of all infinitely differentiable functions φ such that for every integer k and
every n-tuple of positive integers α the expression


(10) (1 + |x|^2)^k |∂^α φ(x)|


tends to zero as |x| → ∞. For each k and α we then define the seminorm p_{k,α}(φ)
as the maximum of (10) as x varies in R^n, and the family of seminorms (p_{k,α})
defines a topology on 𝒮, which is compatible with the vector space structure of
𝒮 (cf. 7). It then follows from Fourier’s inversion formula that φ ↦ φ̂ is an
algebraical and topological isomorphism from 𝒮 onto itself.

Next Schwartz considers the topological dual 𝒮′ of 𝒮, and calls its elements
tempered distributions. Since the canonical injection from 𝒟 into 𝒮 is continuous,
and since 𝒟 is dense in 𝒮, every tempered distribution is in particular a
distribution, i.e., 𝒮′ can be considered as a subset of 𝒟′. The Fourier transform
ℱT = T̂ of any T ∈ 𝒮′ is then defined by requiring that (9) shall be satisfied
for every φ ∈ 𝒮. Then T ↦ T̂ is a bijective map from 𝒮′ onto itself, and for
appropriate topologies on 𝒮′ it is even bicontinuous.

Let us now list some properties of the Fourier transformation. 

(a) The Fourier transform of the Dirac measure is the constant function 
equal to 1. 

(b) For any n-tuple of positive integers α, the Fourier transform of ∂^α δ is
the monomial function ξ ↦ (2πiξ)^α. For this reason it is more convenient to use
the differential operators D_j = (1/2πi)∂_j rather than ∂_j (1 ≤ j ≤ n). If we
write a partial differential operator with constant coefficients in the form
P(D) = Σ_{|α|≤m} a_α D^α, where D^α = D_1^{α_1} ⋯ D_n^{α_n}, then the Fourier
transform of P(D)δ is the polynomial function P(ξ) = Σ_{|α|≤m} a_α ξ^α.

(c) If S is a distribution with compact support, and T a tempered distribution,
then


(11) ℱ(S * T) = ℱ(S)·ℱ(T).


Every distribution with compact support is tempered, and, by an extension of
the Paley-Wiener theorem, its Fourier transform is the restriction to R^n of an
entire function on C^n. Therefore the right-hand side of (11) is well defined.


19. The second solution to our problem has been used mainly by Ehrenpreis
and Gel’fand-Šilov. They introduce the vector space Z of all entire functions of
exponential type on C^n, whose restrictions to R^n belong to 𝒮, and equip Z with
an appropriate topology. Then φ ↦ φ̂ is an isomorphism from Z onto 𝒟 and for
any T ∈ 𝒟′ we can define ℱT by asking that (9) be satisfied for every φ ∈ Z.
Then ℱT will not be a distribution any more but a generalized distribution, i.e.,
an element of the topological dual of Z. For an account of this theory we refer
to the book of Gel’fand and Šilov [8], to the references given there, and to a
paper by Ehrenpreis [7].


20. The considerations of no. 18 have led Schwartz to pose the problem of
division of distributions: Given a polynomial P(ξ) and a tempered distribution
S, does there exist a tempered distribution X such that P(ξ)X = S?

If the answer to this problem is affirmative, then every linear partial
differential equation P(D)T = S, where S is a given tempered distribution, has a
tempered solution T. Indeed, there exists then a tempered distribution X such
that P(ξ)X = Ŝ, and X will be the Fourier transform of the required distribution
T. In particular, every partial differential operator P(D) has then a tempered
fundamental solution.

After resisting several attempts, the problem of division was solved by
Hörmander [11] and Łojasiewicz [15]. The latter even proved the possibility
of division by a holomorphic function, and his methods were later used to study 
the structure of analytic manifolds. The solution of the problem of division 
inspired Malgrange to prove important theorems concerning overdetermined 
systems of partial differential equations [16, 17], and subsequently to obtain 
significant results in differential topology, in particular a proof of the so-called 
preparation theorem for differentiable functions, for which we refer to his 
book [18]. 


A lecture delivered at the session on the Theory of Distributions of the Mathematical
Association of America held in New Orleans, January 27, 1969.
A Remark Concerning the Alaoglu Theorem
ELLEN TORRANCE, Mount Holyoke College 


Use a Tychonoff Theorem attack 
To prove this Alaoglu fact: 


The closed unit ball 
In a normed space dual 


Is Hausdorff, and weak-* compact. 


APPLICATIONS OF DISTRIBUTIONS TO PDE THEORY 
F. TREVES, Purdue University


The impact of Schwartz distributions on the theory of linear partial dif- 
ferential equations is today a well-recognized fact. There is no exaggeration in 
saying that the theory has been revolutionized during the fifties with the 
appearance of the works of Malgrange, Ehrenpreis, Hörmander, et al. Great
advances have been made in the last fifteen years; further advances are still 
being made, with the theory of pseudodifferential operators, of hyperfunctions, 
of analytic functionals—in the wake of the first breakthroughs. 

The advantages of distributions are manifold: a much needed increase in 
the “reservoir” of data and possible solutions (that the increase was needed is 
made clear by the “avant-garde” work of Hadamard, Sobolev, Bochner, and 
others); an automatization of the elementary operations such as taking limits, 
differentiations, convolutions, regularizations, Fourier transformation, etc. 
Another aspect, which I should like to emphasize here, has been the diffusion 
throughout PDE theory of the methods and results of functional analysis, 
particularly of the theory of general locally convex spaces. This theory was to a 
large extent built by S. Banach, but could not find significant applications until 
the late forties, when L. Schwartz gave his formulation of distribution theory. 
Not only did this formulation call for a systematic investigation of the properties 
of locally convex spaces (eventually presented in Bourbaki [1]) but it promised 
extensive applications. This, J. Dieudonné and L. Schwartz foresaw very 
clearly, and their pioneering work [1] provided many of the results that were to 
be useful later. 

This manifold influence of distributions on PDE theory is best exemplified, 
in my opinion, in the statement and the proof of an important theorem of Ber- 
nard Malgrange, and it is perhaps worthwhile to give here a detailed exposition 
(and proof!) of this result. The reader who would be willing to go carefully 
through all the steps of the reasoning which we now present will encounter, in 
succession and meaningfully “embodied,” the main aspects of the distributions 
impact. 

First of all, what is the problem we wish to solve? We deal with a linear PDE
with constant coefficients, in n variables, which we write


(1) Pu = f.

The right hand side f is given: it is a complex-valued function in some (given)


Professor Treves received his Sorbonne PhD under Laurent Schwartz in 1958. He held posi- 
tions at Berkeley, Yeshiva, and the Sorbonne, and visiting posts at Rio de Janeiro, Brasilia, and 
the Tata Institute, and is now at Purdue University. He was a Sloan Fellow from 1960 to 1964. 
His extensive research in partial differential equations includes the books Topological Vector Spaces, 
Distributions and Kernels, Academic Press 1967, Linear Partial Differential Equations with Constant 
Coefficients, Gordon and Breach 1967, and Locally Convex Spaces and Linear Partial Differential 
Equations, Springer Grundlehren series, 1967. Editor.
open subset Ω of the n-dimensional Euclidean space R^n; moreover f is infinitely
differentiable, that is, f belongs to the space C^∞(Ω). Beyond this, f is arbitrary.
We want to know whether there exists a solution u to (1). But we also want this
solution to be infinitely differentiable in the open set Ω. Malgrange’s theorem
will provide a necessary and sufficient condition, bearing on the open set Ω and
on the differential operator P, insuring that given any f in C^∞(Ω), there is u in
C^∞(Ω) satisfying (1).

The reader cannot fail to observe that the statement of the problem involves 
no distributions, in fact that the problem is most “classical” in nature. Still the 
only proof known today (and to be found below) makes extensive use of dis- 
tribution theory and of its most revolutionary aspects. 

We must state the necessary and sufficient condition for solvability of 
Eq. (1). Let us write explicitly the differential operator P: 


\[ P = P(\partial/\partial x) = \sum_{p} c_p\, (\partial/\partial x)^p , \]


where the sum is finite, the c_p are complex numbers, and we have used the
multi-index notation, namely that if p = (p_1, ..., p_n),


\[ (\partial/\partial x)^p = (\partial/\partial x_1)^{p_1} \cdots (\partial/\partial x_n)^{p_n} . \]

We set then (writing |p| = p_1 + ... + p_n)

\[ P(-\partial/\partial x) = \sum_{p} (-1)^{|p|} c_p\, (\partial/\partial x)^p , \]


observing that if φ and ψ are two C^∞ functions with compact support in the whole
space R^n, integration by parts yields


(2) ∫ φ(x) P(∂/∂x)ψ(x) dx = ∫ P(−∂/∂x)φ(x) ψ(x) dx.


DEFINITION 1. The open set Ω is said to be P-convex if to every compact subset
K of Ω there is another compact subset K′ of Ω such that, given any C^∞ function φ
with compact support in Ω, the following is true:


(3) if P(−∂/∂x)φ = 0 outside of K then φ = 0 outside of K′.


Before we state the theorem, let us note that partial differentiations, and
therefore any operator such as P, define linear mappings of C^∞(Ω) into itself.
And solvability of Eq. (1) in the sense we have given to it simply means that the
linear map


P: C^∞(Ω) → C^∞(Ω)

is onto (or surjective), i.e., that P C^∞(Ω) = C^∞(Ω).


THEOREM (B. Malgrange, 1954). We have P C^∞(Ω) = C^∞(Ω) if and only if Ω is
P-convex.


Of course, such a result raises the question as to when an open set is P-convex.
Although this is a difficult question (no complete characterization has been
found yet) we have some information. For instance, every convex set, in
particular the whole space R^n, is P-convex regardless of what the operator P may be
(barring the case where all the coefficients c_p are zero!). If the operator P is
elliptic, every open set is P-convex. When the number of independent variables
n is equal to two, the P-convex sets are known for every operator P (see
Hörmander [1], Th. 3.7.2).

Proof of the “only if” part: We assume that Eq. (1) always possesses a C^∞
solution u in Ω, for arbitrary C^∞ right-hand sides f. We must derive from this
that Ω is P-convex.

Let K be a compact subset of Ω and let Φ(K) denote the set of all the C^∞
functions φ with compact support in Ω such that P(−∂/∂x)φ = 0 outside of K. Now,
one of the first steps in applying distribution techniques is to equip the function
spaces under consideration with a locally convex topology, and this is always
done by means of appropriate seminorms. First we equip C^∞(Ω) with its standard
C^∞ topology: functions φ converge in C^∞(Ω) if they and every one of their
derivatives converge uniformly on every compact subset of Ω. This convergence can
be defined by using the seminorms


\[ |\varphi|_{H,m} = \sup_{x \in H} \Big( \sum_{|p| \le m} |(\partial/\partial x)^p \varphi(x)| \Big), \]


where H is an arbitrary compact subset of Ω, and m any integer ≥ 0 (sup stands
for the supremum, which here is a maximum). Thus φ converges to φ_0 in C^∞(Ω)
if, given any H and any m, |φ − φ_0|_{H,m} → 0. Provided with this topology, C^∞(Ω)
is a locally convex topological vector space. Moreover, the topology can be
defined by a metric, i.e., C^∞(Ω) is metrizable (we shall never use explicitly the
metric), and it is easily seen to be complete: every Cauchy sequence converges
in it. These two properties of being locally convex on one hand, and metrizable
and complete, on the other, allow us to apply the main theorems of S. Banach:
the Hahn-Banach theorem and the open mapping theorem.

Note that Φ(K) is a linear space; we equip it also with a topology: the one
defined by the seminorms


(4) |P(−∂/∂x)φ|_{K,m}, m = 0, 1, ...

It is easy to see, by a variety of ways, that each seminorm (4) is in fact a norm,
that is, if it vanishes, the function φ must be identically equal to zero. The best
way (because it extends to the case where P has variable coefficients) is to use
our assumption, namely that (1) is solvable: indeed, let us solve Eq. (1) with
f = φ̄ (complex conjugate of φ). Then, by (2) we have:


\[ \int \varphi \bar\varphi\, dx = \int \varphi\, Pu\, dx = \int P(-\partial/\partial x)\varphi \cdot u\, dx = 0 \quad \text{if } P(-\partial/\partial x)\varphi = 0 ; \]


whence ∫ |φ|^2 dx = 0 which implies φ = 0. Other methods use Hörmander’s
inequalities or the Paley-Wiener theorem (see below). At any rate, the linear
space Φ(K) topologized by (4) is now metrizable and locally convex. However, it
is not complete (in general).

We consider the bilinear functional


(φ, f) ↦ ∫ φ(x) f(x) dx


on the product space Φ(K) × C^∞(Ω). We are going to show that it is separately
continuous (in each one of its arguments, φ and f). Fixing φ, the continuity with
respect to f is trivial, since


|∫ φ f dx| ≤ |f|_{H,0} ∫ |φ| dx,


if φ vanishes outside of the compact subset H of Ω. Next we keep f fixed and study
the continuity with respect to φ. At this critical stage we use the fact that (1)
has a solution u in C^∞(Ω) (it would suffice to assume that (1) has a distribution
solution, and this fact would show that the solvability of Eq. (1) in the space of
distributions, for arbitrary C^∞ right hand sides, implies its solvability in the
space C^∞(Ω)). We have


\[ \int \varphi f\, dx = \int \varphi\, Pu\, dx = \int P(-\partial/\partial x)\varphi \cdot u\, dx , \]

hence

\[ \Big| \int \varphi f\, dx \Big| \le |P(-\partial/\partial x)\varphi|_{K,0} \int_K |u|\, dx , \]

which implies the desired continuity.

We use now a general theorem of Dieudonné-Schwartz which states that if
E and F are two locally convex metrizable spaces and if moreover F is complete,
every separately continuous bilinear functional on E × F is in fact continuous.
Thus our functional ∫ φ f dx must be continuous, which can be expressed by the
fact that there is a compact set K′ ⊂ Ω, two integers m and m′ and a constant
C > 0 such that, for all φ in Φ(K) and all f in C^∞(Ω),

\[ \Big| \int \varphi f\, dx \Big| \le C\, |P(-\partial/\partial x)\varphi|_{K,m}\, |f|_{K',m'} . \]

But this implies that ∫ φ f dx = 0 whenever f vanishes identically in a neighborhood
of K′. This is only possible if φ vanishes outside of K′.

Proof of the “if” part: Now we assume that Ω is P-convex and prove that
Eq. (1) is always solvable in C^∞(Ω), i.e., that the mapping P acting on C^∞(Ω) is
onto. If C^∞(Ω) were finite dimensional (which it is not!), a simple criterion of
surjectivity would be that the transpose of P, ᵗP, acting on the dual of C^∞(Ω)
(dual which is customarily denoted by ℰ′(Ω)), is one-to-one (or injective). The
injectivity of the transpose is no longer enough to insure the surjectivity of
the mapping when the space acted upon is infinite dimensional. To palliate
this, we turn to the topologized space C^∞(Ω) and apply a criterion due to
S. Banach and valid for metrizable complete locally convex spaces, which is
exactly what C^∞(Ω) is. The criterion concerns, here also, the transpose of the
mapping under consideration. Let us state it in full generality:


LEMMA 1 (S. Banach). Let E, F be two Fréchet spaces (i.e., locally convex,
metrizable, and complete), T: E → F a continuous linear map, E′, F′ the respective
duals of E and F. Then the two following properties are equivalent:

(a) T is onto;

(b) the transpose ᵗT of T is one-to-one and its range is weakly closed.


This statement (which is classical) calls for some comments. First of all by
the dual of E we mean the linear space of continuous linear functionals on E,
i.e., of continuous linear mappings E → C (the complex field). Second, by the
transpose of T we mean the unique linear mapping


ᵗT: F′ → E′

such that, given any x in E and any y′ in F′,

⟨ᵗT(y′), x⟩ = ⟨y′, T(x)⟩,


where ⟨ , ⟩ is the bracket of the duality, either between E and E′ or between
F and F′.

Finally, what do we mean by weakly closed? The range of ᵗT is a linear subspace
of E′ and it should be closed for the weak topology on E′, which is simply
the topology of pointwise convergence for (linear) functions defined in E.

We want to apply Lemma 1 in our present situation, which is to say when
E = F = C^∞(Ω) and when T = P. Therefore we have to check that (b) holds in this
situation. In order to do this, we use the fact that ℰ′(Ω), the dual of C^∞(Ω), can
be regarded as a space of distributions in Ω. We recall that the space of all
distributions in Ω, 𝒟′(Ω), is the dual of the space of test-functions, i.e., of
C^∞ functions with compact support in Ω, C_c^∞(Ω) (suitably topologized). Now the
natural injection

C_c^∞(Ω) → C^∞(Ω)


is continuous and has a dense image; therefore its transpose is a continuous
linear mapping (say, for the weak topologies)


ℰ′(Ω) → 𝒟′(Ω)


which is injective. We may then identify ℰ′(Ω) with its image, a linear subspace
of 𝒟′(Ω). Can we characterize this subspace of 𝒟′(Ω)? Yes, and the answer is
extremely simple: a distribution belongs to ℰ′(Ω) if and only if it has compact
support (this is seen at once by observing that a linear functional λ on C_c^∞(Ω),
if it is to be continuous for the topology induced by C^∞(Ω), must satisfy, for some
constant C > 0, some compact subset K of Ω, and some integer m,

|λ(φ)| ≤ C |φ|_{K,m} for all φ in C_c^∞(Ω);


therefore λ(φ) = 0 whenever φ vanishes in a neighborhood of K, i.e., the support
of λ is contained in K).

We may already check the first part of Condition (b), that ᵗP, acting on
distributions with compact support, is one-to-one. Indeed, we apply Fourier
transformation on the equation


ᵗPλ = 0;

this yields at once an equation of the form Q(ξ)λ̂(ξ) = 0, where Q is a polynomial
which is not identically zero and λ̂ is the Fourier transform of λ;
but the Paley-Wiener-Schwartz theorem states that the Fourier transform of a
distribution with compact support is an entire function of exponential type;
the product of such a function with a nonidentically zero polynomial cannot
vanish identically unless the function itself does. But then Fourier
transformation acting on distributions with compact support (or, for that matter,
on tempered distributions) is one-to-one; therefore λ itself must be zero. This
provides another proof of the fact that the seminorms (4) are norms.

As a last step, we must prove that the range of ᵗP is weakly closed. For this
we use another criterion, due also to Banach:


LEMMA 2 (S. Banach). Let E be a Fréchet space, {p_α} a collection of seminorms
on E such that the sets


U_α = {e in E; p_α(e) ≤ 1}


form a basis of neighborhoods of zero in E. Let U_α^0 denote the polar of U_α, that
is the subset of E′ consisting of those linear functionals e′ such that


|⟨e′, e⟩| ≤ 1 for all e in U_α.

A linear subspace M′ of E′ is weakly closed if and only if its intersection with
every polar set U_α^0 is weakly closed.

We apply this, taking p_α to be the seminorm

N |φ|_{K,m}; N, m = 1, 2, ...,

K ranging over all the compact subsets of Ω. Then the assumption in Lemma 2
is satisfied. The polar U_α^0 consists exclusively of distributions having their
support contained in K. Indeed, saying that λ belongs to U_α^0 is equivalent with
saying that for all functions φ in C_c^∞(Ω),


|⟨λ, φ⟩| ≤ N |φ|_{K,m};


therefore ⟨λ, φ⟩ = 0 whenever φ vanishes in a neighborhood of K, which is simply
saying that the support of λ is contained in K.
We have to prove that

(5) ᵗP ℰ′(Ω) ∩ U_α^0


is weakly closed. Observe that U_α^0 is always weakly closed (polar sets always are).
Let {λ_j} be a net, or a filter, in the intersection (5) converging weakly to
λ_0 ∈ ℰ′(Ω). Since λ_0 belongs to U_α^0, all we have to do is to show that λ_0
belongs to the range of ᵗP, i.e., that there is μ_0 in ℰ′(Ω) such that


λ_0 = ᵗPμ_0.

Now, each λ_j is of the form

(6) λ_j = ᵗPμ_j, μ_j in ℰ′(Ω).

We use now a basic theorem in the theory of PDEs with constant coefficients:
We use now a basic theorem in the theory of PDEs with constant coefficients: 


THEOREM (B. Malgrange, L. Ehrenpreis, 1952). Every linear PDE with con- 
stant coefficients (nonidentically zero) has a fundamental solution. 


We apply this result to P(−∂/∂x), acting on distributions. But the
definition of this action, which generalizes the integration by parts formula (2),
shows that P(−∂/∂x) = ᵗP. We see thus that there is a distribution E in R^n such that


ᵗPE = P(−∂/∂x)E = δ, the Dirac distribution.


Next we use convolution, in relation to (6); we convolve both sides with E.
But in order to differentiate a convolution of two distributions (one of which,
at least, must have compact support: this is the case here), we may differentiate
indifferently any one of the factors. Therefore:


E * λ_j = E * P(−∂/∂x)μ_j = {P(−∂/∂x)E} * μ_j = δ * μ_j = μ_j.

We exploit the fact that λ ↦ E * λ is a continuous linear map of ℰ′(R^n), the
distributions with compact support in R^n, into 𝒟′(R^n), the space of all
distributions in R^n (this type of continuity is one of the standard features of
operations on distributions; all duals carry their weak topologies). We obtain
that the μ_j = E * λ_j converge weakly to a distribution μ_0 in R^n. We must have,
by continuity,


μ_0 = E * λ_0.

By applying P(−∂/∂x) to both members, we obtain

λ_0 = ᵗPμ_0.


Thus λ_0 belongs to the range of ᵗP but not as a mapping in ℰ′(Ω), since we do
not know yet whether μ_0 has a compact support contained in Ω. This is what
remains to be shown.

Suppose we knew that all the μ_j had their support in a fixed compact subset
K′ of Ω. Then this would also be true of μ_0 since such a property subsists when
we take weak limits. What we know is that P(−∂/∂x)μ_j has its support
contained in K (see remark following Lemma 2). At this very last stage of the
proof, we use the P-convexity of Ω. Recalling that ᵗP extends P(−∂/∂x), we would
have the desired property if (3), in Definition 1, were valid for distributions with
compact support, in addition to being valid for test functions. The proof will be
completed if we show this to be the case.

Let φ_k, k = 1, 2, ..., be a sequence of test functions in R^n converging to the
Dirac distribution δ in standard fashion: for instance, φ_k ≥ 0; φ_k vanishes
identically outside of the sphere of radius 1/k; ∫ φ_k dx = 1 for all k. Consider
the convolution φ_k * μ, where μ in ℰ′(Ω) is such that the support of P(−∂/∂x)μ is
contained in K. As soon as k is large enough so that the neighborhood of order
1/k of the support of μ is a compact subset of Ω, we have that φ_k * μ belongs to
C_c^∞(Ω). Moreover,


P(−∂/∂x)(φ_k * μ) = φ_k * P(−∂/∂x)μ


vanishes outside the neighborhood of order 1/k of K. By Property (3) we derive
that there is a compact subset K′ of Ω such that


φ_k * μ = 0 outside of K′,


as soon as k is large enough. By taking k to the limit +∞, we conclude that μ
vanishes outside of K′, which is what we wanted to prove.


An invited address before the Mathematical Association of America in New Orleans, January 
CODING THEORY: A COUNTEREXAMPLE TO G. H. HARDY’S
CONCEPTION OF APPLIED MATHEMATICS 


NORMAN LEVINSON, Massachusetts Institute of Technology 


1. Introduction. A major theme of G. H. Hardy in “A Mathematician’s 
Apology” [5] is the division of mathematics into pure mathematics, “the ‘real’ 
mathematics of the ‘real’ mathematicians which is almost wholly ‘useless’ ” 
[5, p. 119] and applied mathematics, which he regarded as dull and trivial. In 
contrast to the harmlessness and innocence of ‘real’ mathematics, the “trivial 
mathematics on the other hand has many applications in war.” See [5, p. 141]. 
Hardy exults particularly in the uselessness of number theory which, if “real 
mathematics” were useful, could be exploited for evil as well as good. Hence 
“Gauss and lesser mathematicians may be justified in rejoicing that there is one 
science at any rate, and that their own, whose very remoteness from ordinary 
human activities should keep it gentle and clean.” See [5, p. 121]. 

Hardy did approve of theoretical physics as exemplified by relativity and 
quantum mechanics, but regarded them as quite useless [5, p. 135]. If time has 
shown him wrong about this, it can be argued that in these subjects he was not 
an expert, and therefore the real test of his ideas concerns pure mathematics.

Here we shall show how coding theory refutes Hardy’s notion. Finite fields, 
also called Galois fields, and theorems from number theory play a central role 
in coding theory. In some areas of applied mathematics, the role of pure math- 
ematics is often at best one of reassurance, such as in providing a nonconstruc- 
tive existence theorem or a uniqueness theorem, but not in providing the com- 
putational or analytic procedures that yield the actual results. In practice the 
procedures used may involve more intuition and experience than rigor. This is 
not the case in coding theory, where pure mathematics supplies the constructive 
procedure for carrying out coding. This may surprise applied mathematicians 
more than it will pure mathematicians. To accommodate those applied readers, 
our account will not require familiarity with finite fields or number theory. 
Rather we shall start with the problem of error correction in the transmission 
of information by use of codes, and show how this leads to the introduction of 
a certain mathematical object which is in fact a finite field. Cyclotomic poly- 
nomials, a discovery of Gauss, will play a key role. 

Quadratic residues and the law of quadratic reciprocity (which Hardy [5,
p. 92] regarded as one of the most beautiful theorems of mathematics) also enter
coding theory [1, pp. 173, 354]. Another working tool is the Chinese remainder
theorem [1, p. 339]. Hardy [5, p. 113] discusses the aesthetic quality of “real
mathematics.” Here the highly regarded theorems and their proofs possess
“a very high degree of unexpectedness, combined with inevitability and economy.”
This is true of the manner in which finite fields enter coding theory, as we shall
see.

was a travelling Fellow at Cambridge University in 1934-35 and an NRC Fellow in 1935-37. He
Academy of Sciences. Editor.

Not unexpectedly, finite fields were introduced into coding theory mainly 
by men trained as mathematicians. Some of the early work was apparently 
not published. The particular development which will be described here, the 
BCH codes, is due independently to Bose and Ray-Chaudhuri [2] and to
Hocquenghem [6]. What is most important for the actual usefulness of the
method, an efficient decoding process for these codes was discovered by an 
engineer, Peterson [7]. The BCH codes were generalized considerably by 
Gorenstein and Zierler [3]. 

Berlekamp [1, p. vii] states that “the essential limitation of all coding and 
decoding schemes... (has been) the complexity (and cost) of the decoder. 
The important work of Reed and Solomon (1960), Bose and Chaudhuri (1960), 
Gorenstein and Zierler (1961), and Peterson (1961) marked the advent of a new 
approach to this problem. By associating each digit of certain codes with an 
element in a Galois field, it was found possible to derive an algebraic equation 
whose roots represent the locations of the channel errors. . . . As a consequence 
it is now possible to build algebraic decoders which are orders of magnitude 
simpler than any that have previously been considered.” 

The notation used below mainly conforms with that used in Berlekamp [1]. 


2. Coding. Here a message will mean a finite ordered sequence of two sym- 
bols which it is desired to transmit through a channel. For example the channel 
may be a cable or a radio frequency band. It will be convenient to designate the 
two symbols as 0 and 1. A sequence of k such symbols may be regarded as a 
binary k-vector (a_1, a_2, ..., a_k), where each a_i is either 0 or 1. Clearly there are
2^k binary k-vectors. If the transmission channel is noisy, the received vector may
differ from the one sent, that is, the transmission process may introduce errors. 
One way to improve reliability is to repeat the message several times. This is an 
example of the use of redundancy, that is the transmission of more than the k 
binary digits contained in the original message in order to improve the reliabil- 
ity of the transmission process. 
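
As a rough illustration of redundancy by repetition, the following sketch sends the whole message three times and decodes each position by majority vote. The majority rule and the names `encode_repetition` and `decode_repetition` are assumptions made for this example only; the text above says merely that the message is repeated.

```python
def encode_repetition(message, copies=3):
    """Send the whole k-digit message `copies` times in a row."""
    return list(message) * copies


def decode_repetition(received, k, copies=3):
    """Recover each of the k digits by majority vote over its copies."""
    decoded = []
    for i in range(k):
        # Position i of the message appears at i, i + k, i + 2k, ...
        votes = sum(received[i + j * k] for j in range(copies))
        decoded.append(1 if 2 * votes > copies else 0)
    return decoded


message = [1, 0, 1, 1]
sent = encode_repetition(message)
received = list(sent)
received[5] ^= 1  # the channel flips one digit
recovered = decode_repetition(received, k=len(message))
```

With three copies, 3k digits are sent to protect k, and a single flipped digit is outvoted by the two unaffected copies.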

Simple repetition is not efficient. In general, a binary n-vector is transmitted
with n = k + r, where k is the number of binary digits which form a
message and r is the number of redundant digits. These redundant digits are
determined according to some rule by the k digits of the message. The process
of constructing the redundant n-vector from the message k-vector is called
encoding. While there are 2^n binary n-vectors, the encoding process leads to a
subset of 2^k of these, which may be called code-vectors. Because of errors in
transmission, the n-vectors which are received need not be code-vectors. The
process of correcting the received n-vector and extracting the original k-vector
is called decoding. The arithmetic operations in encoding and decoding will be
carried out modulo 2, that is, 1 + 1 = 0. This is equivalent to binary addition
with no carry-over, and hence is a process easy to design into an electronic
computer. The binary arithmetic to be used here need involve only 0 and 1,
with rules 0 + 0 = 0, 0 + 1 = 1 + 0 = 1, 1 + 1 = 0, 0·0 = 0·1 = 1·0 = 0 and 1·1 = 1.
With these rules, 0 and 1 form a field of two elements, which is known as GF(2),
the Galois field of two elements. (This is not the place where finite fields play a
crucial role in coding theory, since GF(2) by itself is rather trivial.) All
arithmetic that follows involving vectors, matrices, and polynomials will be carried
out in GF(2). We recall that modulo 2 all even integers may be replaced by 0
and all odd integers by 1.
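
The rules of GF(2) can be written out in a few lines. This is only a sketch; the function names `gf2_add`, `gf2_mul`, and `gf2_dot` are hypothetical and chosen for this illustration.

```python
def gf2_add(a, b):
    """Addition modulo 2: binary addition with no carry-over."""
    return (a + b) % 2


def gf2_mul(a, b):
    """Multiplication modulo 2."""
    return (a * b) % 2


def gf2_dot(u, v):
    """Inner product of two binary vectors, computed in GF(2)."""
    total = 0
    for a, b in zip(u, v):
        total = gf2_add(total, gf2_mul(a, b))
    return total


# The full addition and multiplication tables of GF(2).
addition_table = {(a, b): gf2_add(a, b) for a in (0, 1) for b in (0, 1)}
multiplication_table = {(a, b): gf2_mul(a, b) for a in (0, 1) for b in (0, 1)}
```

On the digits 0 and 1, addition coincides with exclusive-or and multiplication with logical and, which is why the arithmetic is easy to build into hardware.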


3. Hamming single error correcting code. Suppose a channel is sufficiently
reliable so we can assume that if a binary n-vector is transmitted, then the
received binary n-vector contains an error in at most one entry. How much
redundancy will allow the position of the error to be determined? Suppose m is
a positive integer and set $n=2^m-1$. (This assumption about n here and later is
more restrictive than necessary, but is sufficient to illustrate the basic ideas.)
A binary number $\theta$ that could designate which of the n received binary digits
contains an error must itself have m digits, because it requires m binary digits
to represent the positive integers not exceeding $2^m-1$. The occurrence of no
error can be designated by all m digits of $\theta$ zero. As an example, let m=4 and
hence n=15. Then the four-place binary numbers starting with 0001 and ending
with 1111 represent all integers from 1 to 15. The above remarks suggest that it
may be possible to correct a single error in the transmission of an n-vector,
where $n=2^m-1$, if r=m and hence $k=n-m$. A feasible method for doing so
was discovered by Hamming [4]. Suppose again that m=4 so that n=15 and
r=4; hence k=11. All vectors which will be considered from here on will be
column vectors which (for typographical reasons) may also be written in row
form. Denote the code vectors by $C=(C_1, C_2, \ldots, C_{15})$, where the $C_j$ are binary
digits. Let H be a matrix of 15 columns, each column a binary vector with four
entries, all columns distinct, and none identically zero:


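$$H = \begin{pmatrix}
0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & \cdots & 1 \\
0 & 0 & 0 & 1 & 1 & 1 & 1 & 0 & \cdots & 1 \\
0 & 1 & 1 & 0 & 0 & 1 & 1 & 0 & \cdots & 1 \\
1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & \cdots & 1
\end{pmatrix}$$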


(If a column of H is written as a row, then that row considered as a binary num- 
ber, represents the column number. This is convenient but not essential.) 


Let the eleven entries $C_3, C_5, C_6, C_7, C_9, C_{10}, \ldots, C_{15}$ of C be the entries of
the message k-vector. Determine $C_1$, $C_2$, $C_4$, and $C_8$ so that

$$HC = 0. \tag{3.1}$$

This is possible because the square matrix made up of the eighth, fourth, second,
and first columns of H is nonsingular. (Indeed it is the unit matrix.) By (3.1)
all code vectors C are orthogonal to the rows of H. Since the arithmetic above
is all modulo 2, $C_1$, $C_2$, $C_4$, and $C_8$ are of course binary digits. This determination
of the code vector C completes the encoding process.

Suppose for the moment that errors occur in several of the digits in the transmission
of C so that the received binary n-vector R does not coincide with C.
Performing addition mod 2 componentwise, let the binary n-vector E be
defined by

$$E = R - C = R + C. \tag{3.2}$$

If the jth entry of R, namely $R_j$, and that of C, namely $C_j$, coincide, then $E_j=0$.
But if $R_j \neq C_j$, then $E_j=1$. Hence the vector E has entries differing from 0 at
precisely those positions where an error in transmission occurs. Consider now

$$HR = HC + HE = HE,$$

where use is made of (3.1). If no error has occurred in transmission, then E=0,
and so HR=0. If exactly one error has occurred in transmission and this error
is in the jth term, then $E_j=1$ and so

$$HR = H^{(j)},$$

where $H^{(j)}$ is the jth column of H. Clearly knowledge of the vector $H^{(j)}$ determines
j. (Indeed with the H above, writing $H^{(j)}$ in row form gives the binary
representation of j.) Hence if at most one error occurs, HR determines at which,
if any, entry the error occurs. Thus the binary n-vector E is determined, and
since C=R+E, the vector C is now available. If the entries $C_1$, $C_2$, $C_4$, and $C_8$
are discarded, the resulting binary k-vector is the original message. This process
of reconstructing the message from R is the decoding process. Of course if more
than one error occurs, that is, if E has two or more entries which are 1, the
above procedure is not valid.
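
The encoding and decoding steps of this section can be sketched in a few lines of Python. The sketch assumes the H displayed above, whose jth column is the binary representation of j, so that HR is simply the mod-2 sum (exclusive-or) of the column numbers at which R has a 1. The names `syndrome`, `encode`, and `decode` are illustrative choices, not notation from the paper.

```python
N = 15
CHECK = (1, 2, 4, 8)  # positions of C_1, C_2, C_4, C_8
DATA = tuple(j for j in range(1, N + 1) if j not in CHECK)  # 3, 5, 6, 7, 9, ..., 15


def syndrome(word):
    """Return H*word over GF(2) as an integer; word[j - 1] is entry j.

    Column j of H is the binary representation of j, so H*word is the
    exclusive-or of the column numbers at which word has a 1.
    """
    s = 0
    for j in range(1, N + 1):
        if word[j - 1]:
            s ^= j
    return s


def encode(message):
    """Place the 11 message bits in the data positions, then solve HC = 0."""
    assert len(message) == len(DATA)
    c = [0] * N
    for pos, bit in zip(DATA, message):
        c[pos - 1] = bit
    s = syndrome(c)  # contribution of the data bits alone
    for p in CHECK:
        # The column at position p has a single 1, so this bit cancels
        # exactly one binary digit of s.
        c[p - 1] = 1 if s & p else 0
    return c


def decode(received):
    """Correct at most one error and return the 11 message bits."""
    r = list(received)
    j = syndrome(r)  # 0 means no error; otherwise the position in error
    if j:
        r[j - 1] ^= 1
    return [r[pos - 1] for pos in DATA]
```

For example, flipping any single entry of `encode(msg)` and passing the result to `decode` returns `msg` again; with two or more flipped entries the procedure silently returns a wrong message, as noted in the text.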

To correct more than one error in R, the received binary n-vector, one would
expect to increase r, the number of redundant digits, and hence to decrease k. 
Moreover, for HR then to yield the error locations in R, one would expect H 
to have more rows. The decoding process can be expected to be least compli- 
cated if H has a structural pattern based on a reasonably simple mathematical 
algorithm. A comparatively simple mathematical scheme for locating, at least 
in principle, up to a prescribed number of errors is used in the BCH codes men- 
tioned earlier. The BCH codes make use of finite fields. 


4. The fields $GF(2^m)$. (The reader familiar with finite fields can skim this
section.) A column of an m-rowed matrix H may be regarded as a binary
m-vector (where of course $m \geq 1$). If x is an indeterminate and the $a_j$ are all
in GF(2), then the polynomial $\sum_{j=0}^{m-1} a_j x^j$ of degree $m-1$ can be used to represent
a binary m-vector with entries $a_j$, where $0 \leq j \leq m-1$. (Here it is convenient
to start the index j at 0.) Since each $a_j$, for $0 \leq j \leq m-1$, can be either 0 or 1,
there are a total of $2^m$ of these polynomials of degree not exceeding $m-1$. Addition
of these polynomials is equivalent to vector addition in GF(2) and leads
again to one of the $2^m$ polynomials.

These rather trivial observations become profound if one further requires 
that the product of any two of the polynomials is again such a polynomial. 
Since the degree of the product of two polynomials is the sum of the degrees 
of each, the degree of the product will be at most m—1 only if some artifice is 
used. One way to achieve this is to compute polynomials modulo a fixed polynomial
of degree m which we shall call f(x). Thus if P(x) is a polynomial, then
P(x) is equivalent to $P_1(x)$ where $P_1(x)$ is the remainder obtained in dividing
P(x) by f(x); thus

$$P(x) \equiv P_1(x) \pmod{f}$$

means

$$P(x) = I(x) f(x) + P_1(x),$$

where I(x) is a polynomial and the degree of $P_1(x)$ is at most $m-1$. The arithmetic
in the division of course is performed in GF(2). In particular $P(x) \equiv 0$
(mod f) if and only if f(x) is a divisor of P(x).

As already stated there are exactly $2^m$ polynomials of degree not exceeding
$m-1$ and with coefficients in GF(2). In this paragraph we shall exclude the null
polynomial for which all $a_j=0$. Thus there remain $n=2^m-1$ polynomials. The
manipulation of the m-vectors represented by these polynomials becomes particularly
simple if the sequence

$$\{x^j\}, \quad 0 \leq j \leq n-1, \pmod{f} \tag{4.1}$$

generates all n nonnull polynomials.

Example: m=2, $f(x)=x^2+x+1$; hence n=3. Then the sequence 1, x, $x^2$
(mod f) is 1, x, 1+x, which are the three nonnull polynomials of degree not
exceeding $m-1=1$ with coefficients in GF(2).

We shall show that the sequence (4.1) generates all n of the nonnull polynomials
if

$$x^n \equiv 1 \pmod{f} \quad \text{and} \quad x^k \not\equiv 1 \pmod{f} \ \text{ for } 1 \leq k < n. \tag{4.2}$$

(This is the statement that (4.1) should form a cyclic group of order n.) The
$x^j$, $0 \leq j \leq n-1$, are distinct (mod f). Indeed suppose that

$$x^j \equiv x^k \pmod{f}, \quad 0 \leq j < k \leq n-1.$$

Then multiply the above equation by $x^{n-k}$ and use the first equation of (4.2)
to get

$$x^{n-(k-j)} \equiv 1 \pmod{f}.$$

Since k>j, so that $0 < n-(k-j) < n$, this violates (4.2). Furthermore no $x^j \equiv 0$
(mod f), since multiplying by $x^{n-j}$, we should have $1 \equiv 0$ (mod f) or f(x) divides 1,
which is impossible since degree $f = m \geq 1$. Hence the n elements of the sequence
(4.1) are all distinct (mod f) and none is the null element. Therefore the sequence
(4.1) generates all n of the nonnull polynomials if (4.2) holds.

If $y=x^i$ for some fixed $i \geq 1$, then the least positive integer $\lambda$ for which
$y^\lambda \equiv 1$ (mod f) is called the order of y. If $y^k \equiv 1$ (mod f) for some $k \geq 1$, then k is
a multiple of $\lambda$. Indeed let $k=q\lambda+s$ where $q \geq 0$ and $0 \leq s < \lambda$. Then
$1 \equiv y^k = y^{q\lambda+s} \equiv y^s$ (mod f). From the definition of $\lambda$, since $s<\lambda$, this implies s=0 and
proves the following special case of a classical result:


LEMMA 4.1. Let y be a power of x and let y be of order $\lambda$. If $y^k \equiv 1$ (mod f), then
k is a multiple of $\lambda$.


From (4.2), f(x) must be a factor of $x^n-1$. Let us now again take the case
m=4 (so that n=15) and enumerate certain particularly relevant factors of
$x^{15}-1$. We revert to ordinary arithmetic, and note that if $x^3=1$ or if $x^5=1$, then
certainly $x^{15}=1$. Hence $x^3-1$ and $x^5-1$ are factors of $x^{15}-1$. Of course $x-1$
is a factor of all of these. We now write the obvious identity

$$x^{15}-1 = (x-1)\,\frac{x^3-1}{x-1}\,\frac{x^5-1}{x-1}\,\frac{(x^{15}-1)(x-1)}{(x^3-1)(x^5-1)}
= Q^{(1)}(x)\,Q^{(3)}(x)\,Q^{(5)}(x)\,Q^{(15)}(x), \tag{4.3}$$

where

$$Q^{(1)}(x)=x-1, \quad Q^{(3)}(x)=x^2+x+1, \quad Q^{(5)}(x)=x^4+x^3+x^2+x+1,$$

and (as can be verified) $Q^{(15)}(x)=x^8-x^7+x^5-x^4+x^3-x+1$.

REMARK: The polynomials $Q^{(j)}(x)$ above are examples of the cyclotomic
polynomials of Gauss, Eisenstein, etc. A root of unity $\rho$ is said to have order
$j \geq 1$ if j is the least exponent for which $\rho^j=1$. As in Lemma 4.1, the roots of
$x^{15}-1$ must all have orders which are factors of 15. The above factorization
(4.3) involves the roots of orders 1, 3, 5, and 15, and these occur precisely in
$Q^{(1)}$, $Q^{(3)}$, $Q^{(5)}$, and $Q^{(15)}$ respectively. The polynomial $Q^{(j)}(x)$ has roots all of
which are of order j.

REMARK: The choice m =4 is not entirely an accident. Note that the cases 
m=3 and m=5 would not serve nearly as well as illustrative examples because 
7 and 31 are prime numbers, and so the analogue of (4.3) would be too simple 
to be revealing. The case m=6 becomes computationally rather long to serve 
as a suitable example. 

Return again to GF(2); there $Q^{(1)}$, $Q^{(3)}$, and $Q^{(5)}$ remain the same in (4.3).
But as can be readily verified,

$$Q^{(15)}(x) = (x^4+x^3+1)(x^4+x+1).$$

Now take f(x) as one of the two quartic factors of $Q^{(15)}$, say

$$f(x) = x^4+x^3+1. \tag{4.4}$$

Then since f(x) is a factor of $Q^{(15)}(x)$, it is a factor of $x^{15}-1$; therefore $x^{15} \equiv 1$
(mod f). By Lemma 4.1, the order of x must be a divisor of 15, so it is 1, 3, 5, or
15. But f(x) is obviously not a divisor of $x-1$ or $x^3-1$ and can easily be shown
not to be a divisor of $x^5-1$. Hence the order of x is not 1, 3, or 5, so it must be
15. (This is in fact a particular instance of an easily proved general property of
the cyclotomic polynomials.) Therefore (4.2) is satisfied, so $1, x, x^2, \ldots, x^{14}$
(mod f) are the 15 cubic polynomials with coefficients in GF(2), none of which
is the null polynomial. Given any $x^j$, $1 \leq j \leq 14$, then $x^{15-j}$ is obviously its inverse
(mod f). Thus these polynomials form a group under multiplication (mod f).
(It is of course the cyclic group.) If the null polynomial is adjoined, then the 16
polynomials obviously form a group under addition. It follows readily that
mod f these 16 cubic polynomials with coefficients in GF(2) form a field of 16
elements. This field is known as GF(16).

REMARK: The polynomial (4.4) is irreducible, that is, it cannot be written as
the product of two lower degree polynomials in the arithmetic of GF(2). Indeed
if it could, we should have

$$f_1(x) f_2(x) = f(x).$$

But $f_1(x)$ is a member of GF(16), hence has an inverse; the same is true of $f_2(x)$.
If we multiply by these, we obtain $1 \equiv 0$ (mod f), which is impossible.

REMARK: In principle the entire above procedure can be carried out to establish
the existence of $GF(2^m)$ for any m, and to specify an appropriate f(x)
of degree m. Actually to treat the general case it is necessary to develop a little
more theory concerning the $Q^{(j)}(x)$ and the irreducible polynomials with coefficients
in GF(2), [1].

A more convenient way to indicate that we are working mod f(x) is to let
$\alpha$ denote a root of f(x). Therefore $\alpha^4+\alpha^3+1=0$ and so any polynomial in $\alpha$ is
automatically equivalent to a binary cubic in $\alpha$. It is the cubic which one computes
working mod f since the only property of $\alpha$ that is used is $f(\alpha)=0$. Thus
the elements of GF(16) may be designated by the binary cubics in $\alpha$.

Summary: Let $\alpha$ be a root of $\alpha^4+\alpha^3+1=0$. (Only this equation, and not the
actual numerical value of $\alpha$, is used.) Then each polynomial in $\alpha$ with coefficients
in GF(2) is equal to a binary cubic polynomial in $\alpha$. There are $2^4=16$
binary cubics. These form the field GF(16). Moreover $\{\alpha^j\}$, for $0 \leq j \leq 14$,
generates the 15 nonnull binary cubics which together with the null polynomial
make up GF(16). A binary cubic may be viewed as a binary 4-vector.
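
As an illustration of the summary, the following sketch tabulates the powers of $\alpha$ by repeated multiplication by x modulo f. It assumes $f(x)=x^4+x^3+1$ (the quartic factor with which the bit patterns displayed in Section 5 agree) and stores a binary cubic as an integer whose bit i is the coefficient of $x^i$; the helper names are hypothetical.

```python
F = 0b11001  # f(x) = x^4 + x^3 + 1; bit i holds the coefficient of x^i
M = 4        # degree of f


def mul_x(p: int) -> int:
    """Multiply a binary cubic by x and reduce the result mod f."""
    p <<= 1
    if p & (1 << M):  # an x^4 term appeared, so subtract (= add) f
        p ^= F
    return p


def powers_of_alpha():
    """Return ([alpha^0, ..., alpha^14], alpha^15) as 4-bit integers."""
    table, p = [], 1
    for _ in range(15):
        table.append(p)
        p = mul_x(p)
    return table, p


def gf16_mul(a: int, b: int) -> int:
    """Product of two binary cubics mod f, by shift-and-add."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a = mul_x(a)
        b >>= 1
    return result
```

Under these assumptions the table contains each of the 15 nonzero 4-bit patterns exactly once and the sixteenth power returns to 1, which is the statement that $\alpha$ has order 15.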


5. A multiple error correcting code. To show how finite fields enter into
coding, let us continue with the case m=4, $2^m=16$. Suppose now it is desired
to correct up to 3 errors in the transmission of the encoded vector C with n=15
entries. Since with n=15 the correction of one error required a 4-rowed matrix,
as displayed above (3.1), it seems plausible to try to correct three errors with a
12-rowed matrix. This operating on R leads to a 12-vector which can be viewed
as three 4-vectors and hence contains sufficient information to determine three
integers between 1 and 15 and thereby locate up to three errors in R. A systematic
way to construct H is with its twelve rows arranged in three blocks of
four rows as follows:

$$H = \begin{pmatrix}
1 & \alpha & \alpha^2 & \alpha^3 & \cdots & \alpha^{14} \\
1 & \alpha^3 & \alpha^6 & \alpha^9 & \cdots & \alpha^{42} \\
1 & \alpha^5 & \alpha^{10} & \alpha^{15} & \cdots & \alpha^{70}
\end{pmatrix}.$$

Each power of $\alpha$ of course represents a binary 4-vector belonging to GF(16).
Why the row blocks $\alpha^{2j}$ and $\alpha^{4j}$, $0 \leq j \leq 14$, can be omitted will soon be apparent.
(Since $\alpha^4=\alpha^3+1$ in GF(2), the first block of four rows can be computed from
$1\ \alpha\ \alpha^2 \cdots \alpha^{14}$ and is

    0 0 0 1 1 1 1 0 1 0 1 1 0 0 1
    0 0 1 0 0 0 1 1 1 1 0 1 0 1 1
    0 1 0 0 0 1 1 1 1 0 1 0 1 1 0
    1 0 0 0 1 1 1 1 0 1 0 1 1 0 0

The second block of four rows of H, namely $1\ \alpha^3\ \alpha^6 \cdots \alpha^{42}$, consists of the first,
fourth, seventh, tenth, and thirteenth columns of the above displayed matrix
repeated three times. The third block of four rows consists of the first, sixth, and
eleventh columns of the above displayed matrix repeated five times.)

The received binary 15-vector R with entries $R_j$, $0 \leq j \leq 14$, has the polynomial
representation

$$R(x) = \sum_{j=0}^{14} R_j x^j.$$

(Here we are not computing R(x) mod f.) The application HR of H to R is the
column vector with 12 entries

$$R(\alpha), \quad R(\alpha^3), \quad R(\alpha^5)$$

which is represented in row form as a triple of 4-vectors, each 4-vector being an
element of GF(16).

As in (3.2), let R=C+E. If E=0, then it is desirable that HR=0 and therefore
we should have HC=0. Let

$$C(x) = \sum_{j=0}^{14} C_j x^j.$$

In terms of $\alpha$, HC is the binary 12-vector $C(\alpha)$, $C(\alpha^3)$, $C(\alpha^5)$. For this 12-vector
to be zero, the polynomial C(x) of degree 14 should vanish for $x=\alpha$, $\alpha^3$, and $\alpha^5$.
To make C(x) vanish for $x=\alpha$, we can require that f(x) be a factor of C(x) in
GF(2). It will be convenient now to designate f(x) by $M_1(x)$. To find a polynomial,
say $M_3(x)$, which has $\alpha^3$ as a root, note that $\alpha^3$ has order 5 and hence
will be a root of $Q^{(5)}(x)$ of (4.3), where

$$Q^{(5)}(x) = 1+x+x^2+x^3+x^4.$$

Denote $Q^{(5)}(x)$ by $M_3(x)$. Similarly $\alpha^5$ has order 3 and hence is a root of
$Q^{(3)}(x)=1+x+x^2$, which we shall denote by $M_5(x)$. (The polynomials $M_1$, $M_3$, and $M_5$
are all minimum polynomials in the sense that no polynomials of lower degree
with coefficients in GF(2) have $\alpha$, $\alpha^3$, and $\alpha^5$ respectively as roots.) Let

$$g(x) = M_1(x) M_3(x) M_5(x).$$

Then the degree of g(x) is 10, since $M_1$ and $M_3$ are of degree 4 and $M_5$ of degree
2. Moreover g(x) vanishes for $x=\alpha$, $\alpha^3$, and $\alpha^5$ because $M_1(\alpha)$, $M_3(\alpha^3)$, and
$M_5(\alpha^5)$ are each zero. We now require that g(x) be a factor of C(x) to assure
that C(x) has $\alpha$, $\alpha^3$, and $\alpha^5$ as roots. Recall that C is a vector with 15 entries.
Let $C_{10}, C_{11}, C_{12}, C_{13}, C_{14}$ be a message vector of k=5 binary digits. Choose
$\sum_{j=0}^{9} C_j x^j$ as the negative of the remainder of the quotient

$$\frac{C_{14}x^{14} + C_{13}x^{13} + \cdots + C_{10}x^{10}}{g(x)}$$

so C(x) will indeed have g(x) as a factor and hence $\alpha$, $\alpha^3$, and $\alpha^5$ as roots. The
above arithmetic is of course in GF(2). This is the encoding process with
n=15, k=5, and r=10. The binary polynomial g(x) is known as the generator
polynomial of the code. Indeed a code-vector C is characterized by the fact that
C(x) is divisible by g(x).
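
The encoding rule can be tried out directly. The sketch below assumes $M_1(x)=x^4+x^3+1$, $M_3(x)=x^4+x^3+x^2+x+1$, and $M_5(x)=x^2+x+1$, stores a binary polynomial as an integer whose bit j is the coefficient of $x^j$, and uses helper names of our own choosing (`poly_mul`, `poly_mod`, `encode`).

```python
def poly_mul(a: int, b: int) -> int:
    """Product of two binary polynomials (carry-less multiplication)."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        b >>= 1
    return result


def poly_mod(a: int, m: int) -> int:
    """Remainder of the binary polynomial a on division by m (m != 0)."""
    while a and a.bit_length() >= m.bit_length():
        a ^= m << (a.bit_length() - m.bit_length())
    return a


M1 = 0b11001  # x^4 + x^3 + 1
M3 = 0b11111  # x^4 + x^3 + x^2 + x + 1
M5 = 0b111    # x^2 + x + 1
G = poly_mul(poly_mul(M1, M3), M5)  # generator polynomial, degree 10


def encode(message):
    """Encode [C_10, C_11, C_12, C_13, C_14] into the polynomial C(x)."""
    assert len(message) == 5
    top = 0
    for i, bit in enumerate(message):
        top |= bit << (10 + i)
    remainder = poly_mod(top, G)
    # In GF(2) the negative of the remainder is the remainder itself.
    return top ^ remainder


def coefficients(c: int):
    """List [C_0, ..., C_14] of the coefficients of C(x)."""
    return [(c >> j) & 1 for j in range(15)]
```

With these choices `G` works out to $x^{10}+x^9+x^8+x^6+x^5+x^2+1$, and every encoded word leaves remainder zero on division by `G` while carrying the message unchanged in its five highest coefficients.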

It will now be shown in principle at least that the 12-vector HR can be used
to correct up to a maximum of 3 errors in transmission. Since HC=0, therefore
HR=HE, and the 12-vector HE regarded as a triple of 4-vectors determines
$E(\alpha)$, $E(\alpha^3)$, and $E(\alpha^5)$. We recall that the entries of E are 1 where an error
occurs and 0 otherwise. Suppose 3 errors occur, say at the entries $i_1$, $i_2$, and $i_3$
of E. Then

$$E(\alpha) = \alpha^{i_1} + \alpha^{i_2} + \alpha^{i_3},$$
$$E(\alpha^3) = \alpha^{3i_1} + \alpha^{3i_2} + \alpha^{3i_3},$$
$$E(\alpha^5) = \alpha^{5i_1} + \alpha^{5i_2} + \alpha^{5i_3}.$$

It will be convenient to note that with the $a_j$ in GF(2),

$$\Bigl(\sum a_j \alpha^j\Bigr)^2 = \sum a_j^2 \alpha^{2j} = \sum a_j \alpha^{2j}, \tag{5.1}$$

because all cross products have 2=1+1 as a factor. Suppose now that 3 errors
occur at the positions $i_4$, $i_5$, and $i_6$, all distinct from $i_1$, $i_2$, and $i_3$ above, and
suppose these errors lead to the same values for $E(\alpha)$, $E(\alpha^3)$, and $E(\alpha^5)$ as do
$i_1$, $i_2$, and $i_3$. Then this leads to

$$\alpha^{ji_1} + \alpha^{ji_2} + \alpha^{ji_3} = \alpha^{ji_4} + \alpha^{ji_5} + \alpha^{ji_6}, \quad j = 1, 3, 5,$$

or

$$\sum_{d=1}^{6} \alpha^{ji_d} = 0, \quad j = 1, 3, 5. \tag{5.2}$$

But now applying (5.1) to the case j=1, (5.2) holds for j=2. Applying (5.1) to
the case j=2 and then to j=3 gives (5.2) for j=4 and j=6. (That is why we
omitted the even powers of $\alpha$ from the rows of H.) Thus

$$\sum_{d=1}^{6} \alpha^{ji_d} = 0, \quad j = 1, 2, 3, 4, 5, 6. \tag{5.3}$$

The determinant of the above system is a Vandermonde determinant equal to

$$\alpha^{i_1+i_2+\cdots+i_6} \prod_{6 \geq d > e \geq 1} \bigl(\alpha^{i_d} - \alpha^{i_e}\bigr). \tag{5.4}$$

Each factor of (5.4) is

$$\alpha^{i_d} - \alpha^{i_e} = \alpha^{i_e}\bigl(\alpha^{i_d - i_e} - 1\bigr).$$

Since $0 \leq i_d \leq 14$, it follows that $0 < |i_d - i_e| \leq 14$. Hence, since the order of $\alpha$ is
15, no factor of (5.4) is zero so the determinant is not zero. Thus the homogeneous
system (5.3) is impossible. If the other cases (such as two sets of three
errors but with some in common, or where one set or both sets have less than
three errors) are considered, there are now fewer columns in the analogue to
(5.3); hence some rows may be discarded leading again to a Vandermonde
situation. Thus if there are at most three errors, then HR determines their
locations uniquely.
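
The uniqueness argument can also be confirmed by exhaustive enumeration, since there are only 1+15+105+455=576 error patterns with at most three errors. The sketch below is a brute-force check rather than a decoding procedure; it assumes $\alpha^4=\alpha^3+1$, represents elements of GF(16) as 4-bit integers, and uses illustrative function names.

```python
from itertools import combinations

F = 0b11001  # f(x) = x^4 + x^3 + 1, so alpha^4 = alpha^3 + 1


def alpha_powers():
    """Return [alpha^0, ..., alpha^14] as 4-bit integers."""
    table, p = [], 1
    for _ in range(15):
        table.append(p)
        p <<= 1
        if p & 0b10000:
            p ^= F
    return table


A = alpha_powers()


def syndrome(positions):
    """Return (E(alpha), E(alpha^3), E(alpha^5)) for errors at the given positions."""
    s1 = s3 = s5 = 0
    for i in positions:
        s1 ^= A[i % 15]
        s3 ^= A[(3 * i) % 15]
        s5 ^= A[(5 * i) % 15]
    return (s1, s3, s5)


def syndrome_table(max_errors=3):
    """Map each syndrome to the list of error patterns that produce it."""
    table = {}
    for weight in range(max_errors + 1):
        for positions in combinations(range(15), weight):
            table.setdefault(syndrome(positions), []).append(positions)
    return table
```

If every list in `syndrome_table()` has length one, then no two patterns of at most three errors share a value of HR, in agreement with the Vandermonde argument. Such a table could in principle serve as a lookup decoder for this small code, though the text points to more economical procedures.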

Of course for successful decoding, the above uniqueness result, while re- 
assuring, must be replaced with a reasonably simple constructive procedure for 
determining which entries of the vector E, if any, are 1. It is the simple orderly 
structure of H in terms of powers of a that makes the mechanization of such a 
decoding procedure feasible [1, Chap. 7]. 

The BCH codes, based on conceptions from pure mathematics, are not 
unique in using unexpected parts of pure mathematics for coding. Among the 
codes there are for example euclidean-geometry codes [1, p. 375], projective- 
geometry codes [1, p. 376], tensor product codes [1, p. 346], and quadratic 
residue codes [1, p. 354]. 


The preparation of this paper was supported in part by the Office of Naval Research and by 
the National Science Foundation NSF GP-13778.


POSITIVE DEFINITE MATRICES
C. R. JOHNSON, Northwestern University 


Historically, positive definite matrices arise quite naturally in the study of 
n-ary quadratic forms and assume both theoretic and computational importance 
in a wide variety of applications. For example, they are employed in certain 
optimization algorithms in mathematical programming, in testing for the strict 
convexity of scalar-valued vector functions (here, positive definiteness of the 
Hessian provides a sufficiency check), and are of basic theoretic importance in 
construction of the various linear regression models. These are only a few of the 
specific applications which may be added to the abstract interest of such 
matrices. We now concentrate specifically on the properties of the matrices 
themselves. 


DEFINITION: An $n \times n$ real matrix A, where n is a positive integer, is called
positive definite if $(x, Ax) = x^T A x > 0$ for all nonzero column vectors x in Euclidean
n-dimensional space.

We shall designate the set of all such matrices (which forms a subset of all $n \times n$
matrices) as $\Pi_n$.


Classically, it is customary to require also symmetry in the definition of
positive definite, and we shall often concentrate on that proper subset of $\Pi_n$
which consists of only the symmetric members of $\Pi_n$. We shall designate this
subset as $\Sigma_n$, but at times we shall allow ourselves to consider the more general
case.

It will become clear that the classical concentration on $\Sigma_n$ is convenient since
it is much richer in algebraic properties, but also, from the standpoint of testing
arbitrary matrices, it suffices to consider the theory of $\Sigma_n$. As is well known, any
square matrix A can be written as the sum of a symmetric and a skew-symmetric
matrix, A=B+C, where $B=(A+A^T)/2$ and $C=(A-A^T)/2$. We call
B the symmetric and C the skew-symmetric part of A, in an unambiguous manner.

(1) REMARK. An $n \times n$ matrix A is positive definite if and only if the symmetric
part of A is positive definite. (Actually, we show that any quadratic form
is equivalent to a symmetric quadratic form.)

Proof. If C is the skew-symmetric part of A, then $(x, Cx) = (C^T x, x) = (-Cx, x)
= -(Cx, x)$ implies (x, Cx)=0. Therefore, (x, Ax) = (x, (B+C)x) = (x, Bx),
where B is the symmetric part of A. Thus, in a certain sense (e.g., that of testing
for positive definiteness), it suffices to study the subset $\Sigma_n$. The possibility of
generalizing some of the results to $\Pi_n$ will also be discussed.
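
A small numerical illustration of Remark (1) follows. It is only a sketch: the matrix A is an arbitrary example with integer entries chosen by us, and the function names are illustrative. Exact rational arithmetic is used so that the equalities hold without rounding.

```python
from fractions import Fraction


def symmetric_part(A):
    """Return B = (A + A^T)/2 for a square matrix with integer entries."""
    n = len(A)
    return [[Fraction(A[i][j] + A[j][i], 2) for j in range(n)] for i in range(n)]


def skew_part(A):
    """Return C = (A - A^T)/2 for a square matrix with integer entries."""
    n = len(A)
    return [[Fraction(A[i][j] - A[j][i], 2) for j in range(n)] for i in range(n)]


def quad_form(A, x):
    """Return (x, Ax) = x^T A x."""
    n = len(A)
    return sum(x[i] * A[i][j] * x[j] for i in range(n) for j in range(n))


A = [[2, 3], [-1, 2]]   # not symmetric
B = symmetric_part(A)   # [[2, 1], [1, 2]]
C = skew_part(A)        # [[0, 2], [-2, 0]]
```

For any vector x, `quad_form(C, x)` is zero and `quad_form(A, x)` equals `quad_form(B, x)`, so testing A for positive definiteness reduces to testing B.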

We first characterize positive definiteness ($\Sigma_n$) in terms of a basic matrix
invariant.


This is the winning essay in the Contest for Undergraduates announced in the January 1969 
MONTHLY. Mr. Johnson was a student at Northwestern University then, and now he is a graduate 
student at Cal Tech. He is a member of the MAA so his prize was changed from membership and a 
book to two books: William Feller, Introduction to Probability Theory and its Applications, 2 vols., 
John Wiley & Sons, 1968. The prize was donated by the publisher. Editor.


(2) THEOREM. If A is $n \times n$ symmetric, then $A \in \Sigma_n$ if and only if all eigenvalues
of A are positive.


Proof. We first note that since A is symmetric, A may be diagonalized by
some orthogonal matrix B, $B^T A B = D$ where $B^T = B^{-1}$ and D is diagonal with the
necessarily real eigenvalues of A (Principal Axis Theorem, see e.g., Zelinsky,
[3]). Let the eigenvalues of A be $\lambda_1, \ldots, \lambda_n$.

If all $\lambda_i > 0$, let $y = B^T x$ so that x = By. Then $x^T A x = (By)^T A (By) = y^T (B^T A B) y
= y^T D y = \sum_i \lambda_i y_i^2 > 0$ if $x \neq 0$.

If A is positive definite, let $v_i$ be an eigenvector corresponding to $\lambda_i$ normalized
so that $(v_i, v_i)=1$. Then $0 < v_i^T A v_i = (\lambda_i v_i, v_i) = \lambda_i$ and all $\lambda_i > 0$.


(3) COROLLARY. If $A \in \Sigma_n$, then det(A)>0.


Proof: $\det(A) = \lambda_1 \cdots \lambda_n > 0$, by (2).

To facilitate the following comments, we let $S \subset \{1, 2, \ldots, n\}$, properly,
be an index set. Then, let $A_S$, where A is an $n \times n$ matrix, be the matrix obtained
from A by eliminating the rows and columns indicated by S, thus reducing the
size of A.


(4) THEOREM. If $A \in \Pi_n$, then $A_S$ is positive definite for any S. In particular,
the diagonal elements of A are >0.


Proof: Let $x \neq 0$ be an n-vector with zeros as the components indicated by S
and arbitrary components elsewhere. If $x_S$ is the vector obtained from x by
eliminating the (zero) components indicated by S, then $x_S^T A_S x_S = x^T A x > 0$.
Since $x_S \neq 0$ is arbitrary, $A_S$ is positive definite.

As a special case, we may let $S = \{1, \ldots, i-1, i+1, \ldots, n\}$ to show that
the ith diagonal element of A is >0.

We are now in a position to characterize positive definiteness in another
manner which may be viewed as a test, the familiar determinant criteria. For this
we employ the following abbreviations. If S is of the form $\{i+1, i+2, \ldots, n\}$,
we denote $A_S$ as $A_i$, i.e., $A_i$ is the $i \times i$ matrix formed from the “intersection”
of the first i rows and columns of A.


(5) THEOREM. If A is $n \times n$ symmetric, then $A \in \Sigma_n$ if and only if $\det(A_i) > 0$
for $i = 1, \ldots, n$.


Proof. If $A \in \Sigma_n$, then $A_i \in \Sigma_i$ by (4) and because $A_i$ is symmetric. Therefore
$\det(A_i) > 0$ by (3).

Unfortunately, there are no thoroughly pleasing proofs of the converse
proposition, but the Inclusion Principle for eigenvalues (Franklin, [1]) will aid
in the proof. We note that if $\alpha_1 \geq \cdots \geq \alpha_n$ are the n eigenvalues of A, and
$\beta_1 \geq \cdots \geq \beta_{n-1}$ are the $n-1$ eigenvalues of $A_{n-1}$, then
$\alpha_1 \geq \beta_1 \geq \alpha_2 \geq \beta_2 \geq \cdots \geq \alpha_{n-1} \geq \beta_{n-1} \geq \alpha_n$.
Now, if $\det(A_i) > 0$ for $i = 1, \ldots, n$, we may inductively
show $A \in \Sigma_n$. Since $\det(A_1) > 0$, $A_1 \in \Sigma_1$. If $A_k \in \Sigma_k$ for k<n, all eigenvalues of
$A_k$ are >0, and thus by the Inclusion Principle, all eigenvalues of $A_{k+1}$ are
greater than 0, except perhaps the smallest. But, let $\alpha_1 \geq \cdots \geq \alpha_{k+1}$ be the
eigenvalues of $A_{k+1}$; then $\alpha_{k+1} = \det(A_{k+1})/(\alpha_1 \cdots \alpha_k)$ as in the proof of (3), and
$\alpha_{k+1}$, the quotient of two positive reals, is positive. Thus, $A_{k+1} \in \Sigma_{k+1}$, and
$A = A_n \in \Sigma_n$ by induction.
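
Theorem (5) translates into a direct test. The following is a minimal sketch, assuming a symmetric matrix given as a list of rows with integer or rational entries; the determinant is computed by Gaussian elimination in exact arithmetic, and the function names are our own.

```python
from fractions import Fraction


def det(M):
    """Determinant by Gaussian elimination in exact rational arithmetic."""
    A = [[Fraction(v) for v in row] for row in M]
    n = len(A)
    d = Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if A[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            A[c], A[pivot] = A[pivot], A[c]
            d = -d  # a row interchange changes the sign
        d *= A[c][c]
        for r in range(c + 1, n):
            factor = A[r][c] / A[c][c]
            for k in range(c, n):
                A[r][k] -= factor * A[c][k]
    return d


def leading_minors(A):
    """Return [det(A_1), ..., det(A_n)] for the leading submatrices A_i."""
    return [det([row[:i] for row in A[:i]]) for i in range(1, len(A) + 1)]


def in_sigma_n(A):
    """Determinant criterion of (5): symmetric with all det(A_i) > 0."""
    n = len(A)
    symmetric = all(A[i][j] == A[j][i] for i in range(n) for j in range(n))
    return symmetric and all(m > 0 for m in leading_minors(A))
```

For instance, the matrix with rows (2, -1, 0), (-1, 2, -1), (0, -1, 2) has leading minors 2, 3, 4 and so passes the test, while the matrix with rows (1, 2), (2, 1) has minors 1, -3 and fails.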

We choose to state here without full proof a result which applies in a much
more general setting than $\Pi_n$, but which is important in applications of positive
definite matrices.


(6) THEOREM. If $A \in \Pi_n$, then A has a unique factorization A=LR into triangular
factors where L is lower triangular with 1's on the diagonal and R is upper
triangular with nonzero diagonal elements.


Comments. This is a consequence of the more general theorem (e.g., see [1],
p. 204) that A has such a factorization if and only if $\det(A_i) \neq 0$ for $i = 1, \ldots, n$.
Since $A \in \Pi_n$, each $A_i \in \Pi_i$ by (4). Thus $\det(A_i) \neq 0$, for otherwise $A_i$ would be
singular and not positive definite. Since L and R are obviously invertible (with
easily computed inverses), this theorem is helpful, for instance, in exhibiting
solutions x to Ax=y.
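
To make the last comment concrete, here is a hedged sketch of the factorization A=LR by elimination without row interchanges, followed by the two triangular solves that yield x from Ax=y. The routine names `lr_factor` and `solve` are hypothetical, and exact rational arithmetic is used for clarity rather than speed.

```python
from fractions import Fraction


def lr_factor(A):
    """Return (L, R) with A = L R, L unit lower triangular, R upper triangular."""
    n = len(A)
    L = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    R = [[Fraction(v) for v in row] for row in A]
    for c in range(n):
        if R[c][c] == 0:
            raise ValueError("a leading principal minor vanishes")
        for r in range(c + 1, n):
            L[r][c] = R[r][c] / R[c][c]
            for k in range(c, n):
                R[r][k] -= L[r][c] * R[c][k]
    return L, R


def solve(A, y):
    """Solve A x = y by forward substitution (L z = y), then back substitution (R x = z)."""
    L, R = lr_factor(A)
    n = len(A)
    z = [Fraction(0)] * n
    for i in range(n):
        z[i] = Fraction(y[i]) - sum(L[i][k] * z[k] for k in range(i))
    x = [Fraction(0)] * n
    for i in reversed(range(n)):
        x[i] = (z[i] - sum(R[i][k] * x[k] for k in range(i + 1, n))) / R[i][i]
    return x
```

As a usage example, for the matrix with rows (4, 2), (2, 3) the factors are L with rows (1, 0), (1/2, 1) and R with rows (4, 2), (0, 2), and `solve` applied to the right-hand side (2, 1) returns (1/2, 0).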

We now have three rather concrete characterizations of $\Sigma_n$. With little difficulty
we may add a fourth which is not quite so concrete.


(7) LEMMA. If $A \in \Sigma_n$, then there exists an invertible matrix P such that
$P^T A P = I$, the identity matrix. Also $PP^T = A^{-1}$, which exists.


Proof: We know there exists an orthogonal B, such that $B^T A B = C$, where C
is diagonal with the eigenvalues of A. If we let $D = +C^{-1/2}$, then D is well defined.
Now let P=BD. Then, P is invertible since B and D are, and
$P^T A P = D^T B^T A B D = DCD = CD^2 = CC^{-1} = I$. Also, $P^T A P P^T = P^T$ and $A P P^T = I$ since
$P^T$ is invertible. Therefore, $A^{-1} = PP^T$.

As an aside, (7) points out that any symmetric positive definite quadratic
form $x^T A x$ (in fact, any positive definite quadratic form by (1)), is equal to
(y, y) under a suitable change of coordinates.


(8) THEOREM. $A \in \Sigma_n$ if and only if $\exists\, Q$ invertible such that $A = Q^T Q$.


Proof. If $A = Q^T Q$ and y=Qx, then $x^T A x = x^T Q^T Q x = y^T y = (y, y) > 0$ if $x \neq 0$
since Q is assumed invertible.

If $A \in \Sigma_n$, then $A = (P^T)^{-1} P^{-1}$, with P invertible by (7). Let $Q = P^{-1}$; then
$Q^T = (P^{-1})^T = (P^T)^{-1}$, so that $A = Q^T Q$.
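
The proof obtains Q from an orthogonal diagonalization. A different and widely used construction, the Cholesky factorization, produces another such Q that happens to be upper triangular. The sketch below is that substitute technique, not the construction used in the proof; it works in floating point, and the function names are our own.

```python
import math


def cholesky_upper(A):
    """Return upper triangular Q with A = Q^T Q for symmetric positive definite A."""
    n = len(A)
    Q = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            # Entry (i, j) of Q^T Q is the sum over k <= i of Q[k][i] * Q[k][j].
            s = A[i][j] - sum(Q[k][i] * Q[k][j] for k in range(i))
            if i == j:
                if s <= 0:
                    raise ValueError("matrix is not positive definite")
                Q[i][i] = math.sqrt(s)
            else:
                Q[i][j] = s / Q[i][i]
    return Q


def gram(Q):
    """Return Q^T Q."""
    n = len(Q)
    return [[sum(Q[k][i] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]
```

For the matrix with rows (1, 3), (3, 10) this gives Q with rows (1, 3), (0, 1). On a symmetric matrix that is not positive definite the routine raises an error, so it doubles as a test for membership in $\Sigma_n$, up to rounding.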

We now mention a few less fundamental but still important results which 
deal with the positive definiteness of some functions of positive definite matrices. 


(9) THEOREM. The matrix A belongs to $\Sigma_n$ if and only if $B^T A B$ belongs to $\Sigma_m$
for each $n \times m$ matrix B such that By=0 implies y=0.


Proof. If the condition is satisfied, let m=n and choose B=I. Then
$I^T A I = A \in \Sigma_n$.

Conversely, let $A \in \Sigma_n$ and suppose B satisfies the conditions of the theorem.
Then $y^T (B^T A B) y = (By)^T A (By) > 0$ unless By=0, that is, y=0. Hence, $B^T A B$
is symmetric and in $\Sigma_m$.


(10) THEOREM. If $A \in \Sigma_n$, then

(a) $cA \in \Sigma_n$ for c>0 any real scalar;

(b) $(A+B) \in \Sigma_n$ if $B \in \Sigma_n$;

(c) $A^m \in \Sigma_n$ for m any integer;

(d) an $A^{1/p}$ exists $\in \Sigma_n$ for p a positive integer (by $A^{1/p}$ we mean a matrix B
such that $B^p = A$);

(e) an $A^r$ exists $\in \Sigma_n$ for r any rational number.


Proof. (a) cA is symmetric, and $x^T (cA) x = c(x^T A x) > 0$ if c>0 and $x \neq 0$.

(b) $x^T (A+B) x = x^T (Ax + Bx) = x^T A x + x^T B x > 0$ if $x \neq 0$. Also,
$A+B = (A+B)^T$ is symmetric.

(c) $(A^m)^T = (A^T)^m = A^m$ is symmetric. If m=0, $A^m = I \in \Sigma_n$. If m>0 and
$C^T A C = D$ is an orthogonal diagonalization of A, then $A = CDC^T$ and
$A^m = (CDC^T)(CDC^T) \cdots (CDC^T) = CD^m C^T$. Thus $A^m$ is diagonalizable to $D^m$, and,
therefore, has all eigenvalues >0. Therefore, $A^m \in \Sigma_n$. If $m=-1$, then $A^m$ exists
in $\Sigma_n$ by (7) and (8). If $m<-1$, then $-m>0$, $A^m = (A^{-1})^{-m}$, and $A^m \in \Sigma_n$.

(d) As before, we may write $A = CDC^T$, where C is orthogonal and D diagonal
with positive diagonal elements. Define $A^{1/p} = CD^{1/p}C^T$, where $D^{1/p}$ is diagonal
with diagonal elements the positive real pth roots of the diagonal elements of D.
Then $(A^{1/p})^p = A$, $A^{1/p}$ is symmetric and has n positive real eigenvalues and is
thus positive definite.

(e) Follows from (c) and (d).

The theorem, (10), might give us hope that a nontrivial algebraic structure
might be imposed on $\Sigma_n$ (or, perhaps, $\Pi_n$) so that it could be characterized as
one of the more familiar algebraic objects. Unfortunately, however, this does not
seem to be the case.

We may not employ an additive group structure since inverses do not exist
in $\Sigma_n$. If A is positive definite, not only is $-A$ not positive definite, but it has
essentially the opposite properties of A (such a matrix is called negative definite).
Also, there is no additive identity in $\Sigma_n$ or $\Pi_n$. Of course, both $\Sigma_n$ and $\Pi_n$ are
closed and associative under addition.

The situation is interestingly different but only a little more well behaved
if we attempt a multiplicative group structure on $\Sigma_n$. We have the identity
matrix $I \in \Sigma_n$ and inverses exist in $\Sigma_n$ by (10). Also, the multiplication is associative.
But $\Sigma_n$ is not closed under matrix multiplication. Not only is the product
of two members of $\Sigma_n$ not symmetric if they do not commute, but it may not
even be positive definite as the following example in $\Sigma_2$ shows. Let


$$A = \begin{pmatrix} 1 & 3 \\ 3 & 10 \end{pmatrix} \quad \text{and} \quad B = \begin{pmatrix} 1 & -3 \\ -3 & 10 \end{pmatrix}.$$

Both A and B are in $\Sigma_2$ by (5). However,

$$AB = \begin{pmatrix} -8 & 27 \\ -27 & 91 \end{pmatrix} \notin \Pi_2$$

by (4). In fact, AB is neither positive nor negative definite.
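
The example is easily checked by machine. The short sketch below recomputes the product of the matrices with rows (1, 3), (3, 10) and (1, -3), (-3, 10) and evaluates the quadratic form of AB on the two coordinate vectors; the helper names are illustrative.

```python
def matmul(X, Y):
    """Ordinary matrix product of two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def quad_form(M, x):
    """Return x^T M x."""
    n = len(M)
    return sum(x[i] * M[i][j] * x[j] for i in range(n) for j in range(n))


A = [[1, 3], [3, 10]]
B = [[1, -3], [-3, 10]]
AB = matmul(A, B)                  # [[-8, 27], [-27, 91]]
negative = quad_form(AB, (1, 0))   # -8, the first diagonal element
positive = quad_form(AB, (0, 1))   # 91, the second diagonal element
```

Since the quadratic form takes both a negative and a positive value, AB is neither positive nor negative definite, and it is visibly not symmetric.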

About all we can say, then, is that $\Pi_n$ forms a semigroup under matrix addition.
Since the other common algebraic structures are fundamentally more
complex than the group, they are also precluded.

Some generalizations may be made without great difficulty on the proofs
exhibited thus far. For instance, in the “only if” part of (2) our hypothesis is
unnecessarily strong. If A is $n \times n$, it need only have n real eigenvalues. The
proof is easily seen to depend only on the positive definiteness of A and the
existence of n real eigenvalues with associated eigenvectors. Thus we may
validly formulate the amending statement:


(2.1) THEOREM. If A is $n \times n$ and has n real eigenvalues, then $A \in \Pi_n$ implies
all the eigenvalues of A are positive.


An entirely analogous statement may be made about the corollary, (3). 

We may not, however, drop the requirement that A is real diagonalizable 
from (2.1) since a matrix may be positive definite and have complex charac- 
teristic roots as the following example shows. Let 


$$A = \begin{pmatrix} 2 & 1 \\ -1 & 2 \end{pmatrix}.$$


Then $A \in \Pi_2$ by (1) since the symmetric part of A is

$$\begin{pmatrix} 2 & 0 \\ 0 & 2 \end{pmatrix}.$$

However, the characteristic polynomial of A is $\lambda^2 - 4\lambda + 5$ with roots $2 \pm i$. It
should be clear, though, that under no circumstances may a member of $\Pi_n$ have
a nonpositive real eigenvalue; for then we could choose a nonzero eigenvector
to violate the definition of $\Pi_n$. Thus, to generalize slightly on (7), a member of
$\Pi_n$ must always have nonzero determinant and be invertible.

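In (10) parts (a) and (b) may clearly be generalized to all of $\Pi_n$, but part (c)
is not always valid in $\Pi_n$. For instance, if

$$A = \begin{pmatrix} 1 & -4 \\ 2 & 10 \end{pmatrix},$$

$A \in \Pi_2$ by (1) and (5). But

$$A^2 = \begin{pmatrix} -7 & -44 \\ 22 & 92 \end{pmatrix},$$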


which is not positive definite by (4). Also, this example should caution us to note
that a matrix may have all its eigenvalues real and positive but not be $\in$ any
$\Pi_n$. $A^2$ has eigenvalues {81, 4}.

If we allow complex-entried matrices and then define $\Sigma_n^*$ to be the Hermitian
matrices which are positive definite, it is clear also that our results concerning
$\Sigma_n$ may be modified to remain valid in $\Sigma_n^*$. In addition we should note that our
notion of $\Pi_n$ has no consistent analog in the complex-entried matrices. For instance,
a matrix $A \in \Pi_n$ with respect to real vectors may not even have $x^*Ax$
real when complex vectors are allowed (* means “conjugate transpose”).

We have thus far formulated a theory of positive definite matrices. It is 
clear that we may analogously define another (disjoint) set of matrices by re- 
placing “>” with “<” in the definition of positive definite. Such matrices are 
usually termed negative definite, and, suggestively, we might designate this set 
as $-\Pi_n$ since $A \in \Pi_n$ if and only if $-A \in -\Pi_n$. This, of course, is the key to the
development of a theory of negative definite matrices which would proceed 
analogously (allowing for the peculiarities of negative numbers). 

Positive (negative) semi-definite matrices may be defined by allowing the 
possibility of equality in the definition of $\Pi_n$ (or $-\Pi_n$). Their theory proceeds
similarly, but modified by allowance for 0 eigenvalues. 

In the positive definite case, we have succeeded in establishing four charac- 
terizations through theorems: by eigenvalues ((2)); by determinants ((5)); by 
triangular decomposition ((6)); and by $Q^TQ$ decomposition ((8)). This, plus the
additional properties commented on, is largely sufficient to both mathematically 
describe and usefully apply positive definite matrices. 

LOGIC VERSUS PEDAGOGY
MORRIS KLINE, New York University 


Humble thyself, impotent reason. 
PASCAL 


1. The Current Emphasis on Logical Structure. There is no question that 
mathematics is distinguished from all other bodies of human knowledge in that 
it insists on deductive proof from explicitly stated axioms as the indispensable 
condition for the acceptance of its conclusions. This requirement has indeed 
conferred power on mathematics, for deductive proof has strengthened the 
structure. Moreover, the organization of mathematics into deductive systems
has given coherence to its vast contents, and the axiomatization inherent in this 
type of organization has made clear precisely what is presupposed and hence 
where such systems are applicable. It has also suggested abstractions which 
embrace several structures as, for example, the theory of groups. 

The deductive organization of mathematics is now a popular mode of presen- 
tation in all instruction from the fourth or fifth grade up. Deductive proof is the 
be-all and end-all of teaching. The enthusiasm for this mode of presentation is 
somewhat understandable because it was only about 75 years ago, after over 
2500 years of struggle, that deductive organization and, through it, rigor were 
achieved. Mathematicians may be now giving vent in their textbooks to the 
satisfaction that Poincaré expressed in 1900 at the International Congress of 
Mathematicians when he gloated [11], “One may say today that absolute rigor 
has been attained.” As an expression of enthusiasm this emphasis on deduction 
and rigor might indeed be excused. However, when authors are challenged as 
to the pedagogical wisdom of such presentations they now rejoin that this is the 
way to understand mathematics. In other words, the deductive approach is 
being defended as the pedagogical approach. Deductive organization and proof 
are advocated as the answer to all of the difficulties which students have had in 
learning mathematics and the open sesame to the subject. No longer will the 
memorization of techniques be necessary. Mathematics will now be accessible 
and understandable to almost all students. 


2. The Intuitive Approach. Opposed to the deductive approach is the intui- 
tive approach. Admittedly the nature of intuition is somewhat vague. It denotes 
some direct grasp of the idea, whether it be a concept or proof. There may be a 
special intuitive faculty distinct from the logical faculty that criticizes and 
reasons. Whether or not there is an intuitive faculty there are specific and ex- 
plicit aids to the intuition which enable it to function. Primarily it seems to rely 
upon the senses, for, as Aristotle first put it, there is nothing in the intellect that 
was not first in the senses, except, Leibniz added, the intellect itself. Hence one 
of the useful devices is a picture. Consider exhibiting several triangles to incul- 
cate the idea as opposed to the definition: the union of three non-collinear points 
and the line segments joining them. How much more readily is the notion of 
continuity grasped when presented as a curve which can be drawn with an
uninterrupted motion of a pencil rather than by the $\varepsilon$-$\delta$ definition.


He was a Fulbright Lecturer and Guggenheim Fellow, and has spent several leaves in Stanford 
and at the Technische Hochschule in Aachen. 

He has published extensively in topology (his original subject), electromagnetic theory, 
history and philosophy of mathematics, and pedagogy. His books include Mathematics in Western 
Culture (Oxford 1953), Mathematics and the Physical World (Crowell 1959), Mathematics, A Cultural 
Approach (Addison-Wesley 1962), Electromagnetic Theory and Geometric Optics (with I. W. Kay, 
Wiley 1965), Calculus: An Intuitive and Physical Approach, 2 vols. (Wiley 1967), and Mathematics 
for Liberal Arts (Addison-Wesley 1967). He has edited several collections, and he holds five patents 
on antenna systems. Editor. 
The intuition may be appealed to through physical arguments. The derivative
as a velocity at an instant gives meaning to the concept, and the argument
that a ball thrown up into the air must have zero velocity at its highest point 
suggests that the derivative must be zero at a maximum of a function. 

We shall include in the intuitive approach what are often called heuristic 
arguments. Through experience with actual objects a child can learn that 
$3 + 4 = 4 + 3$. The generalization that $a + b = b + a$ is heuristic. Likewise the fact
that if $y = x^n$, $dy/dx = nx^{n-1}$ is readily inferred from the special cases $n = 2$ and
$n = 3$. Reasoning by analogy and even probabilistic arguments are heuristic.

Intuition is not static. Just as one’s intuition about what to expect in human 
behavior improves with experience so does the mathematical intuition. The 
latter may indeed suggest, as it did to Leibniz, that the derivative of a product 
of two functions is the product of the derivatives. The conclusion should be 
tested, another heuristic measure, and of course will be found to be false. Deeper 
analysis will show that what holds for limits of functions does not hold for deriva- 
tives, and the intuition will be sharpened by this experience. 
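
One way to carry out such a test is to compare the conjectured rule with a direct computation. This is offered only as an illustrative sketch: the choice $f(x) = g(x) = x$ is an assumption made for this example and is not part of the original argument.

```latex
\begin{align*}
f(x) &= x, \qquad g(x) = x, \qquad (fg)(x) = x^2, \\
\frac{d}{dx}\,(fg)(x) &= 2x, \\
f'(x)\, g'(x) &= 1 \cdot 1 = 1 .
\end{align*}
```

Since $2x \neq 1$ except at $x = 1/2$, the conjecture fails. The correct rule, $(fg)' = f'g + fg'$, gives $1 \cdot x + x \cdot 1 = 2x$, in agreement with the direct computation.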

Clearly the intuitive approach can lead to error, but committing errors and 
learning to check one’s results are part of the learning process. If the fear of 
errors is to be a deterrent, a child would never learn to walk. 

It is the contention of this paper that understanding is achieved intuitively 
and that the logical presentation is at best a subordinate and supplementary aid 
to learning and at worst a decided obstacle. Intuition should fly the student to 
the conclusion, make a landing, and then perhaps call upon plodding logic to 
show the overland route to the same goals. If this contention is correct then the 
intuitive approach should be the primary one in introducing new subject matter 
at all levels. This recommendation may appear to be treason to mathematics, but 
let us withhold judgment. 


3. The Historical Evidence. Though no air-tight case for the intuitive ap- 
proach can be made from a study of the historical growth of mathematics a brief 
survey seems to offer some compelling arguments. 

The first deductive structure was Euclid’s Elements. Euclidean geometry, 
however, did not come into being in this form. It took 300 years, the period from 
Thales to Euclid, of exploration, fumbling, vague and even incorrect arguments 
before the Elements could be organized. Even this structure, intended to be 
strictly logical, rests heavily on intuitive arguments, pointless and even mean- 
ingless definitions, and inadequate proofs. That the logical structure can be 
devised after a subject is created and understood is not in question. What is 
relevant is that this deductive system came after the understanding was 
achieved. Moreover, it is no accident that Euclidean geometry was the first 
subject to receive any extensive mathematical development; the reason is that 
the intuition is readily applied to infer geometrical facts and the very figures 
suggest methods of proof. 

A striking contrast is provided by the development of arithmetic and algebra. 
Whole numbers and fractions and the operations with them were well accepted
by the Egyptians and Babylonians, on an empirical basis, at least as far back 
as 2000 B.C. But irrational numbers, once their true character was recognized 
by the Pythagoreans, were not accepted by the classical Greeks as numbers. 
Why not? Because whole numbers and fractions had an obvious physical mean- 
ing whereas irrationals did not. The only intuitive meaning that one could 
attach to irrationals was that they represented certain geometrical lengths. 
What, then, did the Greeks do? They rejected irrationals as numbers and 
thought of them as lengths. In fact they converted all of algebra into geometry 
in order to work with lengths, areas and volumes that might otherwise have to 
be represented numerically by irrationals, and they even solved quadratic equa- 
tions geometrically. 

The progress that was made in the use of irrational numbers is due to the 
Alexandrian Greek civilization, which was a composite of the classical Greek, 
Egyptian and Babylonian civilizations, and to the Hindus and Arabs who were 
entirely empirically oriented. It was the Hindus who decided that $\sqrt{2}\,\sqrt{3} = \sqrt{6}$,
and their argument was that these irrationals could be “reckoned with like
integers,” that is, like $\sqrt{4}\,\sqrt{9} = \sqrt{36}$. Irrational numbers were gradually accepted
because of their utility and because familiarity breeds uncriticalness. The logical 
presentation of irrational numbers was not created until the 1870's. 

Negative numbers, introduced by the practical-minded Hindus about 600 
A.D., did not gain acceptance for 1000 years. The reason: they lacked intuitive 
support. The history of complex numbers is somewhat similar, though these did 
not appear until about 1540, and only about 200 years were required for these to 
be used somewhat freely. A remark of Gauss is very pertinent. As is well known, 
he was one of the men who discovered the geometrical representation of complex 
numbers, and about this he said in 1831 [1], “Here (in this representation) the 
demonstration of an intuitive meaning of $\sqrt{-1}$ is completely grounded and
more is not needed in order to admit these quantities into the domain of the 
objects of arithmetic.” Neither Descartes, Fermat, Newton, Leibniz, Euler, 
Lagrange, Gauss, or Cauchy could have given a definition of negative or complex 
numbers, or irrationals for that matter. Yet all of them managed to work with 
these numbers quite satisfactorily, to put it mildly, at least insofar as their times 
employed these numbers. In 1837 Hamilton did give the ordered couple defini- 
tion of complex numbers in terms of real numbers but the logical development 
of the real number system itself was not constructed until the last part of the 
nineteenth century. The history of the entire complex number system is perti- 
nent not only in itself but because algebra and analysis obviously utilize the 
number system and whatever basis there was for the latter had to serve as the 
basis for algebra and analysis. 

The manner in which mathematics develops and is understood is beautifully 
exemplified by the history of the calculus. For the sake of brevity let us ignore
the predecessors of Newton and Leibniz. The basic concept of the calculus is, 
of course, the instantaneous rate of change of a function, that is, the limit of 
$\Delta y/\Delta x$ as $\Delta x$ approaches 0. Where it was physically appropriate Isaac Newton
thought of the limit in question as a velocity or as an acceleration, and he made 
great use of this fact in solving physical problems. But Newton experienced 
insuperable difficulties in explaining how he obtained the derivative from 
$\Delta y/\Delta x$. Because $\Delta y$ approaches 0 when $\Delta x$ does, he had to account for the fact
that the quotient approached a definite number. Newton wrote three papers on 
the calculus and put out three editions of his famous Mathematical Principles 
of Natural Philosophy, and in each of these publications he made different ex- 
planations. In his first paper he says that his method is “shortly explained 
rather than accurately demonstrated.” In his second paper he changed some 
terminology so as “to remove the harshness from the doctrine of indivisibles,” 
but the logic is no more perspicuous. In the third paper Newton says “in mathe- 
matics minutest errors are not to be neglected.” And then he gives a definition 
of the derivative, or fluxion as he called it, which supposedly shows that a 
fluxion is a precise concept. “Fluxions are, as near as we please, as the increments 
of fluents generated in times, as equal and as small as possible, and to speak 
accurately, they are in the prime ratio of nascent increments; yet they can be 
expressed by any lines whatever, which are proportional to them.” 

In the first and third editions of the Principles Newton says, “Ultimate ratios 
in which quantities vanish, are not, strictly speaking, ratios of ultimate quanti- 
ties, but limits to which the ratios of these quantities decreasing without limit, 
approach, and which, though they can come nearer than any given difference 
whatever, they can neither pass over nor attain before the quantities have 
diminished indefinitely.” He says further, “by the ultimate ratio of evanescent 
quantities is to be understood the ratio of the quantities, not before they vanish, 
nor after, but that with which they vanish.” There are other statements by 
Newton in the published versions of his works which differ from the above. 
Clearly Newton struggled hard to define the derivative but scarcely succeeded 
in formulating a precise concept. 

Leibniz worked not with the ratio $\Delta y/\Delta x$ and its limit but with differentials
dx and dy which, he said, though not zero were not ordinary numbers. They 
were geometrically the differences in abscissa and ordinate, respectively, of two 
“infinitely near points.” He too published many papers in which he tried to 
explain the meaning of the ratio dy/dx. Concerning his first paper on the cal- 
culus, published in 1684, even his friends, the Bernoulli brothers, said it was 
“an enigma rather than an explication.” 

Other papers and efforts to clarify his ideas did not accomplish any more. 
In a letter to Wallis Leibniz says: “It is useful to consider quantities infinitely 
small such that when their ratio is sought, they may not be considered zero but 
which are rejected as often as they occur with quantities incomparably greater. 
Thus if we have $x + dx$, $dx$ is rejected. But it is different if we seek the difference
between $x + dx$ and $x$. Similarly we cannot have $x\,dx$ and $dx\,dx$ standing together.
Hence, if we are to differentiate $xy$ we write
$(x + dx)(y + dy) - xy = x\,dy + y\,dx + dx\,dy$.
But here $dx\,dy$ is to be rejected as incomparably less than $x\,dy + y\,dx$.
Thus in any particular case, the error is less than any finite quantity.”

In the absence of satisfactory definitions he resorted to analogies to explain 
his differentials. At one time he referred to dy and dx as momentary increments 
or as vanishing or incipient magnitudes. These are Newtonian phrases. By way 
of additional explanation he said that as a point adds nothing to a line so dif- 
ferentials of higher order, e.g., dxdx, add nothing to dx. Alternatively dx is to x 
as a point to the Earth or as the radius of the Earth to that of the heavens. 
There are many other statements by Leibniz which are equally obscure. 

There were many attacks on Leibniz’s and Newton’s work. Newton did not 
respond but Leibniz did. He objected to “overprecise critics” and argued that 
we should not be led by excessive scrupulousness to reject the fruits of invention. 
The phrases infinitely large and infinitely small signify no more than quantities 
which one can take as great or as small as one wishes. And then he adds that one 
can use these ultimate quantities, the actual infinite and the infinitely small, as a
tool much as the algebraists use the imaginary with great profit. He also said 
that if one prefers to reject infinitely small quantities, it was possible instead to 
assume them to be as small as one judges necessary in order that they should be 
incomparable and that the error produced should be of no consequence or less 
than any given magnitude. 

Of course the successors of Newton and Leibniz were aware of the lack of 
rigor in the calculus. Euler in the first classic text on the calculus, his Introductio 
in Analysin Infinitorum (1748), and again in his Institutiones Calculi Differentialis
(1755) and Institutiones Calculi Integralis (1768-70), Lagrange in his
Théorie des fonctions analytiques (1797) and d’Alembert in his article Limite in 
the Encyclopédie all struggled manfully but futilely to clarify the basic concepts 
of the calculus. Even Cauchy, the founder of rigor, gave definitions in his Cours 
d’Analyse Algébrique (1821) that would be considered loose and intuitive today.
For example he says, “When the successive values attributed to a variable 
approach indefinitely a fixed value so as to end by differing from it by as little 
as one wishes, this last is called the limit of all the others.” He defines continuity 
essentially by the requirement that the numerical value of the difference 
$f(x_0 + \alpha) - f(x_0)$ decrease indefinitely with that of $\alpha$. Because he was far more
intuitive than rigorous Cauchy failed to distinguish between continuity and dif- 
ferentiability and even after his attention was called to this fact he persisted for 
twenty years in using differentiability where he had assumed only continuity. 
Clearly Cauchy’s own rigor was beyond his comprehension. Cauchy also failed 
to recognize the necessity for the uniform convergence of series in order to inte- 
grate series term-by-term and to assert that the sum of a convergent series of 
continuous functions is continuous. Nor did he question that a function of two 
independent variables has a limit in both variables if it has a limit in each vari- 
able separately. 

It is interesting that with respect to continuity and differentiability most 
texts of the nineteenth century including those written by the best mathematicians
either followed Cauchy or “proved” that continuity implied differentiability.
This is why the mathematical world was shocked when Weierstrass in 1872
produced the example of a function that is continuous for all real values of x 
but has no derivative at any value of x. Luckily this example came late in the 
development of the calculus for, as Emile Picard said in 1905 [8], “If Newton 
and Leibniz had known that continuous functions need not necessarily have a 
derivative, the differential calculus would never have been created.” 

In view of the vague, unclear, and even incorrect foundations of the calculus 
one might expect that the subject would collapse. But before Weierstrassian 
rigor became known through his lectures at Berlin in the 1870’s not only had the 
calculus been extended and applied but the subjects of ordinary and partial 
differential equations, the calculus of variations, differential geometry, and the 
theory of functions of a complex variable had been erected on the calculus. How 
did the mathematicians achieve these tremendous victories? Clearly they 
thought intuitively. 

We could examine the developments in projective geometry, non-Euclidean 
geometry and other areas but the story would be about the same. One can safely 
say that no proof given up to at least 1850 in any area of mathematics, except in 
the theory of numbers, and even there the logical foundation was missing, would 
be regarded as satisfactory by the standards of 1900, to say nothing about to- 
day’s standards. Yet the mathematics created by the men was surely understood 
by them. The history teaches us then that the intuition of great men is far more 
successful than their logic. 

One could of course argue that the growth of mathematics may indeed have 
proceeded as described but now that we have the proper logical structures for 
the number system, algebra, analysis and the various branches of geometry we 
need not ask students to repeat the fumblings of the masters. We can give them 
the correct approaches and they will understand them. This argument can be 
countered with the fact that many mathematicians did try to build logical 
foundations for the various subjects—witness Euler, Lagrange and Cauchy in 
the calculus—and their failure to do so ought to be some evidence that the logical 
approaches are not easy to grasp. Of course our students are superior to the best 
mathematicians of the past. 

There is not much doubt that the difficulties the great mathematicians en- 
countered are precisely the stumbling blocks that students experience and that 
no attempt to smother these difficulties with logical verbiage will succeed. If it 
took mathematicians 1000 years from the time that first class mathematics 
appeared to arrive at the concept of negative numbers, and it did, and if it took 
another 1000 years for mathematicians to accept negative numbers, as it did, we 
may be sure that students will have difficulties with negative numbers. More- 
over, the students will have to master these difficulties in about the same way 
that the mathematicians did, by gradually accustoming themselves to the new 
concepts, by working with them and by taking advantage of all the intuitive 
support that the teacher can muster. 

These conclusions have been reached by many great mathematicians who 
have concerned themselves with pedagogy. Poincaré said [12], “The zoologists
maintain that in a brief period the development of the embryo of an animal 
recapitulates the history of its ancestors of all geological epochs. It appears that 
it is the same in the development of the mind. The task of the educator is to 
make the mind of the child go through what his fathers have experienced, to 
pass rapidly through certain stages but not to omit any. For this purpose, the 
history of the science ought to be our guide.” Even Hilbert, the founder of mod- 
ern axiomatics, granted the high pedagogic and heuristic value of the genetic 
method [2]. 

But we shall not insist on the evidence of history. There are other weighty 
arguments. 

4. The Distortions of the Deductive Approach to Mathematics. Far from
being the pedagogically sound representation of mathematics, the deductive 
approach introduces distorted views of the subject. First of all, mathematics is 
primarily a creative activity, and this calls for imagination, geometric intuition, 
experimentation, judicious guessing, trial and error, the use of analogies of the 
vaguest sort, blundering and fumbling. Even when a mathematician is con- 
vinced that a result must be correct he must still create to find the proof. As 
Gauss put it, “I have got my result but I do not know yet how to get it.” Every 
mathematician knows that the hard work, tribulations and real thinking are 
required by, and the sense of achievement derives from, the creative effort. 
Writing up the final deductive formulation is a boring task. 

Creativity presupposes flexibility in solving problems and any ideas from 
any domain of mathematics should be entertained whether or not they fall 
within the confines of a particular axiomatic structure. The latter, in fact, 
acts as a straitjacket on the mind. 

What does logic contribute to the creation of concepts? Suppose one wishes 
to define the curvature of a surface. This definition is not arrived at by a de- 
duction from axioms. It requires some deep insight to appreciate that this 
concept can be effectively represented by the product of the maximum and 
minimum curvatures of the curves through a point on the surface. As a matter 
of fact this definition was created by Euler who paid no attention to axiomatics. 

Some of the greatest ideas in mathematics are not at all a matter of logic. 
Perhaps the best example is the realization that non-Euclidean geometry is 
applicable to physical space. The logical side, namely, pursuing the conse- 
quences of assuming a non-Euclidean parallel axiom, was a relatively simple 
task and was performed by Saccheri, Lambert, Legendre, Schweikart, Taurinus 
and many others. But it was Gauss who first recognized that these new geom- 
etries are as applicable as Euclidean geometry. The consequences for mathe- 
matics were as revolutionary as the very creation of mathematics itself. 

It is true that some mathematicians, for example, Weierstrass in part of 
his work, Peano and Frege, produced rigorous theorems or axiomatic deductive 
structures. But in this work they were only reformulating what was already 
known and their goal was to rigorize what was well understood. New ideas were 
never obtained in this manner. Logic discovers nothing, neither the statement
of a theorem nor its proof, even in the construction of axiomatic formulations 
of known results. Thus the concentration on the deductive approach omits the 
real activity. The logical formulation does dress up this activity but conceals 
the flesh and blood. It is like the clothes which make the woman but are not 
the woman. It is the last act in the development of a branch of mathematics 
and, as one wise professor put it, when this is performed the subject is ready for 
burial. Logic may be a standard and an obligation of mathematics but it is 
not the essence. 

The student should be creating mathematics. Of course he will be re-creating 
it and with the aid of a teacher. This recreativity on the part of the student 
is more popularly termed discovery today. Every teacher professes to espouse 
discovery. The student can be gotten to do this if he is allowed to think intui- 
tively but he cannot be expected to discover within the framework of a logical 
development that is almost always a highly sophisticated and artificial recon- 
struction of the original creative work. 

The logical version is a distortion of mathematics for another reason. The 
concepts, theorems and proofs emerged from the real world. It is the uses to 
which the mathematics is put that tell us what is correct. Thus we add frac- 
tions by finding a common denominator and not by adding numerators and 
adding denominators though we do multiply fractions by multiplying numer- 
ators and multiplying denominators. Likewise, the uses to which matrices are 
put determine that multiplication is to be noncommutative though we can 
devise purely mathematical multiplications of matrices that are commutative. 
After we have determined what properties mathematical concepts and opera- 
tions must possess on the basis of the uses of these concepts and operations we 
then invent a logical structure, however artificial it must be, which yields these 
properties. Hence, the logic does not dictate the content of mathematics. The 
uses determine the logical structure. The logical organization is an afterthought.
As Jacques Hadamard remarked, logic merely sanctions the conquests
of the intuition. Or, as Weyl put it, “logic is the hygiene which the mathematician
practices to keep his ideas healthy and strong.”

In fact, if a student is really bright and he is told to cite the commutative 
law to justify, say, $3 \cdot 4 = 4 \cdot 3$, he may very well ask, Why is the commutative
law correct? The true answer is, of course, that we accept the commutative
law because our experience with groups of objects tells us that $3 \cdot 4 = 4 \cdot 3$. In
other words the commutative law is correct because $4 \cdot 3 = 3 \cdot 4$ and not the
other way around. The normal student will parrot the words commutative law, 
and he will, as Pascal put it in his Provincial Letters, “fix this term in his memory 
because it means nothing to his intelligence.” 

The deductive development of a branch of mathematics is often so artificial 
that it is meaningless. No example is more pertinent than the deductive de- 
velopment of the real number system. There were good reasons to axiomatize 
the number system, but the introduction of fractions and negative numbers as 
couples with special definitions of the operations with these couples and the
introduction of the irrationals by Cantor sequences or Dedekind cuts, clever as 
they may be, are so artificial, trumped-up and foreign to the intuitive meaning 
and uses of these numbers as to preclude understanding. 

In such developments $-2$ is often introduced as the number which when
added to 2 gives 0 or, as the modern mathematics texts put it, $-2$ is the unique
additive inverse to 2. Such a definition induces no more understanding of $-2$
than the statement, anti-matter is that which added to matter produces a 
vacuum, gives any understanding of anti-matter. One doesn’t learn even about 
dogs from a definition of dogs. 

Poincaré makes this point too [9]. “In becoming rigorous mathematical 
science assumes a character so artificial as to strike every one. It forgets its 
historical origins; we see how the questions can be answered, but we no longer 
see how and why they were put.”

Poincaré also notes [12] that in building up the number system from the 
integers there are many different constructions one can make. Why do we take 
one rather than another? “The choice is guided by the recollection of the 
intuitive notion in which this construction took place; without this recollection, 
the choice appears unjustified. But to understand a theory it is not sufficient 
to show that the path that one follows does not present obstacles; it is necessary 
to take account of the reasons that one chooses that path. Can one ever under- 
stand a theory if one builds it up right from the start in the definitive form that 
rigorous logic imposes, without some indications of the attempts which led to 
it? No; one does not really understand it; one cannot even retain it or one re- 
tains it only by learning it by heart.” 

Many teachers might retort that the student has already learned the intuitive
facts about the number system and is now ready for the appreciation of the
deductive version, which exemplifies mathematics. If the student really under- 
stands the number system intuitively the logical development will not only not 
enhance his understanding but will destroy it. As an example of mathematical 
structure no poorer choice could be made because the construction is so con- 
trived. The development is so full of details and so stilted that it not only stulti- 
fies the mind but obscures the real ideas. Yet just this topic has now become the 
chief one in high school and college mathematics courses. 

Actually this deductive approach is even misleading. In extending the 
number system from the natural numbers to the various other types we insist 
that the commutative and associative properties of the operations be retained. 
Why do we insist on these properties? We teachers know that the uses of the 
numbers call for these properties but the student gets the impression that 
these are necessary properties of all mathematical quantities. Why then do we
not extend the order properties to complex numbers and the commutative 
property to matrices? The logical approach gives the student an entirely false 
impression of how mathematics develops. 

The insistence on a deductive approach deceives the student in another
way. He is led to believe that mathematics is created by geniuses who start
with axioms and reason directly from the axioms to the theorems. The student 
feels humbled and baffled, but the obliging teacher is fully prepared to demon- 
strate genius in action. Perhaps most of us do not need to be told how mathe- 
matics is created but it may help to listen to the words of Felix Klein [3]. “You 
can often hear from non-mathematicians, especially from philosophers, that 
mathematics consists exclusively in drawing conclusions from clearly stated 
premises; and that in this process, it makes no difference what these premises 
signify, whether they are true or false, provided only that they do not contradict 
one another. But a person who has done productive mathematical work will 
talk quite differently. In fact those people are thinking only of the crystallized 
form into which finished mathematical theories are finally cast. The investigator 
himself, however, in mathematics as in every other science, does not work in 
this rigorous deductive fashion. On the contrary, he makes essential use of his 
imagination and proceeds inductively aided by heuristic expedients. One can 
give numerous examples of mathematicians who have discovered theorems of 
the greatest importance which they were unable to prove. Should one then 
refuse to recognize this as a great accomplishment and in deference to the above 
definition insist that this is not mathematics? After all it is an arbitrary thing 
how the word is to be used, but no judgment of value can deny that the induc- 
tive work of the person who first announces the theorem is at least as valuable 
as the deductive work of the one who first proves it. For both are equally neces- 
sary, and the discovery is the presupposition of the later conclusion.” 

The deductive approach produces practical complications. If a student has 
to show, for example, that $4ab(ab + 3ac) = 4a^2b^2 + 12a^2bc$ and if he has to justify 
each step, he will have to think carefully and give reasons for so many steps 
that he will take minutes to do what he should do almost automatically on the 
basis of experience with numbers. It is far preferable that the student should 
become so familiar with the basic properties such as distributivity, commuta- 
tivity and associativity that he does not realize he is using them. Likewise 
many students of calculus have learned (by heart) the proof that a continuous 
function on a closed interval has a maximum and a minimum but cannot find 
the maxima and the minima of simple functions. 

We should be grateful that students accept unquestioningly facts that seem 
entirely reasonable to them whether on the basis of experience with numbers or 
intuitive arguments. In fact we should do all we can to make the elementary 
operations so habitual that students do not have to think about them any more 
than one thinks when he ties his shoelaces. If students do not see readily that 
$3 \cdot x = x \cdot 3$, it is not because they lack familiarity with the commutative principle 
but rather because they fail to understand that x is just a number. (Of course, 
one should say, x is a placeholder for a number.) When the time to teach a 
noncommutative operation arrives, then commutativity can be stressed. 

The need to make some of the work automatic was stressed by a man who 
certainly understood the role of axiomatics. Alfred North Whitehead says 
[15], “It is a profoundly erroneous truism, repeated by all copybooks, and by 
eminent people when they are making speeches, that we should cultivate the 
habit of thinking of what we are doing. The precise opposite is the case. Civiliza- 
tion advances by extending the number of important operations which we can 
perform without thinking about them. Operations of thought are like cavalry 
charges in a battle—they are strictly limited in number, they require fresh 
horses and must only be made at decisive moments.” 

Modern texts are not content just to present mathematics deductively. They 
incorporate the rigor that meets the professional’s standards. Thus the deduc- 
tive approach to geometry now requires that the axioms overlooked by Euclid, 
the order axioms for example, be included to justify the steps. The conse- 
quence is that a host of trivial theorems must be proved before one reaches the 
significant ones. Thus the student must prove that there is a unique midpoint 
for each line segment and that there is an inside and an outside of a triangle. 
Even worse is the fact that many of the theorems are more obvious than the 
axioms used to establish them. Hence, the less obvious is used to prove the more 
obvious. But as far as the student is concerned the whole point of proof is just 
the reverse. Students will question what is being accomplished and perhaps even 
wonder whether we teachers are sane. That for two thousand years Euclidean 
geometry, as formulated by the presumably careless or naive Euclid, was re- 
garded by the best mathematicians as the paradigm of rigor, bears no weight 
with the advocates of precise axiomatics. Today’s students, we are apparently 
supposed to believe, are sharper and will not be satisfied with proofs that fail 
to mention details whose absence no one noticed for so many centuries. 

Poincaré struck at this very folly [12]. “When a student commences seri- 
ously to study mathematics, he believes he knows what a fraction is, what 
continuity is, and what the area of a curved surface is; he considers as evident, 
for example, that a continuous function cannot change its sign without vanish- 
ing. If, without any preparation, you say to him: No, that is not at all evident; 
I must demonstrate it to you; and if the demonstration rests on premises which 
do not appear to him more evident than the conclusion, what would this un- 
fortunate student think? He will think that the science of mathematics is only 
an arbitrary accumulation of useless subtleties; either he will be disgusted with 
it or he will amuse himself with it as a game and arrive at a state of mind 
analogous to that of the Greek sophists.” 

The rigorous approach requires such a multitude of minor theorems that the 
larger features of the subject fail to stand out. As Poincaré put it [10], “In the 
edifices built up by our masters, of what use is it to admire the work of the 
mason if we cannot comprehend the plan of the architect? Now pure logic can- 
not give us the appreciation of the total effect; this we must ask of the in- 
tuition.” 

In many areas the present emphasis on the logical approach is sheer hypoc- 
risy. What mathematician uses the logical development of the complex number 
system to justify his operations with real or complex numbers? Yet this is what 
is taught to students as the way to learn the “truth” about numbers. How 
many mathematicians have ever satisfied themselves that $\sqrt{2}^{\sqrt{3}}$ is defined in 
the theory of irrational numbers or even that $\sqrt{2}\,\sqrt{3} = \sqrt{3}\,\sqrt{2}$? How many 
have ever worked through a rigorous development of Euclidean geometry (as 
opposed to the pseudo-rigorous developments found in modern texts)? Felix 
Klein did not hesitate to admit [5], “To follow a geometrical argument purely 
logically without having the figure on which the argument bears constantly 
before me is for me impossible.” 

As a matter of fact the attempt to be completely deductive ensnares the 
teacher in a trap. It is often necessary to include a proof which even the rigor- 
oriented teachers concede to be too difficult for the student, such as the proof 
of the formula for the area of a circle in plane geometry. Many texts evade the 
issue by adopting an axiom. As a consequence numerous elementary geometry 
texts contain as many as 70 or 80 axioms. Surely if one can adopt axioms at 
will there is no need to prove anything. The only lesson the student will learn 
from such presentations is that if he is stuck he can adopt an axiom. The 
mathematics teacher can no more afford to be profligate with axioms than to be 
parsimonious. Likewise in the presentation of the real number system the high 
school texts proceed axiomatically from the natural numbers. But when they 
get to the irrational numbers, whose logical development the authors recognize 
to be too difficult for the student, they resort to the number line and speak of 
points which have no numbers assigned to them. These are designated by the 
irrational numbers. If the logical presentation of the rational numbers had any 
value it is dissipated by this meaningless introduction of the irrationals. 

One of the gravest defects in the teaching of mathematics is the lack of 
motivation. Mathematics proper, as Weyl described it, has the inhuman quality 
of starlight, brilliant and sharp, but cold. Consequently very few students are at- 
tracted to the subject. In fact most of those taking high school and college 
mathematics do so because it is required or because they are prospective 
scientists or engineers. These students would prefer to learn more about the 
fruits than the roots of mathematics. Proper pedagogy requires that these 
students be shown why they should be studying particular topics and subjects. 
To assure them that the material will prove useful at some later time is hardly 
an incentive to take it seriously. The mathematician or the rare student who 
finds intellectual challenge or aesthetic satisfactions in the subject may be 
intrigued to learn that there are only five regular polyhedra. But very few 
students are excited by this fact. As far as they are concerned the world would 
be just as well off if there were an infinite number of them. As a matter of fact 
there is an infinite number of regular polygons and no one seems depressed by 
this fact. 

Although the subject of motivation is a vast one its relevance here is simply 
that it is relatively easy to give a genuine or significant motivation to a mathe- 
matical topic when this is introduced intuitively or heuristically because his- 
torically there were significant motivations, whereas it is very difficult to do so 
in a logical presentation because the latter is many stages removed from reality 
and, as we have already pointed out, is often artificial. How does one motivate the 
concept of a fraction when it is to be introduced as an ordered couple of natural 
numbers? How can a student see the point of the $\epsilon$-$\delta$ definition of continuity 
if this is his introduction to the concept? In fact the logical approach often 
destroys the motivation. One may motivate the integral as the method of finding 
the area under a curve. But if one defines the area as an integral one begs the 
whole question of doing something significant with the integral. 

Many texts and teachers claim that they do provide motivation even for a 
logical approach. Thus they “motivate” the introduction of negative, irrational 
and complex numbers by stating that we wish to solve equations such as 
$x^2 + 2 = 0$. But for students who have no reason to solve even $x - 2 = 0$, the chal- 
lenge of solving $x^2 + 2 = 0$ certainly isn’t exciting. Moreover, the bright student 
can come back at the teacher and ask, “Why can’t we solve $5/x = 0$ by intro- 
ducing $\infty$ as a number?” If we can invent definitions to operate with $\sqrt{-2}$, 
we can invent definitions to operate with $\infty$. 

The presentation of theorems without the motivation robs the student of 
insight. Even on the somewhat advanced level where we deal with students who 
have some leaning toward mathematics, to present theorems without the 
motivation, whether it does or does not kill off the interest in mathematics, 
certainly leaves them with no more than a meaningless collection of theorems 
and proofs and without the power to think for themselves. Thus in linear alge- 
bra texts the subject of eigenvalues of matrices is always treated. I have not 
found one text which indicates why one wants to learn anything about the 
eigenvalues. The simultaneous reduction of two quadratic forms to sums of 
squares is another seemingly meaningless topic. At least the origin in mechanical 
problems might be suggested. Equivalent, congruent, and similar matrices are 
treated for no apparent reason. (See the article by R. J. Jarvis: A Case for 
Applications of Linear Algebra and Group Theory, this MONTHLY, 73, 1966, 
654-656.) 


5. The Role of Logic in Pedagogy. In view of the many pedagogical short- 
comings in the logical approach to mathematics it is not surprising that many 
perceptive mathematicians (there are nonperceptive ones) have spoken out 
against the logical approach. Descartes deprecated logic in rather severe lan- 
guage. “I found that, as for Logic, its syllogisms and the majority of its other 
precepts are useful rather in the communication of what we already know 
or...in speaking without judgment about things of which one is ignorant.” 
Roger Bacon said, “Argument concludes a question but it does not make us feel 
certain, or acquiesce in the contemplation of a truth, except the truth also be 
found to be so by experience.” Pascal pointed out that “Reason is the slow and 
tortuous method by which those who do not understand the truth discover it.” 

Is there then no role for logic or proof? Should it be rejected altogether? 
Not at all. The first approach to any subject should indeed be intuitive. As 
Poincaré put it [9], “I have already had occasion to insist on the place intuition 
should hold in the teaching of the mathematical sciences. Without it young 
minds could not make a beginning in the understanding of mathematics; they 
could not learn to love it and would see in it only a vain logomachy; above all 
without intuition they would never become capable of applying mathe- 
matics.... 

“We need a faculty which makes us see the end from afar and intuition is 
this faculty.” 

Proof should enter but only gradually. Moreover, the level of rigor must be 
suited to the level of the student’s mathematical development. The proof need 
only convince the student. The capacity to appreciate rigor is a function of 
the mathematical age of the student and not of the age of mathematics. This 
appreciation is acquired gradually, and the student must have the same freedom 
to make intuitive leaps that the mathematicians had. Rigor will not refine an 
intuition that has not been allowed to function freely. Proofs of whatever 
nature should be invoked only where the students think they are required. A 
proof is meaningful when it answers doubts. Felix Klein has stressed this 
point [6]: “It is my opinion that in teaching it is not only admissible, but 
absolutely necessary, to be less abstract at the start, to have constant regard for 
applications, and to refer to the refinements only gradually as the student be- 
comes able to understand them. This is, of course, nothing but a universal 
pedagogical principle to be observed in all mathematical instruction.” As 
Professor Max M. Schiffer of Stanford University has stated it, “Never put 
logical carts before heuristic horses.” 

The level of rigor can, of course, be advanced as the student progresses. 
Poincaré makes this point too [12]. “On the other hand, when he is more 
advanced, when he becomes familiar with mathematical reasoning and his mind 
will be matured by this very experience, the doubts will be born of themselves 
and then your demonstration will be well received. It will awaken new doubts 
and the questions will arise successively to the child as they arose successively 
to our fathers to the point where only perfect rigor can satisfy him. It is not 
sufficient to doubt everything; it is necessary to know why one doubts.” 


6. Why is the Deductive Approach Favored? Despite the pedagogical defects 
of the deductive approach, the criticisms of big mathematicians, and the claims 
of many mathematicians that they do teach discovery, the prevailing practice, 
if I may judge from the textbooks and hundreds of talks with professors, is to 
present mathematics rigorously and to emphasize the axiomatic method. In- 
deed this is the essence of the so-called reform known as “modern mathe- 
matics” or the “new mathematics.” Why do teachers use this approach? 

There is no doubt that some teachers actually believe that the axiomatic 
deductive presentation is the essence of mathematics. Whether they acquired 
this limited view through the instruction they themselves received or have 
been induced to adopt it because the textbooks now favor it, they are at least 
sincere if not effective pedagogues. One has the sneaking suspicion that a few 
teachers enjoy presenting the familiar number system in the recondite axiomatic 
form because they understand the simple mathematics it represents and yet 
can appear to be presenting profound mathematics. Certainly much of the rigor 
in modern texts comes from limited men who seek to conceal shallowness by 
giving a facade of profundity to the obvious and from pedants who mask their 
pedantry under the guise of rigor. 

Many young teachers believe that now that we have the correct, polished ver- 
sion of mathematics it is sufficient to give the axiomatic or rigorous approach and 
that students will absorb it. These very same teachers would have been swamped 
by such a presentation but having learned the correct version they can no longer 
recall and appreciate the difficulties they encountered in learning the rigorous 
versions. 

Some teachers, knowing the rigorous proofs, feel uneasy about presenting 
a convincing argument which they, at least, know is incomplete. But it is not 
the teacher who is to be satisfied; it is the student. Good pedagogy demands such 
compromises. 

Other teachers want to give students the whole truth at once so that they 
should not have to unlearn what they once learned. But one cannot teach even 
English or History by starting at the top. The A that a high school student might 
earn for an English composition would most likely be rated C at the college 
level. 

For whatever reason teachers insist on presenting to young people a modern 
rigorous proof they are deceiving themselves. There is no ultimate rigorous 
proof. This fact derives from the very way in which mathematics develops. 
Felix Klein has described it [4]: “In fact, mathematics has grown like a tree, 
which does not start at its tiniest rootlets and grow merely upward, but rather 
sends its roots deeper and deeper at the same time and rate that its branches 
and leaves are spreading upward. ... We see, then, that as regards the funda- 
mental investigations in mathematics, there is no final ending, and therefore on the 
other hand, no first beginning, which could offer an absolute basis for instruction.” 
Poincaré expressed a similar view. There are no solved problems; there are only 
problems that are more or less solved. Mathematics is as correct as human 
beings are and humans are fallible. 

At no time in the history of mathematics have we been less certain of what 
rigor is. Hence no proof is really complete, and the teacher must compromise in 
any case. It would be interesting to know how many teachers are aware that 
set theory, which they now regard as the indispensable beginning to any rigor- 
ous approach to mathematics, has been the source of our deepest and thus far 
insuperable logical difficulties [16]. Those who are not aware of the founda- 
tional problems might at least note the words of Hermann Weyl [14]: “The 
question of the ultimate foundations and the ultimate meaning of mathematics 
remains open; we do not know in what direction it will find its final solution 
nor even whether a final objective answer can be expected at all. ‘Mathematizing’ 
may well be a creative activity of man, like language or music, of primary 
originality, whose historical decisions defy complete objective rationalization.” 

Many teachers favor a logical presentation, particularly, an abstract one, 
such as group theory, because it is supposed to be efficient. They are under the 
impression that if a student is taught abstract groups he will in one swoop learn 
the properties of the rational, real and complex numbers, matrices, congru- 
ences, transformations and other topics. But of course a student who learns 
group theory could not on this basis add fractions. Nor does offering an example 
or two of a group save the day. The concrete cases must be thoroughly under- 
stood before one introduces an abstract development which unifies several con- 
crete ones. To introduce as examples concrete material which is as yet un- 
familiar to the student is of no help at all in making the abstract notion clearer. 
In every case learning proceeds from the concrete to the abstract and not vice 
versa. 

However, the major reason for the popularity of the axiomatic rigorous 
approach is that it is easier to teach. The entire body of material is laid out in 
a clear, clean-cut sequence and all the teacher has to do is repeat it. He has but 
to offer a canned body of material. I have heard teachers complain that many 
students, particularly engineers, wish to be told how to perform the processes 
they are asked to learn and then want to hand back the processes. But the 
teachers who teach the logical presentation because it avoids such difficulties as 
teaching discovery, leading students to participate in a constructive process, 
explaining the reasons for proceeding one way rather than another, and finding 
convincing arguments, are more reprehensible than the students who wish to 
avoid thinking and prefer just to repeat mechanically learned processes. 
Postulating properties has the advantage, as Bertrand Russell put it, of theft 
over honest toil. Pedagogically it is worse because the theft produces no gain 
in understanding. The logical approach to teaching is reminiscent of a reply 
that Samuel Johnson gave to a man who asked Johnson for further explanation 
of some argument he had given. Johnson barked, “I have found you an argu- 
ment but I am not obliged to find you an understanding.” 

Many of us know the story of the professor who was presenting a logical 
proof to his class, got stuck in the course of the proof, went over to the corner 
of the blackboard where he drew some pictures, erased the pictures, and then 
continued the proof. Whether the import of this story for pedagogy has been 
noted is doubtful. 

Many mathematicians prefer to present rigorous axiomatic approaches 
which, for example, use a minimal set of axioms, because they favor their own 
professional interest at the expense of the student. Even if such systems can be 
made understandable to young people the time required to teach them could be 
spent on more significant material. In this matter as well as in presenting 
sophisticated rigorous proofs they are using the classroom to challenge them- 
selves. These professors are serving themselves rather than the students not 
only in the form in which they present the various subjects but also in the pre- 
mature teaching of abstractions such as abstract algebraic concepts, linear 
vector spaces, finite geometries, set theory, symbolic logic and functional 
analysis, because these subjects lend themselves to axiomatic treatments. Is it 
any wonder that students become alienated and question the relevance of what 
they are being taught? 

There are many indications that professors who present rigorous material 
are really uncertain as to the wisdom of doing so. A number of calculus books 
begin with rigorous definitions and theorems, for example, those concerning 
limits and continuity, and then never refer to this material. Thereafter they use 
the cookbook presentation. The most charitable view of such books is that the 
authors wish to ease their own consciences or to give the students some idea of 
what rigor means. Perhaps an unfairly severe view is that these books offer only 
a pretense of rigor in order to appeal to both markets, the one that demands 
rigor and the one that is satisfied to teach mechanical procedures. 

Other texts adopt another “compromise.” In the body of the text the pre- 
sentation is mechanical with perhaps an occasional condescension to an intuitive 
explanation. The real “explanation” is given in rigorous proofs but these are put 
in appendices and presented so compactly that they are certain to be totally 
ununderstandable to the student. However, the authors have salved their con- 
sciences. Such books are no different from the old mechanical presentations. 
They do contribute to understanding in one respect, namely, they show that 
competent mathematicians are inept in pedagogy. 

Perhaps, after all, there is some merit to the logical approach to mathe- 
matics. It has been said of rigor that “The virtue of a logical proof is not that it 
compels belief but that it suggests doubts and the proof tells us where to con- 
centrate our doubts.” Or as Bertrand Russell put it [13], “It is one of the chief 
merits of proofs that they instill a certain scepticism as to the result proved.” 
Lebesgue pointed out another value of rigorous proof [7]. “Logic makes us 
reject certain arguments but it cannot make us believe any argument.” One 
must respect but suspect mathematical proofs. Since one of the main objectives 
of mathematics education is to instill scepticism in the student, he is deriving at 
least one benefit from the current logical extravaganzas. 
A COROLLARY TO THE GELFAND-MAZUR THEOREM 


H. A. SEID, University of California, Irvine 


We wish to establish the following result as a corollary to the Gelfand- 
Mazur Theorem: 


Let $A \neq 0$ be a complex commutative normed algebra such that $\|xy\| = \|x\|\,\|y\|$ 
for all $x, y \in A$. Then $A$ is isomorphic and isometric to the complex field. 


Note that there is no assumption that A contains an identity. In fact the 
major important step in the construction of the proof is to demonstrate that 
under the given norm condition on A, the algebra has an identity. (If A were 
assumed to have an identity, then the above corollary would merely be a special 
case of a well-known result which appears in the literature. See for example 
[3, Cor. 1.7.3, p. 39]. Also under the assumption of an identity in A, Lorch has 
a proof of the above corollary [1, Th. 5-2, p. 129].) 

We shall use an algebraic approach to allow us to demonstrate the existence 
of the identity in A. To obtain the result, the Gelfand-Mazur Theorem will be 
applied. The conclusion of the corollary follows directly. 

Proof. By the norm condition, $A$ has no zero divisors. Let $S = A - \{0\}$. Let 
$T = \{(a, s) \mid a \in A,\ s \in S\}$. As in Herstein [2, pp. 101-103] define an equivalence 
relation $\sim$ on $T$ as follows: $(a, s) \sim (a_1, s_1)$ if and only if $s_1 a = s a_1$. Denote the 
equivalence class of $(a, s)$ by $a/s$. We define $S^{-1}A = \{a/s \mid (a, s) \in T\}$. Then 
$S^{-1}A$ is the field of quotients of $A$ under operations $+$, $\cdot$, now to be defined. Let 
$a/s,\ a_1/s_1 \in S^{-1}A$. 

(1) $a/s + a_1/s_1 = (s_1 a + s a_1)/(s s_1)$ 

(2) $(a/s) \cdot (a_1/s_1) = a a_1/(s s_1) = a_1 a/(s_1 s) = (a_1/s_1) \cdot (a/s)$. 

Note that for every $s \in S$, $s/s$ is the unit element for $S^{-1}A$. Let $\lambda \in C$, 
$a/s \in S^{-1}A$. Then $\lambda(a/s) = \lambda a/s$. Hence $S^{-1}A$ is a complex algebra. We norm 
$S^{-1}A$ as follows: for $a/s \in S^{-1}A$, $\|a/s\|' = \|a\|/\|s\|$; $\|\cdot\|'$ is well defined by use of 
the norm condition on $A$ and the definition of equivalence classes in $S^{-1}A$. We 
show that $\|\cdot\|'$ is a norm by demonstrating the triangle inequality. (The other 
conditions necessary are trivially true.) Let $a/s,\ a_1/s_1 \in S^{-1}A$. 

\[
\|a/s + a_1/s_1\|' = \frac{\|s_1 a + s a_1\|}{\|s s_1\|}
\le \frac{\|s_1 a\| + \|s a_1\|}{\|s_1 s\|}
= \frac{\|s_1\|\,\|a\| + \|s\|\,\|a_1\|}{\|s_1\|\,\|s\|}
= \frac{\|a\|}{\|s\|} + \frac{\|a_1\|}{\|s_1\|}
= \|a/s\|' + \|a_1/s_1\|'.
\]

Also $\|(a/s) \cdot (a_1/s_1)\|' = \|a/s\|'\,\|a_1/s_1\|'$. Let $G$ be the completion of $S^{-1}A$. Then 
$G$ is a field and a Banach algebra. By the Gelfand-Mazur Theorem, $G \cong C$ and 
$G$ is isometric to $C$. Note that if $\phi: G \to C$ is the isomorphism, $\phi(s/s) = 1 \in C$ for 
$s \in S$. For $\lambda \in C$, $\phi(\lambda s/s) = \lambda\phi(s/s) = \lambda \cdot 1 = \lambda$. Hence, $S^{-1}A \cong C$. 

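Define $\beta: A \to S^{-1}A$ by $x \mapsto xs/s$. It is easily seen that $\beta$ is an injective algebra 
homomorphism. To see that $\beta$ is surjective suppose $\lambda s/s \in S^{-1}A$ for $\lambda \in C$. Let 
$x \in S \subset A$. $\beta(x) = xs/s = \lambda_1 s/s$ for some $\lambda_1 \in C$ and $\lambda_1 \neq 0$. Then $(\lambda/\lambda_1)x \in A$ and 
$\beta((\lambda/\lambda_1)x) = \lambda s/s$. Hence under $\beta$, $A \cong S^{-1}A$. Thus $A$ has an identity element $e$. It 
follows easily that $\beta$ is an isometry. 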
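
The proof asserts, without writing out the computation, that the quotient norm is well defined. A short check that uses only the norm condition on $A$ and the definition of the equivalence relation might run as follows; this is a sketch of a step the note leaves to the reader, not part of the original argument.

```latex
\[
(a, s) \sim (a_1, s_1)
\;\Longrightarrow\; s_1 a = s a_1
\;\Longrightarrow\; \|s_1\|\,\|a\| = \|s_1 a\| = \|s a_1\| = \|s\|\,\|a_1\|
\;\Longrightarrow\; \frac{\|a\|}{\|s\|} = \frac{\|a_1\|}{\|s_1\|}.
\]
```

The last step divides by $\|s\|\,\|s_1\|$, which is nonzero because $s$ and $s_1$ lie in $S = A - \{0\}$.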
ON AITKEN’S $\Delta^2$-METHOD 
SIMEON REICH, The Technion, Haifa, Israel 


In [1], p. 73, the following result is proved: 


THEOREM 1. Let $\{x_n\}$ be any sequence (of real numbers) converging to the limit 
$s$, such that the quantities $d_n = x_n - s$ satisfy $d_n \neq 0$ for $n$ sufficiently large, and 
$d_{n+1}/d_n \to A$, where $|A| < 1$. Define $\{x'_n\}$ by 

\[ x'_n = x_n - (\Delta x_n)^2/\Delta^2 x_n \]

(this is Aitken’s method where $\Delta x_n = x_{n+1} - x_n$). Then $\{x'_n\}$ is actually defined for $n$ 
sufficiently large and converges faster to $s$ than the original sequence in the sense that 

\[ (x'_n - s)/(x_n - s) \to 0. \]

Although of no importance in the argument leading to this theorem, we 
remark that if the $A$ given in the statement of Theorem 1 is not zero, the con- 
vergence of $\{x_n\}$ is said to be linear; otherwise it is said to be nonlinear. Now, 
since the result quoted is correct, it seems puzzling that in another book ([2], 
p. 348) the reader is told not to use Aitken’s method in case the convergence is 
nonlinear. Perhaps we can resolve this “paradox” in the following manner: 

In Theorem 1, the rate of convergence of $\{x'_n\}$ is compared with that of 
$\{x_n\}$; but it is more reasonable to compare $\{x'_n\}$ with $\{x_{n+2}\}$, because $x_{n+2}$ is 
already used in the definition of $x'_n$. If we do this, we obtain a slightly different 
result. 


THEOREM 1’. Under the hypotheses of Theorem 1, and with the additional as- 
sumption that $A \neq 0$, $(x'_n - s)/(x_{n+2} - s) \to 0$. 


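Proof. We may write $d_{n+1}/d_n = A + \varepsilon_n$ for $n$ sufficiently large, where $\varepsilon_n \to 0$, 
and then 

\[
\frac{x'_n - s}{x_{n+2} - s}
= \frac{\delta_n - 2\varepsilon_n(A - 1) - \varepsilon_n^2}{[(A - 1)^2 + \delta_n](A + \varepsilon_{n+1})(A + \varepsilon_n)}
\]

converges to zero, where 

\[ \delta_n = A(\varepsilon_n + \varepsilon_{n+1}) - 2\varepsilon_n + \varepsilon_n\varepsilon_{n+1} \to 0. \]

What happens when $A = 0$? Then 

\[
\frac{x'_n - s}{x_{n+2} - s}
= \frac{\varepsilon_n(\varepsilon_{n+1} - \varepsilon_n)}{\varepsilon_n\varepsilon_{n+1}(1 - 2\varepsilon_n + \varepsilon_n\varepsilon_{n+1})}.
\]

Because $A = 0$, $d_{n+1}/d_n = \varepsilon_n$ and therefore $\varepsilon_n \neq 0$ for $n$ sufficiently large. This 
means that 

\[
\frac{x'_n - s}{x_{n+2} - s}
= \frac{1}{1 - 2\varepsilon_n + \varepsilon_n\varepsilon_{n+1}}\left[1 - \frac{\varepsilon_n}{\varepsilon_{n+1}}\right].
\]

Thus in this case, 

\[ (x'_n - s)/(x_{n+2} - s) \to 0 \]

if $\varepsilon_n/\varepsilon_{n+1} \to 1$. That is, if 

\[ (d_{n+1}/d_n)/(d_{n+2}/d_{n+1}) \to 1. \]
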


For example, this will indeed happen if we put x,=2~", but will not happen if 
Xn =2-", 
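
The two comparisons in Theorem 1 and Theorem 1’ can be tried numerically in the linear case $A \neq 0$. The sketch below is only an illustration under stated assumptions: the test sequence (the fixed-point iteration $x_{n+1} = \cos x_n$ started at 1) and the helper name `aitken` are choices made here and do not come from the note.

```python
import math


def aitken(x):
    """Return x'_n = x_n - (dx_n)^2 / d2x_n for every n with x_{n+2} available."""
    out = []
    for n in range(len(x) - 2):
        dx = x[n + 1] - x[n]                     # forward difference of x_n
        d2x = x[n + 2] - 2.0 * x[n + 1] + x[n]   # second difference of x_n
        out.append(x[n] - dx * dx / d2x)
    return out


# Assumed example: x_{n+1} = cos(x_n) converges linearly, so A is nonzero
# (A = -sin(s), roughly -0.67).
x = [1.0]
for _ in range(200):
    x.append(math.cos(x[-1]))
s = x[-1]      # limit, accurate to about machine precision
x = x[:30]     # keep only the early terms, which are still far from s

xp = aitken(x)
# Theorem 1 compares x'_n with x_n; Theorem 1' compares x'_n with x_{n+2}.
ratio_n = [(xp[n] - s) / (x[n] - s) for n in range(len(xp))]
ratio_n2 = [(xp[n] - s) / (x[n + 2] - s) for n in range(len(xp))]

for n in (5, 10, 20):
    print(n, ratio_n[n], ratio_n2[n])
```

Both ratios shrink as $n$ grows, in line with the two theorems. The second ratio is larger in magnitude than the first by a factor of roughly $1/A^2$, which is why the second comparison is the more demanding one.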
ON WEAKENING OF A POSTULATE OF H. WEYL 


S. C. Saxena, University of Akron 


Given a vector space of dimension $n$, an affine space of dimension $n$ may be 
defined by using axioms of H. Weyl. This is done as follows: 

Let $V$ be a vector space of dimension $n$ defined over a field $F$. Let $S$ be a 
nonempty set, and 

\[ \rho : S \times S \to V \]

a mapping having the following properties: 

P. 1: For every three points $P$, $Q$, $R$ in $S$, 

\[ \rho(P, Q) + \rho(Q, R) = \rho(P, R). \]

P. 2: For every $P \in S$ and $\alpha \in V$, there exists a unique point $Q$ in $S$ such that 
$\rho(P, Q) = \alpha$. 

The set $S$ with the vector space $V$ is called an affine space of dimension $n$ [1]. 

The purpose of this note is to show that the postulate P. 2 can be replaced 
by the following weaker postulate: 

P. 2’: There exists a point $P_0 \in S$ such that for each $\alpha \in V$, there exists a 
unique point $Q \in S$ such that $\rho(P_0, Q) = \alpha$. 

Proof. In P. 1 first taking $P = Q = R$, we get $\rho(P, P) = 0$; then taking $P = R$, 
we get $\rho(P, Q) = -\rho(Q, P)$. 

Given $P \in S$ and $\alpha \in V$, by P. 2’ there exists a unique $Q \in S$ such that 
$\rho(P_0, Q) = \alpha + \rho(P_0, P)$, i.e., such that $\rho(P, Q) = \rho(P, P_0) + \rho(P_0, Q) = \rho(P_0, Q) 
- \rho(P_0, P) = \alpha$. Thus P. 2 holds. 
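
The construction in the proof can be traced in a toy model. The sketch below assumes the simplest concrete case, which is not taken from the note: points and vectors are both pairs of integers, $\rho(P, Q) = Q - P$, and the base point `P0` and the helper names are hypothetical.

```python
# Toy model (an assumption for illustration): S and V are both pairs of
# integers, and rho(P, Q) is the coordinate difference Q - P.

def rho(P, Q):
    """The mapping rho: S x S -> V, here Q - P."""
    return (Q[0] - P[0], Q[1] - P[1])


def add(u, v):
    """Vector addition in V."""
    return (u[0] + v[0], u[1] + v[1])


P0 = (2, -1)  # the distinguished point of P. 2' (arbitrary choice)


def point_from_base(alpha):
    """P. 2': the unique Q with rho(P0, Q) = alpha."""
    return add(P0, alpha)


def point_from(P, alpha):
    """P. 2 recovered as in the proof: solve rho(P0, Q) = alpha + rho(P0, P)."""
    return point_from_base(add(alpha, rho(P0, P)))


P = (5, 7)
alpha = (-3, 4)
Q = point_from(P, alpha)
print(Q, rho(P, Q))  # rho(P, Q) should equal alpha
```

Only `point_from_base` uses the special point; `point_from` reaches every other starting point through P. 1, which is the content of the proof.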

From a pedagogical point of view P. 2’ appears better than P. 2, since the 
point $P_0$ may be represented by the zero vector of $V$ and we can start the coordinatiza- 
tion of $S$ in terms of vectors of $V$ from there. 
A RATIO-TYPE CONVERGENCE TEST 
H. MILLER, DePaul University 

In this note we introduce a ratio-type convergence test which properly con- 
tains the half of the Gauss Convergence Theorem dealing with convergence [1]. 


THEOREM 1. If $a_1 = 1$, $a_n > 0$ for $n \ge 2$ and $a_{n+1}/a_n = 1 - d_n$, then the conver- 
gence of 

\[ \sum_{n=1}^{\infty} \Bigl(1 - \frac{1}{n}\sum_{k=1}^{n} d_k\Bigr)^{n} \]

implies the convergence of $\sum_{n=1}^{\infty} a_n$. 

Proof. We have $a_1 = 1$, $a_2 = a_1(1 - d_1) = (1 - d_1)$, $a_3 = a_2(1 - d_2) = (1 - d_1)(1 - d_2)$, $\cdots$, and $a_n = (1 - d_1)(1 - d_2)\cdots(1 - d_{n-1})$, $\cdots$. Therefore

$$\sum_{n=1}^{\infty} a_n = 1 + \sum_{n=1}^{\infty}\prod_{k=1}^{n}(1 - d_k) \le 1 + \sum_{n=1}^{\infty}\Bigl(\frac{1}{n}\sum_{k=1}^{n}(1 - d_k)\Bigr)^{n},$$

since the geometric mean of n positive numbers is less than or equal to the arithmetic
mean. Therefore

$$\sum_{n=1}^{\infty} a_n \le 1 + \sum_{n=1}^{\infty}\Bigl(1 - \frac{1}{n}\sum_{k=1}^{n} d_k\Bigr)^{n},$$

completing the proof.
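
The termwise bound used in this proof, $a_{n+1} \le (1 - \frac{1}{n}\sum_{k=1}^{n} d_k)^n$, can be observed numerically. The sketch below is illustrative only: the function name `ratio_test_terms` and the choice $d_n = c/(n+1)$ with $c = 1.5$ are assumptions made for the example, not part of the note.

```python
def ratio_test_terms(d):
    """Given d_1..d_N, return (a, t) where
    a[0] = a_1 = 1, a[n] = a_{n+1} = a_n * (1 - d_n), and
    t[n-1] = (1 - (d_1 + ... + d_n)/n) ** n is the n-th term of the test series."""
    a = [1.0]
    t = []
    running = 0.0
    for n, dn in enumerate(d, start=1):
        a.append(a[-1] * (1.0 - dn))
        running += dn
        t.append((1.0 - running / n) ** n)
    return a, t

# Assumed example: d_n = c/(n+1) with c = 1.5, so every ratio 1 - d_n is positive.
c = 1.5
d = [c / (n + 1) for n in range(1, 2001)]
a, t = ratio_test_terms(d)

# Arithmetic-geometric mean inequality, term by term: a_{n+1} <= t_n.
bound_holds = all(a[n] <= t[n - 1] + 1e-12 for n in range(1, len(a)))
```

Summing `t` therefore dominates the tail of the series of the $a_n$, which is exactly how the theorem is applied.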

THEOREM 2. If $a_1 = 1$, $a_n > 0$ for $n \ge 2$ and if $a_{n+1}/a_n = 1 - c/n + \theta_n/n^{\lambda}$ where
$c > 1$, if $\{\theta_n\}_{n=1}^{\infty}$ is bounded, and if $\lambda > 1$, then

$$\sum_{n=1}^{\infty}\Bigl(1 - \frac{1}{n}\sum_{k=1}^{n} d_k\Bigr)^{n}$$

is convergent, where $d_k = 1 - a_{k+1}/a_k$. In other words, the test in Theorem 1 contains
the half of the Gauss convergence theorem dealing with convergence.


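Proof. For each k, $d_k = c/k - \theta_k/k^{\lambda}$. Hence

$$\Bigl(1 - \frac{1}{n}\sum_{k=1}^{n} d_k\Bigr)^{n} = \Bigl(1 - \frac{1}{n}\sum_{k=1}^{n}\Bigl(\frac{c}{k} - \frac{\theta_k}{k^{\lambda}}\Bigr)\Bigr)^{n} = \bigl(1 - c\,n^{-1}(\log n + c_n)\bigr)^{n},$$

where $\{c_n\}$ is bounded. Now let $\delta$ be given satisfying $1 < \delta < c$. There exists an
N such that

$$\Bigl(1 - \frac{1}{n}\sum_{k=1}^{n} d_k\Bigr)^{n} \le (1 - \delta n^{-1}\log n)^{n}$$

for all $n > N$. But $(1 - \delta n^{-1}\log n)^{n} \le n^{-\delta}$ for all $n > N$, which follows from
$\log(1 - \delta n^{-1}\log n) \le -\delta n^{-1}\log n$. Therefore

$$\sum_{n=1}^{\infty}\Bigl(1 - \frac{1}{n}\sum_{k=1}^{n} d_k\Bigr)^{n}$$

is convergent by the comparison test.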

To complete the proof that the ratio-type convergence test described in
Theorem 1 properly contains half of the Gauss convergence test, we submit the
following example. Let $S = \{s_1, s_2, \cdots, s_j\}$ be a set consisting of j positive real
numbers. One can construct sequences $\{a_n\}$ inductively in the following way:
Let $a_1 = 1$ and let $a_{n+1} = s\,a_n$ for every positive integer n, where s is picked at random
from the set S. We will show that if $\sum_{k=1}^{j} s_k < j$, then with probability one
the series $\sum a_n$ formed from a sequence constructed as above is convergent.

The proof is as follows: for $p > 0$,

$$j^{-1}\sum_{k=1}^{j} s_k = 1 - p$$

implies with probability one, by the Strong Law of Large Numbers [2], that
there exists N such that $n^{-1}\sum_{k=1}^{n} d_k > p/2$ for all $n > N$. Therefore,

$$\sum_{n=1}^{\infty}\Bigl(1 - \frac{1}{n}\sum_{k=1}^{n} d_k\Bigr)^{n}$$

is convergent with probability one. Therefore, $\sum_{n=1}^{\infty} a_n$ is convergent with
probability one.
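
A small simulation gives a feel for this example. It is only a sketch under assumed values: the set S = {0.5, 0.5, 1.2} (mean below 1, one member above 1), the number of steps, and the seed are choices made here, and the function name `random_ratio_series` is hypothetical.

```python
import math
import random

def random_ratio_series(S, steps, seed=0):
    """Build a_1 = 1, a_{n+1} = s * a_n with s drawn uniformly from S.
    Returns (last term, partial sum, number of steps on which the term grew)."""
    rng = random.Random(seed)
    a, total, ups = 1.0, 1.0, 0
    for _ in range(steps):
        s = rng.choice(S)
        if s > 1:
            ups += 1          # a step where a_{n+1} > a_n
        a *= s
        total += a
    return a, total, ups

last, partial_sum, ups = random_ratio_series([0.5, 0.5, 1.2], steps=500)
```

The terms shrink geometrically on average even though they repeatedly increase, which is why such a sequence is not eventually monotonic.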

If $j^{-1}\sum_{k=1}^{j} s_k < 1$ and there exists an $s_i$ in S such that $s_i > 1$, then with probability
one a sequence constructed as above is not "eventually monotonic."
Therefore, the Gauss convergence test does not apply to almost all (probability
one) such sequences. In fact, the whole scale of ratio tests will fail to show almost
all of the above sequences are convergent [3].


References 


1. K. Knopp, Theory and Application of Infinite Series, Blackie, London, 1948. 

2. W. Feller, An Introduction to Probability Theory and Its Applications, Vol. II, Wiley, 
London, 1965, p. 233. 

3. T. Bromwich, An Introduction to the Theory of Infinite Series, Macmillan, London, 1949, 
p. 42. 


SOLUTIONS OF $\phi(x) = n$, WHERE $\phi$ IS EULER'S $\phi$-FUNCTION


K. W. WEGNER AND S. R. SAVITZKY, Carleton College


There are simple formulas for evaluating Euler's $\phi$-function. Finding values
of x for $\phi(x) = n$ is more complicated. ($\phi(x) = 1440$ has 72 solutions.)

R. D. Carmichael [Journal of Mathematics, 30(1908) 394-400] listed all the
solutions of $\phi(x) = n$ for n up to 1000.

The computer has revealed the following errors in Carmichael's table:

For n = 768, add solutions 1785 and 3570.

For n = 792, delete 2384, add 2388.

For n = 888, add 1043 and 2086.

For n = 960, add 1309 and 2618.

For n = 972, add 1467 and 2934.
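
These corrections are easy to confirm today. The sketch below is not the authors' program; it computes $\phi$ by trial division and searches for solutions by brute force up to a caller-supplied bound, with the function names `phi` and `solutions` chosen here for illustration.

```python
def phi(x):
    """Euler's phi-function via trial-division factorization."""
    result, n, p = x, x, 2
    while p * p <= n:
        if n % p == 0:
            while n % p == 0:
                n //= p
            result -= result // p      # multiply result by (1 - 1/p)
        p += 1
    if n > 1:                          # one prime factor left over
        result -= result // n
    return result

def solutions(n, bound):
    """All x <= bound with phi(x) = n (brute force; bound is an assumption)."""
    return [x for x in range(1, bound + 1) if phi(x) == n]

corrections = {768: (1785, 3570), 792: (2388,), 888: (1043, 2086),
               960: (1309, 2618), 972: (1467, 2934)}
confirmed = all(phi(x) == n for n, xs in corrections.items() for x in xs)
```

For example, `phi(2384)` is 1184 rather than 792, which agrees with the instruction to delete 2384.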

All solutions have been calculated for x up to 1978. They may be obtained 
by writing to the Department of Mathematics, Carleton College, Northfield, 
Minnesota 55057. 


RESEARCH PROBLEMS 
EDITED BY VICTOR KLEE 


In this Department the Monthly presents easily stated research problems dealing with notions
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied
by relevant references (if any are known to the author) and by a brief description of known
partial results. Manuscripts should be sent to Victor Klee, Department of Mathematics,
University of Washington, Seattle, WA 98105.


WHICH ISOPERIMETRIC RATIOS ARE BOUNDED? 


VICTOR KLEE, University of Washington


A subset of a Euclidean space is called a polytope provided that it is the convex
hull of a finite set of points or, equivalently, is the bounded intersection of a
finite number of closed halfspaces. A d-polytope is a polytope P that is d-dimensional.
For $0 \le k < d$, a k-face of P is a k-dimensional set, necessarily a k-polytope,
which is the intersection of P with a supporting hyperplane; the only d-face
of P is P itself. For $0 \le k \le d$, let $\zeta_k(P)$ denote the sum of the k-measures of the
various k-faces of P. Thus $\zeta_0(P)$ is the number of vertices of P, $\zeta_1(P)$ is the sum
of the lengths of P's edges, $\zeta_2(P)$ is the sum of the areas of P's 2-faces, $\cdots$,
$\zeta_{d-1}(P)$ is the (d−1)-dimensional surface area of P, and $\zeta_d(P)$ is the d-dimensional
volume of P. (Note that if K is a k-face of P and F is the smallest flat
containing K, then F may be regarded as a copy of Euclidean k-space $E^k$ and the
k-measure of K may therefore be taken in the sense of Lebesgue.)

Let $I_d$ denote the set of all pairs (i, j) of distinct integers between 1 and d.
For $(i, j) \in I_d$, the (i, j)-isoperimetric ratio of P is defined as

$$\rho_{i,j}(P) = \zeta_i(P)^{1/i} / \zeta_j(P)^{1/j}.$$

Note that $\zeta_k(\lambda P) = |\lambda|^k \zeta_k(P)$, so that $\rho_{i,j}(\lambda P) = \rho_{i,j}(P)$ for all $\lambda \ne 0$; hence the
ratio $\rho_{i,j}(P)$ depends only on the shape and not on the size of P. The following is
a precise statement of the title question:


For which triples (d, i, j) is the isoperimetric ratio $\rho_{i,j}(P)$ bounded above as P
ranges over all d-polytopes?


For any such triple there is interest in determining the least upper bound

$$\beta(d, i, j) = \sup\{\rho_{i,j}(P) : P \text{ is a } d\text{-polytope}\}$$

and thus obtaining an analogue of the classical isoperimetric inequality (which
asserts $\beta(d, d, d-1) = (d\,\omega_d^{1/d})^{-1/(d-1)}$, where $\omega_d$ is the volume of a d-dimensional
spherical ball of unit radius).

Eggleston, Grünbaum, and Klee [2] show $\beta(d, i, j)$ is finite if i = d or i = d−1 > j
or i is a multiple of j. Other results related to the finiteness of $\beta(d, i, j)$ appear
in [2] and in Larman and Mani [3]. However, the finiteness of $\beta(d, i, j)$ is unsettled
whenever $d - 2 \ge i > j \ge 2$ and i is not a multiple of j. In particular, the finiteness
of $\beta(d, d-2, d-3)$ is unsettled for all $d \ge 5$. Though the supremum $\beta(d, d, d-1)$
is finite, it is not attained by any polytope; Grünbaum conjectures that all
other finite bounds are attained.

Even when the number $\beta(d, i, j)$ is known to be finite, its exact value has
been determined only for the cases corresponding to the classical isoperimetric
inequality. In particular, the values of $\beta(d, d-1, 1)$ and $\beta(d, d, 1)$ are unknown
for all $d \ge 3$. Aberth [1] shows $\beta(3, 2, 1) \le (6\pi)^{-1/2}$. Melzak [4] conjectures
$\beta(3, 3, 1) \le 2^{-2/3}3^{-11/6}$ with equality only for a right prism whose base is an
equilateral triangle having side-length equal to the height of the prism.

The title question concerns the partition of $I_d$ into the two sets

$$I_d^{<\infty} = \{(i, j) \in I_d : \beta(d, i, j) < \infty\} \quad\text{and}\quad I_d^{\infty} = \{(i, j) \in I_d : \beta(d, i, j) = \infty\}.$$

Note that

(1) A member (i, j) of $I_d$ belongs to $I_d^{\infty}$ if and only if there is a sequence
$P_n$ (n = 1, 2, $\cdots$) of d-polytopes such that $\zeta_j(P_n)$ is bounded above while
$\zeta_i(P_n)$ tends to infinity.

We close by showing

(2)    $I_d^{\infty} \supset \{(i, j) \in I_d : i < j\}$

and are tempted to conjecture that equality holds in (2), as it surely does when
$d \le 4$. The validity of (2) for all d is an easy consequence of the fact that always

(3)    $I_{d+1}^{\infty} \supset I_d^{\infty} \cup \{(i, d+1) : (i-1, d) \in I_d^{\infty}\}.$

For any d-polytope Q in $E^d$ and any r > 0, let $Q^r$ denote the prism $Q \times [0, r]$, a
(d+1)-polytope in $E^{d+1}$. From a description of the facial structure of $Q^r$ in
terms of that of Q, it follows that

(4a)    $\zeta_k(Q^r) = 2\zeta_k(Q) + r\zeta_{k-1}(Q)$ for $d \ge k \ge 1$, while
(4b)    $\zeta_{d+1}(Q^r) = r\zeta_d(Q)$.
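
The prism identities, read as $\zeta_k(Q^r) = 2\zeta_k(Q) + r\zeta_{k-1}(Q)$ for $1 \le k \le d$ and $\zeta_{d+1}(Q^r) = r\zeta_d(Q)$, can be tested on rectangular boxes, where every face measure has a closed form. The sketch below is an illustration only; restricting to boxes and the function name `zeta` are assumptions made here.

```python
from itertools import combinations
from math import isclose, prod

def zeta(sides, k):
    """Total k-measure of the k-faces of a box with the given side lengths.
    Each choice of k directions spans 2**(d - k) parallel k-faces."""
    d = len(sides)
    return 2 ** (d - k) * sum(prod(c) for c in combinations(sides, k))

Q = (1.0, 2.0, 3.0)        # a 3-dimensional box
r = 0.5
Qr = Q + (r,)              # the prism Q x [0, r] is a 4-dimensional box

d = len(Q)
identity_4a = all(
    isclose(zeta(Qr, k), 2 * zeta(Q, k) + r * zeta(Q, k - 1))
    for k in range(1, d + 1)
)
identity_4b = isclose(zeta(Qr, d + 1), r * zeta(Q, d))
```

Here `zeta(Q, 0)` is 8, the number of vertices, and `zeta(Q, 3)` is the volume 6.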


To see that $I_d^{\infty} \subset I_{d+1}^{\infty}$, consider an arbitrary $(i, j) \in I_d^{\infty}$, let $P_n$ be as in (1),
and use (4a) to see that if $r_n$ tends sufficiently rapidly to 0 then $\zeta_j(P_n^{r_n})$ is bounded
above while $\zeta_i(P_n^{r_n})$ tends to $\infty$. To complete the proof of (3), suppose
$(i-1, d) \in I_d^{\infty}$, let $P_n$ be such that $\zeta_d(P_n)$ is bounded above while $\zeta_{i-1}(P_n)$ tends to $\infty$,
and use (4a) and (4b) to see that if $r_n$ tends sufficiently rapidly to $\infty$ then
$\zeta_{d+1}(P_n^{r_n})$ is bounded above while $\zeta_i(P_n^{r_n})$ tends to $\infty$.
CLASSROOM NOTES
EDITED BY DAVID DRASIN


Manuscripts for this Department should be sent to David Drasin, Division of Mathematical 
Sciences, Purdue University, Lafayette, IN 47907. 


THE CONSTRUCTION OF SECOND ORDER DIFFERENTIAL EQUATIONS WITH 
SOLUTIONS HAVING GIVEN PROPERTIES 


W. R. Utz, University of Missouri 


The most common problem of differential equations is that of determining 
properties of solutions of given equations. Here we wish to consider the con- 
verse problem, especially for second order equations. That is, given functions 
of a specified kind we shall seek second order equations within a given class for 
which the given functions are solutions. Clearly, the class of equations will be 
limited since there is, trivially, always an equation y’’=f''(x) satisfied by any 
given twice differentiable function f(x). 

Suppose that at first we require that a twice differentiable function satisfy 
an equation 


(1)    $y'' + p(x)y = 0$


for some continuous p(x). That is, y(x) is given and we seek a continuous p(x)
so that the given y(x) satisfies (1) for this p(x). One examines

(2)    $\dfrac{-y''(x)}{y(x)}$

and observes that y(x) satisfies (1) if

$$p(x) = \frac{-y''(x)}{y(x)}$$

is, in fact, a continuous function. That is, if (2) has, at worst, removable singularities.
Clearly, this is impossible unless $y''(x)$ vanishes when y(x) vanishes.
Thus, $y = \sin x$ satisfies an equation (1) for all x while $y = \sin x^2$ does not.

The function $y = \sin x^2$ (more generally, $y = A\sin x^2 + B\cos x^2$) satisfies

(3)    $xy'' - y' + 4x^3y = 0.$

Thus all solutions of (3) are oscillatory and bounded.
If $\phi(x) \in C^2$, then $y = A\cos\phi(x) + B\sin\phi(x)$ satisfies

(4)    $\phi'(x)y'' - \phi''(x)y' + (\phi'(x))^3y = 0$

and all solutions of (4) are bounded.
If $\phi(x) \in C^2$, then $y = A\cosh\phi(x) + B\sinh\phi(x)$ satisfies

(5)    $\phi'(x)y'' - \phi''(x)y' - (\phi'(x))^3y = 0.$
Equations (3), (4), (5) illustrate the following theorem, the proof of which is
immediate.


THEOREM. If $y = y(x)$ satisfies $y'' + b(x)y' + c(x)y = g(x)$ for all real x and if
$u(x) \in C^2$ for all real x, then $y = y(u(x))$ satisfies

$$u'(x)y'' - [u''(x) - b(u(x))(u'(x))^2]y' + c(u(x))(u'(x))^3y = (u'(x))^3g(u(x)).$$


This theorem, in special cases, is helpful in providing equations with solutions
having a prescribed property. For example, to secure an equation having
a solution with a prescribed number of zeros, let $\phi(x) = (2n+1)\arctan x$ in (4)
to secure the equation

(4′)    $(1 + x^2)^2y'' + 2x(1 + x^2)y' + (2n + 1)^2y = 0.$

Solutions of (4′) include

$$y = \sin[(2n + 1)\arctan x]$$

with an odd number, 2n+1, of zeros and

$$y = \cos[(2n + 1)\arctan x]$$

with an even number, 2n, of zeros. All solutions of (4′) are bounded.

By appropriate choices of $\phi(x)$ in (4) and (5) one is able to provide equations
with solutions having other specified properties. As further examples, solutions
of (3) are bounded and oscillatory but not periodic; (5) with $\phi(x) = x\sin x$ has
an unbounded, oscillatory solution $y = \sinh(x\sin x)$ and an unbounded
nonoscillatory solution $y = \cosh(x\sin x)$.
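
One way to gain confidence in such constructions is to substitute a candidate solution numerically. The sketch below is an illustration, not part of the note: it approximates derivatives by central differences (an assumed technique, with an assumed step size) and evaluates the left side of the equation $\phi'y'' - \phi''y' + (\phi')^3y = 0$ for a given $\phi$ and y.

```python
import math

def derivs(f, x, h=1e-4):
    """Central-difference approximations to f'(x) and f''(x)."""
    d1 = (f(x + h) - f(x - h)) / (2 * h)
    d2 = (f(x + h) - 2 * f(x) + f(x - h)) / (h * h)
    return d1, d2

def residual(phi, y, x):
    """Left side of phi' y'' - phi'' y' + (phi')**3 y = 0, evaluated at x."""
    p1, p2 = derivs(phi, x)
    y1, y2 = derivs(y, x)
    return p1 * y2 - p2 * y1 + p1 ** 3 * y(x)

# phi(x) = x**2 with y = sin(x**2)
phi_a = lambda x: x * x
y_a = lambda x: math.sin(phi_a(x))

# phi(x) = (2n + 1) arctan x with n = 2 and y = cos(5 arctan x)
phi_b = lambda x: 5 * math.atan(x)
y_b = lambda x: math.cos(phi_b(x))

res_a = [residual(phi_a, y_a, x) for x in (0.5, 1.0, 1.5)]
res_b = [residual(phi_b, y_b, x) for x in (-2.0, 0.3, 4.0)]
```

Both residual lists are zero up to discretization error, while a function that is not a solution (for instance $y = \sin x$ with $\phi(x) = x^2$) gives a residual of order one.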

All of the above examples are linear. We shall now consider the discovery of 
nonlinear equations with some solutions of a special kind. 

The equation

(6)    $x'' + \omega^2x = 0$

has the solutions $x = A\sin(\omega t + \theta)$, wherein A and $\theta$ are arbitrary. John Brock
[1] observes that since

$$x' = A\omega\cos(\omega t + \theta) \quad\text{and}\quad x'' = -A\omega^2\sin(\omega t + \theta)$$

one has

$$\omega^2 = \frac{-x''}{x} \quad\text{and}\quad A^2 = x^2 - \frac{x\,x'^2}{x''}$$

independently of the differential equation (6). Hence if f(u, v) is any function
at all and $\omega^2$ and $A^2$ are selected so that $f(\omega^2, A^2) = 0$, then $x = A\sin(\omega t + \theta)$,
for this choice of A and $\omega$, are ($\theta$ is arbitrary) solutions of the differential equation

(7)    $f\Bigl(\dfrac{-x''}{x},\; x^2 - \dfrac{x\,x'^2}{x''}\Bigr) = 0.$
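
Brock's two invariants, $-x''/x = \omega^2$ and $x^2 - x\,x'^2/x'' = A^2$ along $x = A\sin(\omega t + \theta)$, can be confirmed by direct evaluation. The sketch below uses the exact derivatives of the sine solution; the function name `brock_invariants` and the sample parameter values are assumptions for illustration.

```python
import math

def brock_invariants(A, omega, theta, t):
    """Evaluate -x''/x and x**2 - x*x'**2/x'' along x = A sin(omega t + theta).
    Valid only where x (and hence x'') is nonzero."""
    x = A * math.sin(omega * t + theta)
    x1 = A * omega * math.cos(omega * t + theta)
    x2 = -A * omega ** 2 * math.sin(omega * t + theta)
    omega_sq = -x2 / x
    A_sq = x * x - x * x1 * x1 / x2
    return omega_sq, A_sq

samples = [brock_invariants(1.5, 2.0, 0.3, t) for t in (0.1, 0.7, 1.9)]
```

Every sample returns the same pair, $(\omega^2, A^2) = (4, 2.25)$, regardless of t, so any relation $f(\omega^2, A^2) = 0$ turns into a differential equation satisfied by the corresponding sine curves.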
Thus, (7) provides a source of nonlinear equations permitting simple harmonic
motion. Brock devotes particular attention to the special case


2 
xl! +. Gs — =) —_— 0 
rad 


One may apply the device of Brock to other equations. For example,

(8)    $x'' - \omega^2x = 0$

is satisfied by $x = A\sinh(\omega t + \theta)$ wherein A and $\theta$ are arbitrary. In this case,
$\omega^2 = x''/x$, $A^2 = -x^2 + x\,x'^2/x''$. Thus, if A and $\omega$ are selected so that $f(\omega^2, A^2) = 0$,
then $x = A\sinh(\omega t + \theta)$ satisfies

$$f\Bigl(\frac{x''}{x},\; -x^2 + \frac{x\,x'^2}{x''}\Bigr) = 0.$$

As another example, the spring equations $x'' + ax + bx^3 = 0$ are satisfied by
the Jacobian elliptic functions [2]. The method of Brock may be used to find
other nonlinear equations satisfied by these functions.

In particular, 


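(9)    $x'' + bx^3 = 0, \quad b > 0$


(a hard spring) is satisfied by $x = A\,\mathrm{cn}(\omega t + \theta)$, where $k = \sin\alpha = 1/\sqrt{2}$ (the modulus
of the elliptic functions to be used) and $\omega^2 = bA^2$. The A and $\theta$ may be selected
arbitrarily. Keeping in mind that $\sin\alpha = 1/\sqrt{2}$, we have

$$x = A\,\mathrm{cn}(\sqrt{b}\,|A|\,t + \theta)$$
$$x' = -A|A|\sqrt{b}\;\mathrm{sn}(\sqrt{b}\,|A|\,t + \theta)\,\mathrm{dn}(\sqrt{b}\,|A|\,t + \theta)$$
$$x'' = -A^3b\;\mathrm{cn}^3(\sqrt{b}\,|A|\,t + \theta)$$

from which we have (independently of equation (9))

$$b = \frac{-x''}{x^3}, \qquad A^4 = x^4 - \frac{2x^3x'^2}{x''}.$$

Thus, if f(u, v) is any function and b, A are selected so that $f(b, A^4) = 0$, then

$$f\Bigl(\frac{-x''}{x^3},\; x^4 - \frac{2x^3x'^2}{x''}\Bigr) = 0$$

has, for this b and A, the two-parameter family of periodic solutions

$$x = A\,\mathrm{cn}(\sqrt{b}\,|A|\,t + \theta).$$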
A COUNTEREXAMPLE TO A LEMMA ON THE EXISTENCE OF SIMPLE REFINEMENTS


R. E. SMITHSON, University of Wyoming


Let X be a topological space, and let $x, y \in X$. Then a simple chain from x
to y is a finite collection $\{L_1, \cdots, L_k\}$ of subsets of X such that $x \in L_1$, $x \notin L_i$,
$i \ne 1$, $y \in L_k$, $y \notin L_i$, $i \ne k$, and $L_i \cap L_j \ne \emptyset$ if and only if $|i - j| \le 1$. Then a simple
chain $\mathcal{C} = \{C_1, \cdots, C_n\}$ is called a simple refinement (Hocking and Young
[1] use the term "straight through" in this context) of a simple chain $\mathcal{L} = \{L_1, \cdots, L_k\}$ if

(1) for each $C_i \in \mathcal{C}$, there is an $L_j \in \mathcal{L}$ such that $C_i \subset L_j$, and

(2) if $C_i, C_j \subset L_m$, and if $i < t < j$, then $C_t \subset L_m$.

Then in both [1] and [2] below the following lemma is stated. The lemma is then
used to prove that every locally compact, connected, locally connected metric space
is arcwise connected, and hence, this lemma is quite significant.


LEMMA. Let $\mathcal{C}$ be a simple chain of connected open sets in X from x to y. If $\mathcal{U}$
is a collection of open sets such that each member of $\mathcal{C}$ is the union of members of $\mathcal{U}$,
then there is a simple chain consisting of members of $\mathcal{U}$ from x to y which is a simple
refinement of $\mathcal{C}$.


Note. In the statement of this lemma in Hocking and Young [1, p. 116] 
there were some additional hypotheses given but they were not used in the 
alleged proof. 
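
The two definitions are easy to test mechanically when the links are finite sets. The sketch below is only a finite, combinatorial illustration (no topology is modeled), and it assumes condition (2) is read as: the links of the refinement contained in a given link of the coarser chain occupy consecutive positions. The function names are hypothetical.

```python
def is_simple_chain(links, x, y):
    """links: list of Python sets. x lies only in the first link, y only in
    the last, and two links meet exactly when their indices differ by at most 1."""
    k = len(links)
    if k == 0:
        return False
    for i, L in enumerate(links):
        if (x in L) != (i == 0) or (y in L) != (i == k - 1):
            return False
    return all(bool(links[i] & links[j]) == (abs(i - j) <= 1)
               for i in range(k) for j in range(k))

def is_simple_refinement(C, L):
    """Condition (1): every C_i lies in some L_m.
    Condition (2), as read here: for each L_m, the indices i with C_i inside
    L_m form a run of consecutive integers."""
    if not all(any(c <= l for l in L) for c in C):
        return False
    for l in L:
        idx = [i for i, c in enumerate(C) if c <= l]
        if idx and idx != list(range(idx[0], idx[-1] + 1)):
            return False
    return True

L_good = [{0, 1, 2}, {2, 3, 4}]
C_good = [{0, 1}, {1, 2}, {2, 3}, {3, 4}]

# A chain that leaves the first coarse link and later re-enters it,
# loosely mirroring the disconnected intersection in the counterexample.
L_bad = [{0, 1, 2, 3}, {1, 2, 3, 7, 9}]
C_bad = [{0, 1}, {1, 7}, {7, 2}, {2, 3}, {3, 9}]
```

`C_bad` is a simple chain from 0 to 9 and satisfies condition (1), yet it fails condition (2) because its first and fourth links lie in the first coarse link while the two between them do not.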

We now give a counterexample to the above lemma. All sets given below are 
subsets of the plane R? and the topology used is the relative topology. 

Let X1= { (x, 0): —2Sx<2}, Xo= { (x, y): x?-+-y?=1 and y=0} and set 
X =X, X>. Also let Y; = { (x, y)ECXe:x<0 and Osy<3} and Y,= { (x, y)ECXe: 
*>0 and 0Sy<}. Then let 


Cy= ViU {(%,0): -252<3}U PL, 
and 
Co = {(«,0): ~<a < —2}UXLU (4,0): <a SQ}. 


Then $C_1$ and $C_2$ are open connected subsets of X and $\{C_1, C_2\}$ is a simple chain
from (−2, 0) to (2, 0) in X. But if $\mathcal{U}$ consists of open subsets of X whose diameters
are less than 4, then condition (2) in the definition cannot be satisfied.
This is because $C_1 \cap C_2$ is not connected and each component of $C_1 \cap C_2$ will
contain a member of any simple chain from $\mathcal{U}$. But in chaining from one component
of $C_1 \cap C_2$ to the other by members of $\mathcal{U}$ there will be at least one link
either not contained in $C_1$ or not contained in $C_2$.

The difficulty in the “proof” given in [1] and [2] is that the authors did not 
consider the situation that occurs in the above example. To overcome this 
problem replace condition (2) above by the following condition. (This condition 
was suggested by the referee.) 

(2’). If C;, C;CL, and if +<t<j, then CL. L, Le. 
Then the proofs given in [1] and [2] are valid when (2) is replaced by (2′),
and thus the lemma is true when conditions (1) and (2′) are used in the definition
of a simple refinement, and the major theorem on arcwise connectedness
can also be proved from conditions (1) and (2′).
AN ELEMENTARY DISCUSSION OF THE TRANSCENDENTAL NATURE OF THE
ELEMENTARY TRANSCENDENTAL FUNCTIONS


R. W. HAMMING, Bell Telephone Laboratories


When the elementary transcendental functions are introduced in the cal- 
culus course, it is usually stated that they are not algebraic functions, but little 
indication is given either as to what this means or how it can be proved. The 
purpose of this note is to fill this gap partially. 

Probably the most convenient approach to this matter is the increasingly
common path of introducing ln x as

$$\ln x = \int_1^x dt/t,$$

which is equivalent to $d/dx(\ln x) = 1/x$; $\ln 1 = 0$.
We first prove that ln x is not a rational function, that is,

$$\ln x \ne \frac{N(x)}{D(x)},$$

where N(x) and D(x) are polynomials with no common factor. If it were a
rational function, then upon differentiating both sides of the equality we would
have $1/x = (DN' - ND')/D^2$, or $D^2 = x(DN' - ND')$. We see that D(x) has a
factor x. Let

$$D(x) = x^kD_1(x), \qquad D_1(0) \ne 0, \qquad k \ge 1.$$

Substituting and dividing out $x^k$, we get

$$x^kD_1^2 = xD_1N' - kND_1 - xND_1',$$

from which we see that N(x) is divisible by x. The common factor of x in both
N(x) and D(x) leads to a contradiction.
We are now ready to prove that

$$y = \ln x$$

is not algebraic, that is, there is no polynomial in x and y with real or complex
coefficients such that f(x, y) = 0, or what is the same thing,

$$\sum_{k=0}^{N} P_k(x)\,y^k = 0, \qquad P_N(x) \ne 0,$$

where the $P_k(x)$ are polynomials in x. There is an essentially unique equation
of minimum degree N because if there were two equations of the same degree
differing by more than a multiplicative factor then by eliminating the highest
power of ln x between them we would have a lower degree equation. Assuming
we have chosen the function f(x, y) for which N is the smallest possible we can
write the equation as

$$(\ln x)^N + \frac{P_{N-1}}{P_N}(\ln x)^{N-1} + \cdots + \frac{P_0}{P_N} = 0, \qquad N \ge 2,$$

and differentiate to get

$$N(\ln x)^{N-1} + x\Bigl(\frac{P_{N-1}}{P_N}\Bigr)'(\ln x)^{N-1} + \cdots = 0.$$

If all the terms in $(\ln x)^k$, k = 0, 1, $\cdots$, N−1, do not vanish identically, then
we have a lower degree polynomial (in ln x), a contradiction. If all the terms do
vanish, then in particular

$$N + x\Bigl(\frac{P_{N-1}}{P_N}\Bigr)' = 0.$$

Integrating this, we find that ln x is a rational function, which we just proved is
impossible. Hence, ln x is not an algebraic function.

There is a second idea of an algebraic function that the student needs to 
consider, namely that any finite combination of additions, subtractions, multi- 
plications, divisions, and radicals with rational exponents of algebraic functions 
is still algebraic. In particular the student asks, “Is it possible that 


Jat - 1 — iV/x? — 1 2/ x? — 2 £3 + x? 
eaa/, +1 + ON x? + 3 — (Rn? bo?) (A/a? + 1) 
or something like it?” Note we are excluding x* etc. 
We now indicate the proof that this second definition is included in the first
one. Consider the sum of two expressions, each of which is a root of a polynomial.
Let this sum be

$$\alpha(x) + \beta(x),$$

where $\alpha(x)$ is a solution of $f_1(x, y) = 0$ with the complete set of solutions

$$\alpha_1(x), \alpha_2(x), \cdots, \alpha_s(x).$$

Let $\beta(x)$ be a solution of $f_2(x, y) = 0$ with the complete set of solutions

$$\beta_1(x), \beta_2(x), \cdots, \beta_r(x).$$

Now consider the set of rs functions $\alpha_i(x) + \beta_j(x)$ and the corresponding polynomial
in y,

$$\prod_{i,j}(y - \alpha_i - \beta_j) = 0$$

having these rs functions as its roots. This is a symmetric function in both the $\alpha_i$
and the $\beta_j$. We now use the theorem that every rational symmetric function is
expressible rationally in terms of the elementary symmetric functions:

$$p_1 = \alpha_1 + \alpha_2 + \cdots + \alpha_s$$
$$p_2 = \alpha_1\alpha_2 + \alpha_1\alpha_3 + \cdots + \alpha_{s-1}\alpha_s$$
$$\cdots$$
$$p_s = \alpha_1\alpha_2\alpha_3\cdots\alpha_s.$$

But these are in turn rational expressions in the coefficients of $f_1(x, y) = 0$.
Similarly for the $\beta_j(x)$. Thus we have

$$\prod_{i,j}(y - \alpha_i - \beta_j)$$

as a rational expression in the coefficients of $f_1(x, y) = 0$, $f_2(x, y) = 0$ and assorted
integers that arise when the indicated algebraic manipulations are carried out.

Similar arguments show that differences, products, quotients, and radicals
of algebraic expressions are again algebraic, and we have therefore shown that
the second definition of an algebraic function is included in the first.
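
The product construction can be watched at work in the simplest numerical case. The sketch below is an illustration with constants rather than functions of x, an assumption made to keep it short: it takes the roots $\pm\sqrt{2}$ of $y^2 - 2$ and $\pm\sqrt{3}$ of $y^2 - 3$, forms the product over all sums of one root from each, and recovers integer coefficients.

```python
import math

def poly_from_roots(roots):
    """Coefficients (highest degree first) of the monic polynomial
    with the given roots, built by repeated multiplication by (y - r)."""
    coeffs = [1.0]
    for r in roots:
        new = coeffs + [0.0]            # multiply by y
        for i, c in enumerate(coeffs):
            new[i + 1] -= r * c         # subtract r times the old polynomial
        coeffs = new
    return coeffs

alphas = [math.sqrt(2), -math.sqrt(2)]   # roots of y^2 - 2
betas = [math.sqrt(3), -math.sqrt(3)]    # roots of y^2 - 3

sums = [a + b for a in alphas for b in betas]
coeffs = [round(c) for c in poly_from_roots(sums)]

def evaluate(cs, y):
    """Horner evaluation, highest degree first."""
    value = 0.0
    for c in cs:
        value = value * y + c
    return value
```

The rounded coefficients are those of $y^4 - 10y^2 + 1$, a polynomial with integer coefficients having $\sqrt{2} + \sqrt{3}$ as a root, as the symmetric-function argument predicts.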

We now turn to the inverse function of ln x, namely $e^x$. Since ln x does not
satisfy any polynomial

$$f(x, y) = 0$$

we have merely to set $x = e^t$ to get $f(e^t, t) = 0$ as an equivalent impossibility.
For sin x, cos x, tan x, etc., if one of them, say sin x, satisfied

$$f(x, \sin x) = 0,$$

then the polynomial f(x, 0) = 0 would have an infinite number of zeros, namely
$x = 0, \pm\pi, \pm 2\pi, \cdots$. Thus since the trigonometric functions have an infinite
number of zeros they cannot be algebraic functions.

From the argument we used for the exponential function, we see that the 
corresponding inverse functions arcsin x, arccos x, arctan x, etc. also cannot be 
algebraic functions. 

This presentation, except possibly that of the inclusion of the second defini- 
tion of algebraic functions in the first, is readily presented in a calculus course to 
the better prepared students; the less prepared usually don’t care, being willing 
to believe that the functions are transcendental (not algebraic). 

Using similar methods, and slight extensions of them, integrals like 
$$\int_0^x e^{-t^2}\,dt, \qquad \int_x^{\infty}(e^{-t}/t)\,dt$$


can be shown to be transcendental. However, at this point in the development 
the real question is: Can these new functions be expressed as finite combinations 
of the algebraic and elementary transcendental functions we now have on hand? 
Unfortunately the above simple methods seem to be inadequate for this purpose. 
(See J. F. Ritt, Integration in Finite Terms, Columbia University Press, 1948, 
for a more sophisticated and more powerful treatment.) 


Thanks for help in discussing this topic are due to A. J. Goldstein and Jessie MacWilliams. 


MATHEMATICAL EDUCATION 


EDITED BY J. G. HARVEY AND M. W. POWNALL


Material for this Department should be sent to either of the editors: J. G. Harvey, Department 
of Mathematics, University of Wisconsin, Madison, WI 53706; M. W. Pownall, Department 
of Mathematics, Colgate University, Hamilton, NY 13346. 


ON THE MATHEMATICS CURRICULUM IN BLACK COLLEGES 


A. J. SCAVELLA,† Tuskegee Institute


It is a pleasure and an honor to have been invited to address this august 
body of mathematicians on this occasion. When asked to participate, I was 
curious to know why. If in me Professor Stewart desired a first-rate mathe- 
matician, then I do not qualify; if in me he desired someone dedicated to the 
education of deprived youngsters, then perhaps I do qualify; if in me he de- 
sired someone black—then my Italian name and Jewish nose notwithstanding, 
I know I qualify! 

On April 24-26, 1969, a Ford Foundation-sponsored Conference on Mathe- 
matics Curriculum was held at Morgan State College under the direction of 
Professor Walter Talbot, Chairman of the Department of Mathematics at that 
institution. 

The purpose of the Conference was to discuss matters relating to curriculum, 
with a view toward improving the effectiveness of the individual mathematics 
programs at black colleges. 

In attendance at the Conference were mathematics professors from twenty- 
seven colleges and universities with student bodies that are predominantly 
black—Afro-American, Negro, etc., and some resource persons, including Pro- 
fessors Richard Anderson and R. Creighton Buck as representatives from 
CUPM. 

This Conference was much needed and very timely—for never before has 
there been as much concern about the status of the mathematics curriculum 
in black colleges as exists today. The Ford Foundation, in sponsoring the Con- 


† We have just learned that Professor Scavella passed away on February 5. Editor.
A COMPARISON OF TWO APPROACHES TO TEACHING LIMIT THEORY
F. M. PAVLICK, Slippery Rock State College


Using the concept of a collection of advanced sets, E. J. McShane formulates
one definition of limit, lim f(t), that is general enough to include all types of
limits of real valued functions normally encountered in elementary calculus:
$\lim_{x\to a} f(x)$, $\lim_{x\to\infty} f(x)$, $\lim_{j\to\infty} b_j$ [2]. The advanced sets definition can be
applied to other concepts of the calculus which depend on the limiting process,
e.g., $\int_a^b f(x)\,dx$ and $f'(x)$.

The traditional approach to limits is to introduce each limit definition 
separately. Little mention is made of the relationship between the various 
definitions. One wonders whether the use of the unifier, advanced sets, would 
provide a beginning calculus student with a better understanding of the under- 
lying structure of the limit definitions by pointing out the properties common 
to each. 

On the other hand, the addition of this concept to the limit definitions may 
be too much for a student to handle, especially at the time of his first formal 
introduction to limits. With this in mind, the success of the advanced sets ap- 
proach may depend on the ability and achievement level of the individual 
calculus student, coupled with the question of generality versus specificity. 
What follows is a brief description of the author’s doctoral dissertation which 
was designed to investigate this problem. 

First, let us notice an important mathematical advantage of the all inclusive
definition proposed by McShane. Once an appropriate fundamental theorem
such as lim[f(t) + g(t)] = lim f(t) + lim g(t) has been proven in the general
setting, it can be used in all the special cases without having to be proven over
and over again. For example, one could conclude immediately that

$$\int_a^b [f(x) + g(x)]\,dx = \int_a^b f(x)\,dx + \int_a^b g(x)\,dx.$$

For a detailed discussion of this and other aspects of the advanced sets approach,
see [2, 3].


Description of the study. The main problem in this study was to determine
if there is a measurable difference in learning, as measured by an achievement
test, between students who studied a traditional $\varepsilon$, $\delta$ presentation of limit theory
(Treatment T) and students who studied an advanced sets approach to limits
(Treatment A). Treatment T consisted of a programed unit which introduced
the definitions, $\lim_{j\to\infty} b_j$ and $\lim_{x\to a} f(x)$, separately in the traditional manner.
Treatment A was achieved by means of a program which used the advanced
sets approach to limits.

Also included in the study was an attempt to determine if ability and
achievement level would contribute to a difference in learning from these two
treatments. To this end honors and nonhonors beginning calculus students at
The Florida State University were selected to represent two levels labeled
H and N respectively.

The students in each level were randomly assigned to one of the two treatments,
which resulted in a 2 × 2 levels × treatments design. The students read
the programed units in four study sessions. A test was administered during the
fifth session. This test was composed of two parts. Part I consisted of 23 multiple
choice items primarily concerned with finding or recognizing limits of certain
functions. Part II required the student to verify limits of some functions
and to provide proofs of some simple theorems. A randomized 2 × 2 levels ×
treatments analysis of variance was employed to test the following null hypotheses.


Null Hypotheses. 


1. There is no significant difference between the mean scores of Level H and Level N on the 
criterion test. 

2. There is no significant difference between the mean scores of Treatment Group A and 
Treatment Group T on the criterion test. 

3. There is no interaction between levels (H and N) and treatment groups (A and T). 


TABLE 1. Scores on Section I of the criterion test

Level  Treatment  Scores                                      Mean    Std. Dev.
H      A          17 18 19 20 21 21 22 22 23 23               20.6    2.06
H      T          15 16 16 16 17 20 21 21 21 22 22 22         19.08   2.81
N      A          9 10 11 11 12 13 13 14 14 14 15 16 16 18    13.28   2.52
N      T          11 12 12 13 14 16 17 17 17 18 19 19 20      15.77   3.03

Row means: Level H, 19.77; Level N, 14.48.
Column means: Treatment A, 16.33; Treatment T, 17.36.
TABLE 2. Analysis of variance of scores on Section I of the criterion test*

Source                Degrees of Freedom   Sum of Squares   Mean Square   F-ratio
Levels                 1                   340.58           340.58        48.12 (a)
Treatments             1                     2.82             2.82         0.40
Levels by Treatment    1                    48.24            48.24         6.82
Error                 45                   318.48             7.08

(a) Significant at the 0.05 level.
* Obtained by means of computer program O5V [1].


Results and conclusions. Table 1 gives a summary of scores by cell, treat- 
ment, and level for Part I of the criterion test. Table 2 is the source table for 
the analysis of variance performed on these scores. Since similar results were 
obtained for Part II of the test, these data are not included in this paper. 

Significant differences between levels results in rejection of hypothesis 1. 
This is not surprising since the honors class consists of select students. Signifi- 
cant interaction results in rejection of hypothesis 3 which suggests that the 
effectiveness of a particular treatment depends on the level of the individual 
student. The profile (Figure 1) indicates that the traditional approach may be 
favored by non-honors students and the advanced set approach by honors 
students. 

The t-statistic was used to compare cell means. A t > 2.06 was required for an
indication of significant difference with 95% confidence. The mean scores for
the Level N students studying Treatments A and T were 13.28 and 15.77
respectively. When these two means were compared, a t = 2.32 was obtained.
A Mann-Whitney U Test further confirmed this significant difference. Thus,
those nonhonors students who studied the traditional approach performed
significantly better on the criterion test than those who studied the advanced set
approach.
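
The reported t value can be recomputed from the Level N scores in Table 1. The sketch below assumes the comparison was a pooled-variance two-sample t-test (the paper does not name the variant); the helper name `pooled_t` is hypothetical.

```python
import math
from statistics import mean, variance

# Level N scores from Table 1
n_a = [9, 10, 11, 11, 12, 13, 13, 14, 14, 14, 15, 16, 16, 18]   # Treatment A
n_t = [11, 12, 12, 13, 14, 16, 17, 17, 17, 18, 19, 19, 20]      # Treatment T

def pooled_t(x, y):
    """Two-sample t statistic with pooled variance (assumed variant);
    returns (t, degrees of freedom)."""
    nx, ny = len(x), len(y)
    df = nx + ny - 2
    pooled_var = ((nx - 1) * variance(x) + (ny - 1) * variance(y)) / df
    se = math.sqrt(pooled_var * (1 / nx + 1 / ny))
    return (mean(y) - mean(x)) / se, df

t_value, df = pooled_t(n_a, n_t)
```

Under that assumption the statistic comes out close to the reported 2.32 with 25 degrees of freedom, for which the two-sided 5% critical value is about 2.06.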


[Figure: cell mean scores (vertical axis, 5 to 20) plotted against Level H and Level N (horizontal axis).]

FIG. 1. Profiles for the cell means of the scores on Section I of the criterion test.


One must keep in mind that this was a short term study performed with
only 49 students, but the results in general lead one to believe that the efficiency
of the advanced set approach to limits depends on the ability and achievement
level of the calculus student. Students at a high level when taught by the advanced
set approach should do at least as well as when taught by the traditional
approach. However, students of average achievement would profit more from
the traditional strategy of instruction.


Discussion. One possible explanation for these results may be found in 
whole-part vs. part-whole learning theory. The traditional approach follows a 
“by parts” strategy whereas the advanced set approach has many characteris- 
tics of a “whole-part” teaching procedure. Seagoe and Symonds [4, 5] theorize 
a reciprocal relationship between the ability of the learner and the difficulty of 
the subject matter. With subject matter of given difficulty the more advanced 
student can learn more effectively by the whole-part method of instruction. 
With students of given ability the easier subject matter can be learned more 
readily by the whole-part method. Evidently, the nonhonors students were able
to comprehend the limit concept presented in parts ($\lim_{j\to\infty} b_j$ and $\lim_{x\to a} f(x)$),
whereas they were less able to grasp the significance of the concept as a whole
(the all inclusive definition of limit presented in the advanced set approach).
The subject matter was too difficult for these students to learn it more ef- 
fectively by the whole method. For the honors students, on the other hand, it 
was at a level of difficulty at which they could actually profit from studying the 
more general approach. 

To the author’s knowledge, no calculus text defines a collection of advanced 
sets. However, many contemporary texts do incorporate the neighborhood 
concept into their limit definitions. Although the neighborhood approach is not 
so general as the advanced sets approach, it does require learning of additional 
terminology. Such a text could not be read with profit by (average) students 
at Slippery Rock. One utilizing the ε, δ approach is now being used with much 
better results. Thus, what was suggested by the results of the study seems to be 
applicable in practice. Evidently a calculus text should not be rated uncondi- 
tionally as “good” but rather “good for whom.” 
ELEMENTARY PROBLEMS 

E 2221. Proposed by Ioan Tomescu, University of Bucharest, Rumania 

Let ABC and A'B'C' be two triangles having sides a, b, c and a', b', c' respectively. If R is the circumradius of ABC, prove the inequality: 

$$\frac{a^2}{a'}+\frac{b^2}{b'}+\frac{c^2}{c'}\le R^2\,\frac{(a'+b'+c')^2}{a'b'c'}.$$

When does equality hold? 


E 2222. Proposed by R. M. Krause, National Science Foundation 

The following problem has been making the rounds at Berkeley, M. I. T., 
etc. Let N be any number between 0001 and 9998, excluding only 1111, 2222, 
etc. Arrange the digits in ascending order of magnitude and in descending order 
of magnitude and take the difference of the two resulting numbers. Call this 
difference T(N). Show that repeated applications of the operator T converge 
on the number 6174. Is generalization possible? 
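
The operator in E 2222 is easy to experiment with. The following is a minimal sketch, assuming that numbers are always padded to four digits (so 0001 has digits 0, 0, 0, 1) and that the difference is taken as descending minus ascending; the function names are illustrative only.

```python
def kaprekar_step(n: int) -> int:
    """Apply the operator once: (digits descending) - (digits ascending)."""
    digits = sorted(f"{n:04d}")            # leading zeros are kept
    ascending = int("".join(digits))
    descending = int("".join(reversed(digits)))
    return descending - ascending


def steps_to_6174(n: int, limit: int = 20) -> int:
    """Number of applications needed to reach 6174 (raises if limit is hit)."""
    for count in range(limit + 1):
        if n == 6174:
            return count
        n = kaprekar_step(n)
    raise ValueError("did not reach 6174 within the limit")


if __name__ == "__main__":
    # Every admissible start: 0001..9998 with at least two distinct digits.
    starts = [n for n in range(1, 9999) if len(set(f"{n:04d}")) > 1]
    print(max(steps_to_6174(n) for n in starts))
```

An exhaustive run over the admissible starting values is an experiment, not a proof, but it does show how few iterations are ever needed.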


E 2223. Proposed by M. Slater, University of Bristol, England 


A pack of cards is shuffled by starting with the first card and then placing 
successive cards alternately above and below the growing discard pile. After 
repeated shuffles of this type, will the pack first return to its original order when 
the original top card first returns to its top position? 




308 ELEMENTARY PROBLEMS AND SOLUTIONS [March 


E 2224. Proposed by D. P. Giesy, University of Southern California 

Suppose $\{a_n\}$ is a sequence of nonnegative real numbers such that $\limsup_{n\to\infty}(a_1+\cdots+a_n)/n<\infty$ and $\lim_{n\to\infty} a_n/n=0$. Does it necessarily follow that $\lim_{n\to\infty}(a_1^2+\cdots+a_n^2)/n^2=0$? 

E 2225. Proposed by Diane Comer and J. J. Le Tourneau, Fisk University 


Show that any positive integer S can be written in exactly k different ways 
as the sum of two or more consecutive positive integers (in increasing order), 
where k is the number of positive odd divisors of S greater than 1. 
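
The claim in E 2225 can be checked by brute force for small S. This is only a sketch with illustrative function names; it counts runs of consecutive positive integers directly and compares the count with the number of odd divisors exceeding 1.

```python
def consecutive_sum_count(s: int) -> int:
    """Count ways to write s as a sum of two or more consecutive positive integers."""
    ways = 0
    length = 2
    # A run of `length` terms starting at a >= 1 sums to length*a + length*(length-1)/2,
    # so the shortest possible total for this length is length*(length+1)/2.
    while length * (length + 1) // 2 <= s:
        remainder = s - length * (length - 1) // 2
        if remainder % length == 0:
            ways += 1
        length += 1
    return ways


def odd_divisors_greater_than_one(s: int) -> int:
    """Number of odd divisors d of s with d > 1."""
    return sum(1 for d in range(3, s + 1, 2) if s % d == 0)


if __name__ == "__main__":
    for s in (8, 9, 15, 45):
        print(s, consecutive_sum_count(s), odd_divisors_greater_than_one(s))
```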


E 2226. Proposed by M. S. Klamkin, Ford Scientific Laboratory 


If one altitude of a tetrahedron intersects two other altitudes, then all four 
altitudes are concurrent. 


E 2227. Proposed by N. S. Mendelsohn, University of Manitoba 

Find the greatest common divisor of 

$$\binom{2n}{1},\ \binom{2n}{3},\ \binom{2n}{5},\ \ldots,\ \binom{2n}{2n-1}.$$

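
For experimentation with E 2227, the greatest common divisor can be tabulated for small n. The sketch below reads the problem as asking for the gcd of the binomial coefficients of 2n with odd lower index; the function name is illustrative.

```python
from functools import reduce
from math import comb, gcd


def odd_binomial_gcd(n: int) -> int:
    """gcd of C(2n, 1), C(2n, 3), ..., C(2n, 2n-1)."""
    return reduce(gcd, (comb(2 * n, k) for k in range(1, 2 * n, 2)))


if __name__ == "__main__":
    for n in range(1, 17):
        print(n, odd_binomial_gcd(n))
```

Printing the first few values is a convenient way to form a conjecture before attempting a proof.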
SOLUTIONS OF ELEMENTARY PROBLEMS 


A Sum of Stirling Character 


E 2159 [1969, 300; corrected 1969, 690]. Proposed by G. M. Lee, San Mateo, 
California 

Prove: 

$$\sum_{r=1}^{n}\frac{(-1)^{r-1}}{r}\sum_{i=0}^{r}(-1)^{r-i}\binom{r}{i}\,i^{n}=0,$$

where n = 2, 3, 4, .... 
I. Solution by G. C. Dodds, State College, Pa. The given sum may be rewritten: 

$$\sum_{r=1}^{n}\sum_{i=0}^{r}(-1)^{i-1}\,\frac{1}{r}\binom{r}{i}\,i^{n}=\sum_{r=1}^{n}\sum_{i=0}^{r}(-1)^{i-1}\binom{r-1}{i-1}\,i^{n-1}.$$

By further rearrangement, and discarding the term for i = 0, 

$$\sum_{i=1}^{n}(-1)^{i-1}\,i^{n-1}\sum_{r=i}^{n}\binom{r-1}{i-1}.$$

Since 

$$\sum_{r=i}^{n}\binom{r-1}{i-1}=\binom{n}{i},$$

the given sum is 

$$(1)\qquad \sum_{i=1}^{n}(-1)^{i-1}\,i^{n-1}\binom{n}{i}.$$

Now, 

$$(1-e^{x})^{n}=\sum_{i=0}^{n}\binom{n}{i}(-1)^{i}e^{ix}.$$

Hence, the (n-1)th derivative of this expression, evaluated at x = 0, is (apart from sign) (1) above. 
But this derivative at x = 0 is clearly zero for n = 2, 3, ..., so that the desired 
sum is zero for n = 2, 3, .... 


II. Solution by John Riordan, Rockefeller University. The inner sum is $\Delta^{r}0^{n}$ 
or $r!\,S(n, r)$, with $S(n, r)$ the Stirling number of the second kind, and $(-1)^{r-1}(r-1)!$ is 
$s(r, 1)$, the Stirling number of the first kind. Hence the result is the instance 
m = 1 of the familiar orthogonal relation 

$$\sum_{r} S(n, r)\,s(r, m)=\delta_{nm},$$

the Kronecker delta; and $\delta_{n1}=0$ for $n>1$. 
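
The orthogonality relation used in Solution II can be checked numerically for small n. The sketch below computes Stirling numbers of the second kind from the standard recurrence and uses the closed form $s(r, 1)=(-1)^{r-1}(r-1)!$ quoted above; the function names are illustrative.

```python
from math import factorial


def stirling2(n: int, r: int) -> int:
    """Stirling number of the second kind via S(n, r) = r*S(n-1, r) + S(n-1, r-1)."""
    table = [[0] * (r + 1) for _ in range(n + 1)]
    table[0][0] = 1
    for row in range(1, n + 1):
        for col in range(1, min(row, r) + 1):
            table[row][col] = col * table[row - 1][col] + table[row - 1][col - 1]
    return table[n][r]


def orthogonality_sum(n: int) -> int:
    """Sum over r of S(n, r) * s(r, 1), with s(r, 1) = (-1)^(r-1) * (r-1)!."""
    return sum(
        stirling2(n, r) * (-1) ** (r - 1) * factorial(r - 1)
        for r in range(1, n + 1)
    )


if __name__ == "__main__":
    print([orthogonality_sum(n) for n in range(1, 9)])
```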


Also solved by Günter Bach (Germany), Anders Bager (Denmark), M. T. Bird, Michael 
Bousquet, C. A. Church, Jr., N. Ersec, A. F. Gentzel, F. Göbel & Tj. Plomp (Netherlands), M. G. 
Greening (Australia), J. C. Hickman, R. L. Jow, D. G. Kabe, J. Kaucky (Czechoslovakia), Harry 
Lass, Douglas Lind (England), O. P. Lossers (Netherlands), W. Moser, C. B. A. Peck, Richard 
Post, E. A. Power (Australia), M. L. Railkar (India), Simeon Reich (Israel), John Riordan, M. J. 
Rossin, E. F. Schmeichel, R. R. Seeber, C. V. L. Smith, F. C. Smith, DB?S, Stephen Spindler, 
Dragutin Surtan (Yugoslavia), T. Tamura (Japan), M. R. Wise, Chung-kiu Wong, and the pro- 
poser. 

Robert Breusch, L. Carlitz and Alexander Zujus solved assumed reformations of the problem 
other than the proposer’s corrected version. Earlier, Bager had proposed essentially this problem 
in Elemente der Mathematik; see solutions by O. Reutter, I. Paasche and L. Carlitz, 1963, 5. 


Quadratic Nonresidues Summing to Zero 


E 2173 [1969, 553]. Proposed by Emanuel Vegh, Naval Research Laboratory 

If p is a prime (p ≠ 2, 3, 5, 11, or 17) there are three distinct quadratic nonresidues of p whose sum is divisible by p. 


Solution by Joel Spencer, University of Maine. Since 7 | 3+5+6 and 
13 | 2+5+6, we assume p ≥ 19. Let g be a fixed nonresidue (mod p). Then 
a+b+c ≡ 0 (mod p) where a, b, c are distinct nonresidues if and only if 
ga+gb+gc ≡ 0 (mod p) where ga, gb, gc are distinct residues. We solve the latter 
equation. 

Case I. (-1 | p) = -1. 

Subcase A: If (2 | p) = -1, then (8 | p) = -1; so for one of n = 4, 5, 6, or 7, 

$$\left(\frac{n}{p}\right)=+1,\qquad \left(\frac{n+1}{p}\right)=-1,$$

giving 1 + n + (-1-n) ≡ 0 (mod p) where 1, n, -1-n are distinct residues. 

Subcase B: If (2 | p) = +1, let n be the smallest positive integer such that 

$$\left(\frac{n+1}{p}\right)=-1.$$

Since there are ½(p-1) nonzero residues, n ≤ ½(p-1). Since the residues are 
closed under multiplication, n < ½(p-1); so 1, n, -1-n are distinct residues 
whose sum is 0 (mod p). 

Case II. (-1 | p) = +1. 

If (5 | p) = +1, then 1+4+(-5) = 0 and 1, 4, -5 are distinct residues. If 

$$\left(\frac{5}{p}\right)=-1,\qquad \left(\frac{10}{p}\right)=+1,$$

then 1+9+(-10) = 0. Let 

$$\left(\frac{5}{p}\right)=\left(\frac{10}{p}\right)=-1.$$

Then (2 | p) = +1; so (8 | p) = +1 and 1+8+(-9) = 0. 


Also solved by Anders Bager (Denmark), D. M. Bloom, G. C. Dodds, M. G. Greening (Australia), 
Erwin Just, Ekkehard Kroll (Germany), D. C. B. Marsh, K. A. Ribet, E. Trost (Switzerland), 
K. M. Wilke, and the proposer. 

Kroll proves that each nonresidue is a member of such a triple, while Trost shows that the 
number of such triples is 

$$\frac{p-1}{48}\,\bigl\{p-8-3\,(-1\,|\,p)\,[1+2\,(2\,|\,p)]\bigr\}.$$
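
A brute-force count makes the statement of E 2173 concrete for small primes. The sketch below counts unordered triples of distinct nonresidues whose sum is divisible by p, and compares the count with the expression attributed to Trost, taken here (as an assumption about how that formula reads) to be $\frac{p-1}{48}\{p-8-3(-1|p)[1+2(2|p)]\}$. Function names are illustrative.

```python
from itertools import combinations


def legendre(a: int, p: int) -> int:
    """Legendre symbol (a | p) for an odd prime p, by Euler's criterion."""
    value = pow(a % p, (p - 1) // 2, p)
    return -1 if value == p - 1 else value


def count_nonresidue_triples(p: int) -> int:
    """Unordered triples of distinct nonresidues mod p with sum divisible by p."""
    nonresidues = [a for a in range(1, p) if legendre(a, p) == -1]
    return sum(1 for t in combinations(nonresidues, 3) if sum(t) % p == 0)


def trost_count(p: int) -> int:
    """Assumed reading of Trost's formula for the number of such triples."""
    value = (p - 1) * (p - 8 - 3 * legendre(-1, p) * (1 + 2 * legendre(2, p)))
    return value // 48


if __name__ == "__main__":
    for p in (5, 7, 11, 13, 17, 19):
        print(p, count_nonresidue_triples(p), trost_count(p))
```

The primes 5, 11, and 17 give a count of zero, in agreement with the exclusions in the problem statement.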


A Functional Equation 


E 2176 [1969, 554]. Proposed by R. S. Luthar, University of Wisconsin at 
Waukesha 


Find all continuous real functions f such that 

$$(1)\qquad f\!\left(\frac{x+y}{x-y}\right)=\frac{f(x)+f(y)}{f(x)-f(y)}.$$

Solution by L. E. Ward, Sr., Escondido, California. Placing y = 0 shows that 
f(1)[f(x) - f(0)] = f(x) + f(0), whence we infer that f(0) = 0 and f(1) = 1, since 
f(x) cannot be a constant. Replacing y by x-2 and again x by x-1 and y 
by 1, we obtain 

$$(2)\qquad f(x-1)=\frac{f(x)+f(x-2)}{f(x)-f(x-2)},\qquad f\!\left(\frac{x}{x-2}\right)=\frac{f(x-1)+1}{f(x-1)-1}.$$

It therefore follows that 

$$(3)\qquad f\!\left(\frac{x}{x-2}\right)=\frac{f(x)}{f(x-2)}.$$

From (3), with x = 4, we have f²(2) = 2f(2). But f(2) ≠ 0, since, from (1), f(2) = 0 
would imply f((x+2)/(x-2)) = 1. Hence f(2) = 2. 

Now assume that f(x) = x for x = 0, 1, 2, ..., n (n ≥ 2). Then, with x = n+1, 
the first equation of (2) gives at once f(n+1) = n+1. It follows that f(x) = x for 
all nonnegative integers. 

Putting y = cx yields 

$$f\!\left(\frac{1+c}{1-c}\right)=\frac{f(x)+f(cx)}{f(x)-f(cx)}=\frac{1+f(c)}{1-f(c)},$$

from which f(cx) = f(c)f(x). Now let x = p/q, a rational fraction, and c = q. Then 
f(p) = f(p/q)f(q), showing that f(x) = x for all positive rational numbers. By 
continuity it follows that f(x) = x for all positive numbers. 


Finally, placing y = -x in (1) gives f(0) = f(x) + f(-x), so that f(-x) = -f(x) 
= -x. We conclude that f(x) = x. 


Also solved by Joel Anderson, Anders Bager (Denmark), Joseph Beer, Stephen Berman, M. T. 
Bird, Michel Bousquet, Ted Cullen, R. J. Egbert, W. F. Fox, George Gastl & V. Sehgal, Michael 
Goldberg, M. G. Greening (Australia), Emil Grosswald, D. W. Hadwin, Erhard Heil, D. A. 
Herrero, Toshi Iida (Japan), B. G. Klein, Robert Kopp, Beatriz Margolis (Argentina), J. V. 
Michalowicz, Steven Minsker, Jernej Polajnar (Yugoslavia), Simeon Reich (Israel), E. F. Schmei- 
chel, Hanna Schwerdtfeger, Hans Schwerdtfeger, Stewart Shapiro, R. A. Struble, Charles Wexler, 
Albert White, and the proposer. 

Note. Several solutions attempting to prove by induction that f(n) = n for positive integers 
n, were incomplete: they did not show that f(2) =2. 


A Characteristic Value Problem 


E 2177 [1969, 554]. Proposed by P. M. Gibson, University of Alabama, Hunts- 
ville 


Let r be a nonzero complex number, and m an integer. Suppose that $A=(a_{ij})$, 
$B=(b_{ij})$ are n × n complex matrices with $b_{ij}=r^{m+i-j}a_{ij}$ for i, j = 1, ..., n. Show 
that if λ is a characteristic value of A of multiplicity k then $r^{m}\lambda$ is a characteristic 
value of B of multiplicity k. Use this to show that if A is a tridiagonal matrix of 
odd order with zero diagonal then A is singular. 


Solution by J. R. Kuttler, The Johns Hopkins University Applied Physics 
Laboratory. We have $B=r^{m}DAD^{-1}$, where D is the diagonal matrix with $d_{ii}=r^{i}$, 
i = 1, ..., n. Since $DAD^{-1}$ is a similarity transformation, the eigenvalues of B 
are $r^{m}$ times the eigenvalues of A. Now let m = 0, r = -1. When A is tridiagonal with zero 
diagonal, B = -A. Thus A and -A have the same eigenvalues. Hence the eigenvalues 
of A are symmetrically distributed about the origin and the number of 
nonzero eigenvalues is necessarily even. Thus, when n is odd there must be a 
zero eigenvalue, i.e., A is singular (cf. Problem 5488, [1967, 425]). 
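
The two facts used in this solution, that conjugating by the signs $(-1)^i$ turns a zero-diagonal tridiagonal matrix into its negative and that such a matrix of odd order is singular, can be confirmed exactly on random integer examples. The sketch below uses rational arithmetic to avoid rounding; the helper names and the choice of random entries are illustrative assumptions.

```python
from fractions import Fraction
import random


def determinant(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    a = [[Fraction(x) for x in row] for row in matrix]
    n = len(a)
    det = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def random_tridiagonal_zero_diagonal(n, rng):
    """n x n integer matrix whose only nonzero entries lie on the two off-diagonals."""
    m = [[0] * n for _ in range(n)]
    for i in range(n - 1):
        m[i][i + 1] = rng.randint(1, 9)
        m[i + 1][i] = rng.randint(1, 9)
    return m


def conjugate_by_signs(m):
    """B = D A D^{-1} with d_ii = (-1)^i, i.e. b_ij = (-1)^(i-j) a_ij."""
    n = len(m)
    return [[(-1 if (i - j) % 2 else 1) * m[i][j] for j in range(n)] for i in range(n)]


if __name__ == "__main__":
    rng = random.Random(0)
    for n in (3, 4, 5):
        a = random_tridiagonal_zero_diagonal(n, rng)
        print(n, determinant(a))
```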


Also solved by J. V. Michalowicz, Simeon Reich (Israel), W. A. Thrash, Jr., and the proposer. 
Continuous Real-valued Functions with at most one Nonzero Partial Derivative 

E 2178 [1969, 690]. Proposed by W. G. Dotson, North Carolina State University 


(1) Find a connected open set D in the plane, and a continuous function f 
from D to the real numbers, such that $f_x(x, y)=0$ on D but f is not a function of 
y alone. (2) Find (weakest possible) conditions on a connected open set D in $R^{n}$ 
which insure that if f is a continuous function from D to the real numbers, and 
$\partial f/\partial x_i=0$ for i = 2, 3, ..., n, then f is a function of $x_1$ alone. 


Solution by Simeon Reich, Israel Institute of Technology. (1) Let D be the 
following set: 

$$\{(x,y)\mid -2<x<2,\ 0<y<2\}\setminus\{(x,y)\mid -1\le x\le 1,\ 1\le y<2\},$$

and let f(x, y) be defined as follows: 

$$f(x,y)=\begin{cases}-y+2, & -2<x<-1,\ 1\le y<2,\\ y, & \text{for other } (x,y)\in D.\end{cases}$$

Here, f is not a function of y alone because 1/2 = f(-3/2, 3/2) ≠ f(3/2, 3/2) = 3/2. 


(2) The following condition is sufficient: For any two distinct points 
$(x_1, x_2, \ldots, x_n)$, $(y_1, y_2, \ldots, y_n)$ belonging to D such that $x_i=y_i$ for 
i = 1, ..., j-1, j+1, ..., n, where j ≥ 2, the set 

$$\{(x_1,\ldots,z,\ldots,x_n)\mid x_j\le z\le y_j \text{ or } y_j\le z\le x_j\}$$

is a subset of D. 

Proof. Take any two points $a=(x_1, x_2, \ldots, x_n)$, $b=(x_1, y_2, \ldots, y_n)$ belonging 
to D. We have to show that f(a) = f(b). Indeed we have 

$$\begin{aligned} f(a)-f(b)={}&[f(x_1,x_2,\ldots,x_n)-f(x_1,y_2,x_3,\ldots,x_n)]\\ &+[f(x_1,y_2,x_3,\ldots,x_n)-f(x_1,y_2,y_3,x_4,\ldots,x_n)]+\cdots\\ &+[f(x_1,y_2,\ldots,y_{n-1},x_n)-f(x_1,y_2,\ldots,y_n)]. \end{aligned}$$

Consider, for example, the first bracket. The function $F(z)=f(x_1, z, x_3, \ldots, x_n)$ 
is differentiable for $x_2\le z\le y_2$ (or for $y_2\le z\le x_2$, as the case may be). Therefore 
there is a number w, $x_2<w<y_2$, such that $F(x_2)-F(y_2)=(x_2-y_2)F'(w)$. But 
$F'(w)=f_{x_2}(x_1, w, x_3, \ldots, x_n)=0$. Similarly for the other brackets. The result 
follows. 

I do not know if this condition is the weakest possible in the sense that for 
any D not satisfying this requirement there exists a suitable function for which 
the conclusion is not valid. But note that neither the continuity of f nor the 
connectedness of D was needed in the proof. 


Also solved by G. A. Heuer, E. F. Schmeichel, Luis Verde-Star, and the proposer. 


1970] ADVANCED PROBLEMS AND SOLUTIONS 313 


ADVANCED PROBLEMS 


All solutions of Advanced Problems should be sent to J. Barlaz, Rutgers—The State Uni- 
versity, New Brunswick, N. J. 08903. Solutions of Advanced Problems in this issue should be 
typed (with double spacing) on separate, signed sheets and should be mailed before June 30, 
1970. Contributors (in the United States) who desire acknowledgment of receipt of their solu- 
tions are asked to enclose self-addressed stamped postcards. 

An asterisk (*) means neither the proposer nor the editors supplied a solution. 


5720. Proposed by Otto Morphy, Atlantis University 

Using the definition of “normal” not containing “Hausdorff,” find all topo- 
logical spaces X which are not normal but are such that each proper subspace is 
normal. 


5721. Proposed by Simeon Reich, The Technion, Haifa, Israel 


Let $f: K^{n}\to E^{n}$ be a continuous function where $E^{n}$ is Euclidean n-space and 
$K^{n}=\{x\mid x\in E^{n},\ \|x\|\le 1\}$ such that for every $y\in S^{n-1}=\{x\mid x\in E^{n},\ \|x\|=1\}$ 
there is no m > 1 with f(y) = my. Show that f has a fixed point. 


5722. Proposed by D. G. Cantor, University of California, Los Angeles 


Let X be a compact subset of the reals. Prove that the necessary and 
sufficient condition for the existence of a nonconstant monic polynomial with 
real coefficients which has absolute value <1 on X is that there exist such a 
polynomial which has absolute value <2 on X. Show that “2” is sharp; i.e., 2 
cannot be replaced by any larger number. 


5723*. Proposed by H. D. Ruderman, Hunter College High School 


Let M = ab + cd - ef. A random integer is selected uniformly from the numbers 
0 to 9 inclusive and is used for some one of the six variables a, b, c, d, e, or f. 
Then a second random integer is selected independently from the same uniform 
distribution and is used for some other of the remaining variables. This selection 
and assignment is continued in the same way until six integers have been selected 
and assigned to the six variables. What is the best strategy for assigning the 
integers so as to maximize the expectation of M? 


5724*. Proposed by Paul Erdős, University College of Swansea, Wales 

Denote by P(n) the largest prime factor of n. Let f(n) be an increasing 
function of n so that $\sum_{n=1}^{\infty} 1/(n f(n))<\infty$. Prove that $\sum_{n} 1/(n f(P(n)))<\infty$. 

5725. Proposed by D. J. Lutzer, University of Washington 


Is it true that every compact Hausdorff space is a compactification of a 
metric space? 


5726*. Proposed by D. P. Allen, Jr., Bell Telephone Laboratories, Holmdel, N.J. 


Let X be a finite nonempty set, let ΣX [AX] denote the free [free commutative] 
semigroup on X, and let θ: ΣX → AX denote the natural homomorphism. 
Do there exist semigroups T and T' of ΣX with T ∪ T' = ΣX and T ∩ T' = ∅ 
such that (i) θ(T) ∩ θ(T') ≠ ∅ and (ii) neither T nor T' contains either a right 
or a left ideal of ΣX? 


SOLUTIONS OF ADVANCED PROBLEMS 
Rational Distances on a Conic Section 


5650 [1969, 95]. Proposed by D. E. Daykin, University of Malaya, Kuala 
Lumpur 

Let S be a set of points of the real x,y plane such that the distance between 
any two of the points is rational. How big can S be when S lies (1) on a hyper- 
bola, (2) on a parabola, (3) on a proper ellipse, and (4) when S is of the form 
$(x_1, +d), (x_1, -d), (x_2, +d), (x_2, -d), \ldots$? 


Note and comments by the proposer. It is known, and easy to verify, that 
there can be infinitely many points on a circle for which all distances are ra- 
tional—choose the points (1, 0) for which tan0 is rational. By problem 5330 
[1966, 1020] the set must be countable. S can consist of two points on one line 
and infinitely many points on another line—see D. E. Daykin, Rational Poly- 
gons, Mathematika, 10 (1963), 125-131. See also the reference in 5330. 

Cases (1) and (3). We require $(x_i-x_j)^2+(y_i-y_j)^2$ equal to a square number, 
subject to $(x/a)^2\pm(y/b)^2=1$ for the ellipse and hyperbola respectively. 

Case (4). We need a set of points $x_i$ such that $(x_i-x_j)^2+4d^2$ is a square. We 
can assume that $x_1=0$ and, by a change of scale, that $4d^2=1$. Thus the problem 
is equivalent to finding a subset A of the set P of solutions of the Pell equation 
$x^2+1$ equals a square, such that $0\in A$ and $x_1, x_2\in A$ implies $(x_1-x_2)\in P$. 

Case (2). Take for the parabola $y=at^2+bt+c$, $a\ne 0$. Putting $t=x-(b/2a)$ 
gives $y=ax^2+d$ for some d. We need a set of points $(x_i, y_i)$ such that $y_i=ax_i^2+d$ 
and 

$$(x_i-x_j)^2+(y_i-y_j)^2=\square,$$

which is, upon substituting for $y_i$, $y_j$, 

$$(x_i-x_j)^2+(ax_i^2-ax_j^2)^2=\square,$$

that is 

$$1+(ax_i+ax_j)^2=\square.$$

So, if P is the set of solutions of the Pell equation $x^2+1=\square$, the problem is 
equivalent to finding a set B of numbers, such that $x_1, x_2\in B$ implies $x_1+x_2\in P$. 


Note. The editors will be pleased to print partial solutions if these are available. 


A Trigonometric Equation 


5661 [1969, 309]. Proposed by G. J. Foschini, Bell Telephone Laboratories, 
Holmdel, N. J. 
Find all solutions in the complex plane of 
$$z=\sum_{q=2}^{\infty}\sum_{p=1}^{q}e^{2\pi i p z/q}.$$

Solution by Douglas Campbell, University of North Carolina. For the given 
double sum even to make sense we must have 

$$\lim_{q\to\infty}\sum_{p=1}^{q}e^{2\pi i p z/q}=0.$$

If z is not an integer: 

$$\lim_{q\to\infty}\left|\sum_{p=1}^{q}e^{2\pi i p z/q}\right|=\lim_{q\to\infty}\left|e^{2\pi i z/q}\left(\frac{1-e^{2\pi i z}}{1-e^{2\pi i z/q}}\right)\right|=\infty.$$

Hence, the only possible solutions are integers. Since 

$$\sum_{p=1}^{q}e^{2\pi i p n/q}=\begin{cases}0 & \text{if } q\nmid n,\\ q & \text{if } q\mid n,\end{cases}$$

we need only find all integral solutions to the equation 

$$\sum_{q=2}^{\infty}\sum_{p=1}^{q}e^{2\pi i p n/q}=\sum_{q\mid n,\ q\ge 2}q=n.$$

This latter equation is clearly true if and only if n is a positive prime. Therefore 
the only solutions to the original equation are the positive primes. 

REMARK: The above analysis shows that the set of solutions of the equation 
$2z=\sum_{q=1}^{\infty}\sum_{p=1}^{q}e^{2\pi i p z/q}$ is the set of perfect numbers. 
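
The reduction to divisor sums is easy to test numerically. The sketch below, with illustrative function names, lists the positive integers n below 200 for which the sum of the divisors q ≥ 2 of n equals n, and separately those for which the sum of all divisors equals 2n.

```python
def divisor_sum_from(n: int, smallest: int) -> int:
    """Sum of the divisors q of n with q >= smallest."""
    return sum(q for q in range(smallest, n + 1) if n % q == 0)


def is_prime(n: int) -> bool:
    """Trial-division primality test, adequate for small n."""
    if n < 2:
        return False
    return all(n % d for d in range(2, int(n ** 0.5) + 1))


# Integer solutions of the reduced equation (divisors q >= 2 summing to n).
prime_like = [n for n in range(1, 200) if divisor_sum_from(n, 2) == n]
# Integer solutions of the equation in the Remark (all divisors summing to 2n).
perfect_like = [n for n in range(1, 200) if divisor_sum_from(n, 1) == 2 * n]

if __name__ == "__main__":
    print(prime_like)
    print(perfect_like)
```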


Also solved by T. M. Apostol, Robert Breusch, L. Carlitz, Josef Daneš (Czechoslovakia), 
Emil Grosswald, H. Pétard, Jernej Polajnar (Yugoslavia), Steve Rohde, and the proposer. 


Inherited Property for a Polynomial Ring 


5662 [1969, 309]. Proposed by Irving Kaplansky, University of Chicago 
Let R be an associative ring with unit. Suppose in R whenever ab=1 then 
ba =1. Prove that the same thing holds in R[x], the polynomial ring over R. 


Solution by Sue Ann Hemr, Champaign, Illinois. Let p(x)q(x) = 1 and q(x)p(x) 
= e(x). From the equation p(0)q(0) = 1 it follows that q(0)p(0) = 1 = e(0). If 
e(x) ≠ 1 then there is a minimal natural number m and a nonzero element r in R 
such that $e(x)=1+rx^{m}+\cdots$. From the relation between p(x) and q(x) we 
obtain $e(x)=e(x)e(x)=1+2rx^{m}+\cdots$ and thus r = 2r. This contradicts the 
assumption that r ≠ 0 and so e(x) = 1 after all. 


Also solved by forty-four other readers. 

E. J. Taft observes that the property remains true for the formal power-series ring R[[x]] if 
the element 1 is replaced by any central element c which is not a zero divisor in R, i.e., ab=c 
implies ba=c becomes the hypothesis. He then raises the question whether R[x] retains the 
property if we delete the assumption that c is a central nonzero divisor in R. 
OPTIMAL STOPPING 
HERBERT ROBBINS, Columbia University 


The theory of probability began with efforts to calculate the odds in games of 
chance. In this context, optimal stopping problems concern the effect on a 
gambler’s fortune of various possible systems for deciding when to stop playing 
a sequence of games. Such problems are of interest in statistics, where the 
experimenter must constantly ask whether the increase in information con- 
tained in further data will outweigh the cost of collecting it. 

Optimal stopping theory provides a general mathematical framework in 
which such problems can be precisely formulated and in some cases solved 
completely. The examples which we shall consider here are of a simpler nature 
than those arising in statistics, but will serve to illustrate some of the problems 
that arise in the general theory. 


EXAMPLE 1. A fair coin is tossed repeatedly. After each toss we have the op- 
tion of stopping or going on to the next toss, our decision at each stage being 
allowed to depend on the outcome thus far. We must stop after some finite (but 
not necessarily preassigned) number of tosses, and it is agreed that if we stop 
after the nth toss we are to receive a reward $x_n$ which is a given function of the 
outcomes of the first n tosses. When should we stop so as to maximize our 
expected reward? 

Let us introduce random variables $y_1, y_2, \ldots$ to represent the successive 
tosses, the $y_i$ being independent with the common probability distribution 
$P(y_i=1)=P(y_i=-1)=1/2$; $y_i=1$ denoting heads on the ith toss and $y_i=-1$ 
tails. The reward sequence will then consist of a sequence of functions $x_1, x_2, \ldots$, 
where $x_n=f_n(y_1,\ldots,y_n)$. A stopping rule is then a random variable t with 
values in the set {1, 2, 3, ...} and such that the event [t = n] depends solely 
on the values of $y_1,\ldots,y_n$ and not on future values $y_{n+1},\ldots$. Using a stopping 
rule t our reward $x_t$ will be a random variable whose expectation $Ex_t$ 
measures the performance on the average of the stopping rule t. The supremum 
$V=\sup\{Ex_t\}$ over the class C of all possible stopping rules t for which $Ex_t$ 
exists is called the value of the sequence $\{x_n\}$, and if a stopping rule t exists such 
that $Ex_t=V$, t is said to be optimal. 


Prof. Robbins earned his Harvard Ph.D. under Hassler Whitney. He followed an assistantship 
under Marston Morse at the Institute and an NYU instructorship with a four-year career in the 
U.S. Navy. He then was Associate Prof. and Prof. at the Univ. of N. Carolina before assuming his 
present position at Columbia. He spent leaves at Berkeley, Minnesota, Purdue, Michigan, and 
I.A.S. 

He has served as President of the Institute of Math. Stat. and delivered the Rietz and Wald 
lectures. He was a Guggenheim Fellow in 1952-53. 

Since his thesis in topology, Robbins’s main research has been in probability theory and 
mathematical statistics. His book (with Y. S. Chow and D. O. Siegmund) Great Expectations; The 
Theory of Optimal Stopping will be published shortly by Houghton Mifflin. His previous What is 
Mathematics? with Richard Courant (Oxford Univ. Press, 1941) is a landmark in mathematical 
exposition. Editor. 
Thus far we have said nothing about the nature of the reward sequence 
$x_n=f_n(y_1,\ldots,y_n)$. To begin with, let us take 

$$(1)\qquad x_n=\frac{n2^{n}}{n+1}\prod_{i=1}^{n}\left(\frac{y_i+1}{2}\right),\qquad n=1,2,\ldots,$$

and analyze the resulting situation. Equation (1) is just a symbolic way of 
saying that if we stop after the nth toss with all heads we are to receive 
$n2^{n}/(n+1)$, while if any one of the first n tosses has been a tail we are to 
receive nothing. A little reflection will show that we need consider only the class 
of stopping rules $\{t_k\}$, k = 1, 2, ..., where $t_k=k$; i.e., $t_k$ stops after the kth 
toss no matter what sequence of heads and tails has appeared. Clearly 

$$Ex_{t_k}=\frac{k2^{k}}{k+1}\cdot\frac{1}{2^{k}}+0\cdot\left(1-\frac{1}{2^{k}}\right)=\frac{k}{k+1},$$

and therefore V = 1 but no optimal stopping rule exists. 
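
The expectation of the fixed-time rules can be confirmed by enumerating all toss sequences for small k. This is a sketch with illustrative function names; it evaluates the reward (1) exactly with rational arithmetic.

```python
from fractions import Fraction
from itertools import product


def reward(tosses):
    """x_n from equation (1): n*2^n/(n+1) if every toss is +1, otherwise 0."""
    n = len(tosses)
    value = Fraction(n * 2 ** n, n + 1)
    for y in tosses:
        value *= Fraction(y + 1, 2)
    return value


def expected_reward_fixed_stop(k):
    """E x_{t_k} for the rule that always stops after toss k."""
    outcomes = list(product((1, -1), repeat=k))
    return sum(reward(o) for o in outcomes) / len(outcomes)


if __name__ == "__main__":
    print([expected_reward_fixed_stop(k) for k in range(1, 8)])
```

The values k/(k+1) increase toward 1 without reaching it, which is the sense in which V = 1 is not attained.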


We remark in passing that at any stage n in which all heads have appeared, 
so that $x_n=n2^{n}/(n+1)$, the conditional expected reward for making one more 
toss before stopping is 

$$E\!\left(x_{n+1}\,\Big|\,x_n=\frac{n2^{n}}{n+1}\right)=\frac{1}{2}\cdot\frac{(n+1)2^{n+1}}{n+2}=\frac{2^{n}(n+1)}{n+2}>x_n.$$

Hence it is always “foolish” to stop with all heads. But if we do not act “foolishly” 
at some point we shall wait for the first tail to occur and our final reward 
will always be 0. Thus acting “wisely” at each stage is the worst long-range 
policy. 


EXAMPLE 2. The same, except that now 

$$(2)\qquad x_n=\frac{y_1+\cdots+y_n}{n}.$$

This problem is much harder than the preceding one because of the enormous 
family of possible stopping rules which must be considered and evaluated. A 
simple instance of such a rule is 

$$(3)\qquad t=\begin{cases}1 & \text{if } y_1=1, \text{ otherwise}\\ n & \text{if } n \text{ is the first integer such that } y_1+\cdots+y_n=0.\end{cases}$$


The first question we must ask is, does (3) define a legitimate stopping rule in 
the sense that P(t < ∞) = 1? In probability theory it is shown that in repeated 
tossings of a fair coin, for any fixed integer k = 0, ±1, ±2, ..., the probability 
is 1 that the difference (number of heads in first n tosses) - (number of tails in 
first n tosses) will assume the value k infinitely often. In our notation the case 
k = 0 corresponds to the event $y_1+\cdots+y_n=0$, and hence P(t < ∞) = 1. It 
remains to evaluate $Ex_t$ for (2) and (3). We have 

$$Ex_t=\tfrac12\cdot 1+\tfrac12\cdot 0=\tfrac12.$$

It follows that V ≥ ½, and of course V ≤ 1, since $x_n\le 1$ in all cases. By trial and 
error we can invent other stopping rules t for which $Ex_t>\tfrac12$, but we will find 
none for which $Ex_t\ge .9$, say. However, it is not so easy to prove that V < .9, 
still less to find the exact value of V and to determine whether an optimal t 
exists in this example. 

In cases like this it is tempting to try to “put the problem on a computer.” 
Briefly, here is what a computer can do. Suppose we restrict ourselves to the 
class $C_N$ of all possible stopping rules t which take on only values in the set 
{1, 2, ..., N}, where N is some fixed positive integer; in other words, we 
restrict ourselves to stopping rules which always stop after at most N tosses of 
the coin. Denoting by $v_N$ the supremum of $Ex_t$ over the class $C_N$, we see that in 
this case $C_N$ is a finite class and therefore an optimal rule in $C_N$ must exist. 
Even so, for N = 1000, say, the class $C_N$ is still too large for an exhaustive 
analysis. At this point the general theory of optimal stopping comes to our aid. 
Whenever the number of stages in the problem is bounded (even if the random 
variables involved have continuous rather than discrete distributions) there is 
always an optimal rule and a more or less constructive algorithm for finding it 
(cf. (22)). In the present problem a computer can actually be programmed to 
find $v_N$ quite quickly for all N up to a few thousand. By definition the sequence 
$v_N$ is nondecreasing, and therefore $v=\lim_{N\to\infty}v_N$ exists. But the values of $v_N$ 
produced by a computer for successive values of N display no obvious pattern, 
and it is impossible even to guess the exact value of v from computer evidence, 
still less to decide whether V = v or V > v. 
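
One way a computer can find $v_N$ is backward induction on the pair (number of tosses so far, current partial sum). The following is a minimal sketch of that idea for the reward of Example 2, not necessarily the algorithm the text refers to; the function names and the use of exact rational arithmetic are illustrative choices.

```python
from fractions import Fraction


def bounded_value(horizon, reward):
    """v_N by backward induction for fair +/-1 tosses.

    reward(n, s) is the payoff for stopping after toss n with partial sum s.
    """
    # At the last stage stopping is forced.
    values = {s: reward(horizon, s) for s in range(-horizon, horizon + 1, 2)}
    for n in range(horizon - 1, 0, -1):
        # At stage n choose the better of stopping now and tossing once more.
        values = {
            s: max(reward(n, s), (values[s + 1] + values[s - 1]) / 2)
            for s in range(-n, n + 1, 2)
        }
    # At least one toss must be made before stopping.
    return (values[1] + values[-1]) / 2


def average_reward(n, s):
    """Reward (2): the running average of the tosses."""
    return Fraction(s, n)


if __name__ == "__main__":
    for horizon in (1, 2, 3, 10, 100):
        print(horizon, float(bounded_value(horizon, average_reward)))
```

The work grows only quadratically in N, which is why horizons of a few thousand tosses are within easy reach.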

In the present problem it has been proved [6, 11] that an optimal t does 
exist and that v = V, but the exact description of t and the value of V are not 
known. 


EXAMPLE 3. As in Example 1 but now with 

$$x_n=\min(1,\ y_1+\cdots+y_n)-\frac{n}{n+1}\qquad (n\ge 1).$$

Consider the stopping rule 

$$(4)\qquad t=\text{first } n\ge 1 \text{ such that } y_1+\cdots+y_n=1.$$

That P(t < ∞) = 1 follows as in Example 2, and since n/(n+1) < 1, 

$$Ex_t=1-E\!\left(\frac{t}{t+1}\right)>0$$

(the exact value of $Ex_t$ would require some probability theory to compute). A 
little thought will show that t is in fact optimal for this example and hence that 
$V=Ex_t>0$. 

On the other hand, since the $y_i$ are independent and identically distributed 
with $Ey_i=\tfrac12\cdot 1+\tfrac12\cdot(-1)=0$, Wald’s lemma (see below) shows that if, unlike 
(4), t is any stopping rule for which Et < ∞, and hence in particular if $t\in C_N$ for 
some N = 1, 2, ..., then $E(y_1+\cdots+y_t)=0$, so that 

$$Ex_t\le E(y_1+\cdots+y_t)-E\!\left(\frac{t}{t+1}\right)\le-\frac12.$$

Hence, $v_N\le-\tfrac12$ for all N ≥ 1, and hence $v=\lim_{N\to\infty}v_N\le-1/2$, while as we have 
seen V > 0. 
A slight variation on this example is given by 


EXAMPLE 4. Let $y_1, y_2, \ldots$ be independent (but not identically distributed) 
random variables such that 

$$(5)\qquad P(y_i=1-a_i)=P(y_i=-1-a_i)=1/2,\quad\text{with } a_i=\frac{1}{i(i+1)},$$

and let 

$$(6)\qquad x_n=y_1+\cdots+y_n\qquad (n\ge 1)$$

be the reward if we stop at the nth stage. Thus $x_n$ represents the net gain of a 
gambler who plays a succession of unfavorable games, 

$$Ey_i=\tfrac12(1-a_i)+\tfrac12(-1-a_i)=-a_i<0,$$

and stops after the nth. It might seem that whatever stopping rule t he might 
use, his expected net gain after stopping, $Ex_t$, would be < 0. However, let 

$$(7)\qquad t=\text{first } n\ge 1 \text{ such that } \sum_{i=1}^{n}(y_i+a_i)=k,$$

where k is any preassigned positive integer. The argument in Example 2 again 
shows that P(t < ∞) = 1, while by (6) and (7) 

$$Ex_t=E\!\left(\sum_{i=1}^{t}y_i\right)=k-E\!\left(\sum_{i=1}^{t}a_i\right)=k-E\!\left(\sum_{i=1}^{t}\left(\frac1i-\frac1{i+1}\right)\right)=k-E\!\left(\frac{t}{t+1}\right)>k-1.$$

Since k can be as large as we please, we see that V = +∞ for this example. (The 
reader may decide whether there is an optimal t in this case, i.e., one for which 
$Ex_t=+\infty$.) We remark that although the one-step conditional expected reward 
$E(x_{n+1}\mid x_n)=x_n+E(y_{n+1})$ is always less than the present reward $x_n$, the use 
of a proper stopping rule makes the game profitable. 

This example suggests the following question. If $y_1, y_2, \ldots$ are independent 
and identically distributed with $Ey_i<0$, does there exist a stopping rule t such 
that $E(\sum_1^t y_i)>0$? We shall now show that there does not. 

We shall need the strong law of large numbers of probability theory. 
THEOREM. (Kolmogorov) Let $y_1, y_2, \ldots$ be independent and identically distributed 
random variables with the common distribution function $F(y)=P(y_i\le y)$, 
and put $x_n=\sum_1^n y_i$. If 

$$(8)\qquad Ey_i=\int_0^{\infty}y\,dF(y)+\int_{-\infty}^{0}y\,dF(y)$$

exists (i.e., if the two integrals in (8) are not respectively +∞ and -∞), denote it 
by μ. Then 

(a) If μ exists, $-\infty\le\mu\le\infty$, then $P(\lim_{n\to\infty}(x_n/n)=\mu)=1$. 

(b) If for some finite constant c, 

$$(9)\qquad P\!\left(\lim_{n\to\infty}\frac{x_n}{n}=c\right)=1,$$

then μ exists and equals c. 


We remark parenthetically that (9) may hold with c = +∞ or -∞ even 
though μ does not exist. 

Now let t be any stopping rule for an independent and identically distributed 
sequence $y_1, y_2, \ldots$ for which $\mu=Ey_i$ exists, $-\infty\le\mu\le\infty$, and consider the 
randomly stopped sum $x_t=\sum_1^t y_i$. For any n ≥ 1 let 

$$S_n=(y_1+\cdots+y_{t_1})+(y_{t_1+1}+\cdots+y_{t_1+t_2})+\cdots+(y_{t_1+\cdots+t_{n-1}+1}+\cdots+y_{t_1+\cdots+t_n}),$$

where $t_1=t$, $t_2$ is t applied to the sequence $y_{t_1+1}, y_{t_1+2}, \ldots$, etc. It is easy to see 
that the n groups of terms in $S_n$ are in fact n independent random variables with 
the same probability distribution as $x_t$, and that $t_1, t_2, \ldots$ are independent 
random variables with the same distribution as t. Since $\mu=Ey_i$ exists by 
hypothesis and since Et always exists (since t ≥ 1), it follows from (a) that with 
probability 1 as n → ∞ 

$$(10)\qquad \frac{S_n}{n}=\frac{S_n}{t_1+\cdots+t_n}\cdot\frac{t_1+\cdots+t_n}{n}\to\mu\cdot Et,$$

provided only that the last expression is not of the form 0·∞. Applying (a) again 
to $x_t=\sum_1^t y_i$ we have [15] the 


COROLLARY. If $y_1, y_2, \ldots$ are independent and identically distributed, if 
$\mu=Ey_i$ exists, and if t is any stopping time of the sequence $y_1, y_2, \ldots$ for which 
$E(\sum_1^t y_i)$ exists, then 

$$(11)\qquad E\!\left(\sum_{1}^{t}y_i\right)=\mu\cdot Et$$

whenever the right side of (11) is not of the form 0·∞. 
From (b) and (10) we obtain 


WALD’S LEMMA [18]. If μ and Et are both finite then $E(\sum_1^t y_i)$ always exists 
and satisfies (11). 
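
For a bounded stopping rule, identity (11) can be verified exactly by propagating probabilities forward in time. The sketch below assumes a coin with P(y = +1) = p and the hypothetical rule "stop when the partial sum first equals a target, or at a fixed horizon, whichever comes first"; these choices and the function name are illustrative only.

```python
from fractions import Fraction


def stopped_sum_and_time(p, horizon, target):
    """Exact E(y_1 + ... + y_t) and E(t) for a bounded stopping rule.

    Each y_i is +1 with probability p and -1 with probability 1 - p.
    The rule stops at the first n with partial sum == target, or at n == horizon.
    """
    alive = {0: Fraction(1)}          # partial sum -> probability of not having stopped
    expected_sum = Fraction(0)
    expected_time = Fraction(0)
    for n in range(1, horizon + 1):
        moved = {}
        for s, prob in alive.items():
            for step, weight in ((1, p), (-1, 1 - p)):
                moved[s + step] = moved.get(s + step, Fraction(0)) + prob * weight
        alive = {}
        for s, prob in moved.items():
            if s == target or n == horizon:
                expected_sum += prob * s
                expected_time += prob * n
            else:
                alive[s] = prob
    return expected_sum, expected_time


if __name__ == "__main__":
    p = Fraction(1, 3)
    mu = 2 * p - 1
    total, time = stopped_sum_and_time(p, horizon=12, target=2)
    print(total, mu * time)
```

Because the rule is bounded, Et is finite and the two printed quantities agree exactly.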


Wald’s Lemma admits of many generalizations. For example, if $y_1, y_2, \ldots$ 
are independent but not necessarily identically distributed random variables 
for which $E|y_i|\le C<\infty$ for all i ≥ 1, if $\mu_i=Ey_i$, and if t is any stopping rule of 
the sequence $y_1, y_2, \ldots$ such that Et < ∞, then $E(\sum_1^t y_i)$ exists and $E(\sum_1^t y_i)
=E(\sum_1^t\mu_i)$. Applied to Example 4 in which $|y_i|\le 2$ we see that if in that 
example t is any stopping time of the sequence $y_1, y_2, \ldots$ such that Et < ∞, then 

$$Ex_t=E\!\left(\sum_1^t y_i\right)=-E\!\left(\sum_1^t a_i\right)=-E\!\left(\frac{t}{t+1}\right)\le-\frac12.$$

Thus to achieve $Ex_t>-1/2$ as we did with (7), we have to use a t for which 
Et = ∞. This detracts somewhat from the prospect of getting rich in the long 
run by using (7) or something like it. 

We are now able to justify our negative answer to the question raised in
the paragraph following Example 4. The Corollary shows that if $y_1, y_2, \dots$ are
independent and identically distributed with $Ey_i = \mu < 0$, and if $t$ is any stopping
rule for which $E(\sum_1^t y_k)$ exists (even though $Et$ may be $+\infty$), then


(12)    $E\Bigl(\sum_1^t y_k\Bigr) = \mu \cdot Et < 0;$


since $Et \ge 1$ gives $\mu \cdot Et \le \mu$, the optimal stopping rule is therefore $t = 1$, and
$V = \mu < 0$. (We remark that there may exist $t$'s such that $E(\sum_1^t y_i)$ does not exist,
but such $t$'s are excluded from consideration under our definition of optimality.)

Our next example is of a different character from the preceding ones in that 
it can be supposed to apply to certain problems in real life rather than to 
imaginary gambling situations. 


EXAMPLE 5. Let $y_1, y_2, \dots$ be independent random variables with a common
distribution function $F(y) = P(y_i \le y)$, and let


(13)    $x_n = \max(y_1, \dots, y_n) - cn \qquad (n \ge 1)$


where $c$ is some positive constant. What stopping rule $t$ maximizes $Ex_t$? A complete
solution to this question is given by
(i) If $\int_0^\infty y\,dF(y) = \infty$ then an optimal stopping rule is


(14)    $t$ = first $n \ge 1$ such that $y_n \ge b$ ($b$ any finite constant),


and $V = Ex_t = +\infty$, while
(ii) if $\int_0^\infty y\,dF(y) < \infty$ there exists a unique number $\beta$ such that


(15)    $\int_\beta^\infty (y - \beta)\,dF(y) = c;$

an optimal stopping rule is

(16)    $t$ = first $n \ge 1$ such that $y_n \ge \beta$,

and $V = Ex_t = \beta$.
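Proof. (i) Let $p = P(y_i \ge b) = 1 - F(b) > 0$ and $q = 1 - p$. Then for (14)


    $P(t = n) = pq^{n-1}, \qquad P(t < \infty) = \sum_{n=1}^{\infty} pq^{n-1} = p \cdot \dfrac{1}{1 - q} = 1,$


(17)    $Et = \sum_{n=1}^{\infty} npq^{n-1} = p\,\dfrac{d}{dq}\Bigl(\sum_{n=0}^{\infty} q^n\Bigr) = \dfrac{p}{(1 - q)^2} = \dfrac{1}{p},$


and


    $E \max(y_1, \dots, y_t) = \dfrac{\int_b^\infty y\,dF(y)}{p} = \infty.$


Hence by (17)


    $Ex_t = E \max(y_1, \dots, y_t) - c \cdot Et = \infty,$


so $t$ is necessarily optimal.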
(ii) Consider the function


    $\varphi(\beta) = \int_\beta^\infty (y - \beta)\,dF(y),$


which is equal to the area of the region in a $y, z$-plane bounded below by the
curve $z = F(y)$, on the left by the line $y = \beta$, and above by the line $z = 1$. It is
geometrically evident that there is a unique solution of (15) for any given $c > 0$,
and for $t$ defined by (16) we have as in (i)


    $Et = \dfrac{1}{p}, \qquad Ex_t = \dfrac{1}{p}\Bigl(\int_\beta^\infty y\,dF(y) - c\Bigr) \qquad (p = 1 - F(\beta)).$


Hence by (15)

(18)    $Ex_t = \beta.$


To prove that the $t$ defined by (16) is optimal, let $t'$ be any stopping rule such
that $Ex_{t'}$ exists and is $> -\infty$. Choose any $b > \beta$ and observe that


(19)    $x_n = \max(y_1, \dots, y_n) - cn \le b + \sum_{i=1}^{n} \bigl((y_i - b)^+ - c\bigr),$

where we define $a^+ = \max(a, 0)$. The sequence of random variables $w_1, w_2, \dots$,
where $w_i = (y_i - b)^+ - c$, is independent and identically distributed with
$\mu = Ew_i = \int_{-\infty}^{\infty} ((y - b)^+ - c)\,dF(y) = \varphi(b) - c < 0$ since $b > \beta$ and $\beta$ satisfies (15).
Now $Ex_{t'} > -\infty$ by hypothesis, so by (19)


(20)    $E\Bigl(\sum_{i=1}^{t'} w_i\Bigr) \ge Ex_{t'} - b > -\infty,$

and hence by Corollary 1, whether $Et'$ be finite or infinite,

(21)    $E\Bigl(\sum_{i=1}^{t'} w_i\Bigr) = \mu \cdot Et' < 0;$


thus by (20) and (21) $Ex_{t'} < b$. Since $b$ was any constant $> \beta$ this implies that
$Ex_{t'} \le \beta$ and hence by (18) that $t$ is optimal and $V = \beta$.
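
To make (15), (16) and (18) concrete, suppose (purely as an illustrative assumption, not a case treated in the text) that $F$ is the uniform distribution on $[0, 1]$ and that $0 < c < 1/2$. Then $\varphi(\beta) = (1 - \beta)^2/2$ for $0 \le \beta \le 1$, so (15) has the closed-form solution $\beta = 1 - \sqrt{2c}$. The sketch below, with hypothetical function names, solves (15) by bisection and evaluates $Ex_t$ from the formula preceding (18).

```python
import math


def phi_uniform(beta):
    """phi(beta) = integral over y >= beta of (y - beta) dF(y), F uniform on [0, 1].

    Valid for 0 <= beta <= 1, the only range searched below.
    """
    return (1.0 - beta) ** 2 / 2.0


def solve_beta(c, tol=1e-12):
    """Solve phi(beta) = c by bisection, assuming 0 < c < 1/2 so that 0 < beta < 1."""
    lo, hi = 0.0, 1.0  # phi(0) = 1/2 > c > 0 = phi(1)
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if phi_uniform(mid) > c:  # phi is decreasing, so the root lies to the right
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def expected_reward(beta, c):
    """E x_t = (integral_beta^1 y dy - c) / p, with p = 1 - F(beta) = 1 - beta."""
    p = 1.0 - beta
    return ((1.0 - beta ** 2) / 2.0 - c) / p


if __name__ == "__main__":
    c = 0.08
    beta = solve_beta(c)
    # Numerical root, closed form 1 - sqrt(2c), and E x_t should all be about 0.6.
    print(beta, 1.0 - math.sqrt(2.0 * c), expected_reward(beta, c))
```

For other distributions only `phi_uniform` and the integral in `expected_reward` would need to be replaced; the bisection relies solely on $\varphi$ being continuous and decreasing.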

The original proof of this result [4] along lines suggested by the general
theory was considerably more complicated and was valid only under the unnecessary
hypothesis that $\int_0^\infty y^2\,dF(y) < \infty$. The direct proof just given is based
on the trivially simple inequality (19) which reduces the problem from one
involving the maximum of $y_1, \dots, y_n$ to one involving a simple sum. This inequality
was pointed out to the author by David Burdick.
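
Inequality (19) holds for every sequence of numbers and every constant $b$, because $\max(y_1, \dots, y_n) \le b + \max_i (y_i - b)^+ \le b + \sum_{i=1}^{n} (y_i - b)^+$. A small numerical check, with a hypothetical helper name and arbitrary sample data, can be sketched as follows.

```python
def check_inequality_19(ys, b, c):
    """Return True if x_n <= b + sum_{i<=n} ((y_i - b)^+ - c) holds for every n."""
    running_max = float("-inf")
    running_sum = 0.0
    for n, y in enumerate(ys, start=1):
        running_max = max(running_max, y)
        running_sum += max(y - b, 0.0) - c
        x_n = running_max - c * n
        # Small tolerance: equality can occur, e.g. when only one y_i exceeds b.
        if x_n > b + running_sum + 1e-12:
            return False
    return True


if __name__ == "__main__":
    sample = [0.2, 1.7, -3.0, 0.9, 2.5]  # arbitrary illustrative values
    print(check_inequality_19(sample, b=1.0, c=0.3))
```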

We remark that the optimal stopping rule (16) in case (ii) is suggested
by the one-step argument which would have led us astray in Examples 1 and
4 (and in Example 3, also for that matter). For, putting $m_n = \max(y_1, \dots, y_n)$
we have


    $x_{n+1} = m_{n+1} - c(n+1) = \max(m_n, y_{n+1}) - c(n+1)$
    $\qquad = m_n + (y_{n+1} - m_n)^+ - c(n+1)$
    $\qquad = x_n + (y_{n+1} - m_n)^+ - c,$


so that $E(x_{n+1} \mid y_1, \dots, y_n) = x_n + E(y_{n+1} - m_n)^+ - c > x_n$ if and only if


    $\int_{m_n}^{\infty} (y - m_n)\,dF(y) > c;$


i.e., if and only if $m_n < \beta$.
We remark also that if we replace (13) by $\tilde{x}_n = y_n - cn$ $(n \ge 1)$, the optimal
solution remains the same, since


    $\tilde{x}_n \le x_n$, but $\tilde{x}_t = x_t$


with $t$ defined by (14) or (16).

The general theory of optimal stopping is more than a collection of examples 
and ad hoc methods of solution. It draws heavily on martingale theory and the 
other apparatus of modern probability theory, to which it also contributes. 
Rather than sketch the general methods in a necessarily incomplete form we 
have preferred to present a few simple examples in the hope that the reader will
be induced to consult some of the references below for a more formal presentation.

Our last example will be one in which the number of random variables is 
finite from the outset and hence no difficulties of principle arise concerning the 
existence of an optimal solution, but the explicit result is none the less surprising. 


EXAMPLE 6. An employer interviews a finite number $N$ of applicants for a
position. They are Miss 1, Miss 2, ..., Miss $N$, in decreasing order of excellence,
and they are interviewed in random order, each of the $N!$ permutations
being equally likely. The rules of the game are as follows. After each interview
the employer can rank the girl just interviewed relative to her predecessors but
is ignorant of her absolute rank relative to the whole group of $N$. Thus if the
first few girls in order of appearance were Misses


    7, 3, 9, 4, ...

the employer would know successively that


    the first girl has relative rank 1
    the second has relative rank 1
    the third has relative rank 3
    the fourth has relative rank 2, etc.


The employer must hire one applicant, but if at any stage he does not hire the
girl just interviewed he must dismiss her and cannot call her back later. The
object of the employer is to minimize the expected absolute rank of the girl hired.
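
The information available to the employer can be sketched in a few lines of Python. The helper name `relative_ranks` is hypothetical; it takes the absolute ranks in order of appearance (smaller is better) and returns what the employer actually observes.

```python
def relative_ranks(arrivals):
    """Relative rank of each applicant among those seen so far (1 = best so far).

    `arrivals` lists absolute ranks in order of appearance; smaller is better.
    """
    ranks = []
    for k, current in enumerate(arrivals):
        better_seen = sum(1 for earlier in arrivals[:k] if earlier < current)
        ranks.append(better_seen + 1)
    return ranks


if __name__ == "__main__":
    print(relative_ranks([7, 3, 9, 4]))  # [1, 1, 3, 2], as in the example above
```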

If $t$ is any stopping rule for this problem and $x_t$ denotes the absolute rank of
the girl hired using $t$, then the possible values of $x_t$ range from 1 (best girl hired)
to $N$ (worst hired). If $t$ consists of always hiring the first girl to appear (or hiring
at random one of the $N$ girls) then


    $Ex_t = \dfrac{N + 1}{2},$


since the first (or a randomly chosen) applicant is equally likely to have any of
the absolute ranks $1, 2, \dots, N$. Clearly, a good strategy $t$, based on the information
on relative ranks, should be able to do better than that. Denote


    $R_N = \inf_t \{Ex_t\} \qquad (N \ge 1),$


where the inf is taken over all the (finite) number of possible strategies available
with a group of size $N$. For example, when $N = 1$ only one strategy is available
and $R_1 = 1$, for $N = 2$ there are only two possible strategies, equally good, and
$R_2 = 3/2$, for $N = 3$ there are four strategies available of which the best gives
$R_3 = 5/3$, etc.

It is possible by using the general theory to devise a computer program for
finding $R_N$ for any given $N = 1, 2, \dots$. It amounts to the following: Define
$c_{N-1} = (N + 1)/2$, and for $i = N - 1, N - 2, \dots, 1$ compute successively


    $s_i = \Bigl[\dfrac{i + 1}{N + 1}\,c_i\Bigr]$, where $[x]$ = largest integer $\le x$, and


(22)    $c_{i-1} = \dfrac{1}{i}\Bigl\{\dfrac{N + 1}{i + 1} \cdot \dfrac{s_i(s_i + 1)}{2} + (i - s_i)\,c_i\Bigr\}.$


Then $R_N = c_0$. The backward induction from $c_{N-1}$ to $c_0$ is an example of the
general algorithm for solving optimal stopping problems when there are only a
finite number of successive stages $1, \dots, N$ at which stopping is permitted. In
the present problem, $c_i$ represents the minimal expected absolute rank attainable
by any stopping rule which must stop somewhere between the $(i+1)$th and
$N$th stages inclusive.

Using this procedure it is found that


(23)    $R_{10} = 2.56, \qquad R_{100} = 3.60, \qquad R_{1000} = 3.83.$


(The optimal strategy in each case is also produced by the computer but we
shall not bother to describe it here.) No general formula for $R_N$ other than the
recursive definition by way of (22) is available, and for each $N$ the recursion
has to be started afresh.
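
The backward induction can be sketched in Python as follows. The code uses our reading of the recursion, namely $c_{N-1} = (N+1)/2$, $s_i = [(i+1)c_i/(N+1)]$ and $c_{i-1} = \frac{1}{i}\{\frac{N+1}{i+1} \cdot \frac{s_i(s_i+1)}{2} + (i - s_i)c_i\}$, which rests on the fact that an applicant of relative rank $j$ at stage $i$ has expected absolute rank $j(N+1)/(i+1)$. The function name is hypothetical, and floating-point arithmetic is an implementation choice made here for brevity.

```python
import math


def minimal_expected_rank(n):
    """Return R_N = c_0 for N = n >= 1 by backward induction."""
    c = (n + 1) / 2.0  # c_{N-1}: forced to take the last applicant
    for i in range(n - 1, 0, -1):  # i = N-1, N-2, ..., 1
        # s_i: largest relative rank at stage i that is worth accepting
        s = math.floor((i + 1) / (n + 1) * c)
        # c_{i-1}: average over the i equally likely relative ranks at stage i
        c = ((n + 1) / (i + 1) * s * (s + 1) / 2.0 + (i - s) * c) / i
    return c


if __name__ == "__main__":
    for n in (1, 2, 3, 10, 100, 1000):
        print(n, round(minimal_expected_rank(n), 2))
```

A tie in the floor computation (where stopping and continuing are equally good) does not affect the value of $c_{i-1}$, so rounding error in `s` at such points is harmless.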

Inspection of (23) shows that the values $R_N$ seem to be increasing, but surprisingly
slowly. It can in fact be proved that $R_N$ increases steadily with $N$ and
hence that the constant

    $R = \lim_{N \to \infty} R_N \le +\infty$

is well defined. What is its value, finite or infinite? In [3] it is proved that


    $R = \prod_{j=1}^{\infty} \Bigl(\dfrac{j + 2}{j}\Bigr)^{1/(j+1)} \cong 3.8695.$


Thus, no matter how large $N$ is, there exists a strategy for which the expected
absolute rank of the girl hired is less than 4, even though the actual absolute
rank of the girl hired may be as great as $N$.
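
The numerical value of the limit can be checked by truncating the product $\prod_{j \ge 1} ((j+2)/j)^{1/(j+1)}$. In the sketch below the function name and the default truncation point are assumptions; since the logarithm of the $j$th factor behaves like $2/j^2$, the neglected tail after $J$ factors is of order $2/J$.

```python
import math


def limiting_rank(terms=200_000):
    """Partial product of ((j + 2) / j) ** (1 / (j + 1)) for j = 1..terms."""
    log_sum = 0.0
    for j in range(1, terms + 1):
        log_sum += math.log((j + 2) / j) / (j + 1)  # sum logs for stability
    return math.exp(log_sum)


if __name__ == "__main__":
    print(limiting_rank())  # close to 3.8695, approached from below
```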

If instead of arriving in random order with each permutation equally likely, 
the applicants are sent in by a malicious competitor of the employer who wants 
to force him to do badly, the competitor can use certain permutations more often 
than others in such a way that no matter what strategy the employer uses 
(even knowing the competitor’s mode of randomization) he will always have 
expected absolute rank of applicant hired $= (N + 1)/2$, the purely random
expectation. This is relevant to certain real or imaginary military applications of
the hiring problem. 

There are many variants [12] of this problem, the most common being that 
in which the employer, instead of trying to minimize the expected absolute rank
of the girl hired, tries to maximize the probability of hiring the best girl,
regardless of whom he hires when he does not get the best. For this case it has long
been known that a probability 1/e of hiring the best can be attained by the 
proper strategy. The problem as we have put it is, of course, more difficult and 
perhaps more interesting.


REMINISCENCES OF A MATHEMATICAL IMMIGRANT
IN THE UNITED STATES 


SOLOMON LEFSCHETZ, Princeton University and Brown University 


My career as mathematical immigrant began in 1911 upon my receiving the 
Ph.D. degree from Clark University (Worcester, Mass.). While small, Clark had 
as its President G. Stanley Hall, an outstanding psychologist, and several 
distinguished professors. The mathematical faculty consisted of three members: 
W. E. Story, senior professor (higher plane curves, invariant theory); Henry 
Taber (complex analysis, hypercomplex number systems); De Perrott (number 
theory). 

There were great advantages for me at Clark. I graduated from the Ecole 
Centrale (Paris) (one of the French “Grandes Ecoles”) in 1905, and for six years 
was an engineer. I soon realized that my true path was not engineering but 
mathematics. At the Ecole Centrale there were two Professors of Mathematics: 
Emile Picard and Paul Appel, both world authorities. Each had written a 
three-volume treatise: Analysis (Picard) and Analytical Mechanics (Appel). I 
plunged into these and gave myself a self-taught graduate course. What with a 
strong French training in the equivalent of an undergraduate course, I was all 
set. 

To return to Clark, I soon obtained a research topic from Professor Story: 
to find information about the largest number of cusps that a plane curve of 
given degree may possess. An original contribution which I made secured my 
Ph.D. thesis and my doctorate in 1911. 

At Clark there was fortunately a first rate librarian, Dr. L. N. Wilson, and 
a well-kept mathematical library. Just two of us enjoyed it: my fellow graduate
student in mathematics and future wife, and myself. I took advantage of the 
library to learn about a number of highly interesting new fields, notably about 
the superb Italian school of algebraic geometry. 


Prof. Lefschetz continues an astonishingly productive career. His profound influence in the 
development of topology and of algebraic geometry is expounded at length in articles by W. V. D. 
Hodge and Norman E. Steenrod in the Princeton Symposium volume in honor of S. Lefschetz, 
Algebraic Geometry and Topology (1957) edited by R. H. Fox, D. C. Spencer, and A. W. Tucker. 
His numerous publications in these fields include the books, L’ Analyse Situs et la Géométrie Algé- 
brique (1924), Géométrie sur les Surfaces et les Variétés Algébriques (1929), Topology (1930), Algebraic 
Topology (1942), Topics in Topology (1942), Introduction to Topology (1949), and Algebraic Geometry 
(1953). In recent years he has produced fundamental research in ordinary differential equations, 
including the volumes Differential Equations, Geometric Theory (1957), and (with J. La Salle) 
Stability by Liapunov's Direct Method with Applications (1961). 

Prof. Lefschetz began his mathematical career in 1911 with his PhD under W. E. Story at 
Clark University. He held positions at the Univ. of Nebraska, Univ. of Kansas, then Princeton 
University until his retirement. At Princeton he was Research Professor, 1932-1953, and Depart- 
ment Chairman, 1945-1953. Since, he has been at the National University of Mexico, RIAS, and 
Brown University. His numerous awards include the Bordin Prize (Académie des Sciences 1919), 
the Bôcher Prize (AMS 1924), the Feltrinelli Prize (Academia dei Lincei 1956), and foreign
memberships in the Royal Society and the Académie des Sciences. He was Editor, Annals of
Mathematics, President AMS (1935-1937), and is a member of the National Academy of Sciences and
the American Philosophical Society. Editor.

My first position was an assistantship at the University of Nebraska
(Lincoln), soon transformed into a regular instructorship. This meant my first 
contact with a regular midwestern American institution and I enjoyed it to the 
full. I owed it mainly to the very pleasant and attractive head of the depart- 
ment, Dean Davis of the College. The teaching load, while heavy, did not over- 
whelm me since it was confined to freshman and sophomore work. 

Not too many weeks after my arrival, the Dean got me to speak before a 
group of teachers in Omaha on “Solutions of algebraic equations of higher 
degree.” And then and there I learned an all-important lesson. For I spoke three 
quarters of an hour—three times my allotted time! When I found this out some 
weeks later from the Dean, my horror knew no bound. I decided “never again,” 
to which I have most strictly adhered ever since. 

A second lesson was of another nature. I utilized my considerable spare 
time in reading Hilbert’s recent papers on integral equations. At Clark I had 
also read Fredholm’s Acta paper on the same topic and my enthusiasm for inte- 
gral equations was very great. I offered to lecture on Hilbert’s work in my 
fourth term and this was accepted. Consequence: a very heavy teaching load 
for two students who I fear were quite bewildered. One of them, Oliver Gish, a 
graduate student in physics (later a distinguished geophysicist) remained my 
lifelong friend. I also formed a close friendship with his mentor and a capable 
mathematician, Professor L. B. Tuckerman (later of the Bureau of Standards). 

The course taught me a valuable lesson: the experience generally absorbs too 
much energy. I have since expressed this opinion to many a recent doctor, but I 
fear that few heeded it. 

My two years in Nebraska made me realize a widespread feature of American
institutions of higher learning which were State institutions. By general state 
rule they had to accept any graduate from an accredited high school. Conse- 
quence: in the freshman year a flood of very poorly prepared students and 
a large number of sections, especially in the first term. By the end of the first 
year the entrance flood was reduced to half; the sophomore sections—in mathe- 
matics at least—were in much smaller number, more readily handled and better 
taught. This went on down to the last year, with the flood in mathematics 
reduced to 10-15 or so (mostly girls) and the total number of graduates much 
smaller than at entrance. 

Lincoln, the capital of the State (population about 50,000) was a very 
pleasant city, with a distinct urban flavor. It was not too far from Omaha, the 
major city of the State. Most family houses were surrounded by a small garden 
and the whole made a very good impression. The University was at one end of 
town; the Agricultural College, part of the University (pet of the very rural 
Board of Regents), at the other end. There were a couple of small colleges 
situated in Lincoln. 


At the end of two years (1913) a larger offer, plus my approaching marriage 
to my Clark fellow student, made me accept an instructorship at the University 
of Kansas in Lawrence. The teaching conditions there were the same as in
Lincoln, but with a slightly smaller load. At the University of Kansas the
department was divided into two groups: college plus graduate work and engi- 
neering. I was assigned to the latter. While the students were somewhat more 
purposeful, the preparation was equally weak in both parts. 

Lawrence (population 12,000) had a rather severe New England tradition. 
Except for the University with about 3000 students, it was really a most pleasant 
rural community. The University was on top of quite a hill, with well-con- 
structed and mostly recent buildings. The view from the top was exceptionally 
attractive. 

The major city near by was Kansas City. Lawrence was about 25 miles from 
Topeka (the capital), while Kansas City was 50 miles away. This was all before 
the automobile age and my friends and I indulged in many country walks. 

The general entrance preparation in Lawrence and Lincoln was so feeble 
that early teaching could only be technical and deprived of theory. As the 
freshman flood eroded, this situation improved somewhat. 

The rule in Lawrence for beginning faculty members was three years in 
each position and it was rather rigidly enforced. The situation did not seem 
perfect—far from it. However, I discovered in myself, first a total lack of desire 
to “reform” coupled with a large adaptive capacity. At Lawrence I only co- 
operated with a colleague in driving out several unattractive texts, notably 
Granville’s Calculus, for which my taste was $< \varepsilon$.

Years later I inquired of Professor Lusin (Moscow) why the Soviet mathe- 
maticians translated Granville. Reply: “We only took his excellent collection of 
problems, but provided our own theory.” This may explain our efforts to move 
this book out of Kansas. 


At this place I was prepared to indulge in extensive criticism, at least of the 
midwestern system. The fact is, however, that in both Nebraska and Kansas I 
found good and well-kept mathematical libraries, ample at least for my own 
purposes. Moreover, I came to realize the enormous advantage over the Euro- 
pean system: it provided uncountably many opportunities for younger research 
men with ideas to grow and develop their powers, as instructors for example, 
with ample leisure. For the teaching loads, while considerable, were not really 
intolerable. Moreover, they generally went with colleagues who had other 
interests, mathematical or administrative, but not intent upon imposing on one 
uncongenial mathematical interests. At all events, in my case, it turned out to 
be of great value. Needless to say, special research favors were rare indeed. 

In spite of the general level, I had in Lawrence three or four excellent stu- 
dents. One of them, Warren Mason, went to work for Bell Laboratories in 
New York (later near Elizabeth, N. J.), took his Ph.D. in physics at Columbia, 
and at Bell became a top specialist in the theory of sound and its applications. 
I am very proud of him. Still another strong student, Clarence Lynn, joined 
forces with Westinghouse in Pittsburgh (electrical department) and was most 
successful there. 

I have found that in freshman courses in mathematics, and less so in the
next year, hardly one third of the students care for and are not totally bored by
mathematics. Hence at that early level a teacher must be exceptionally lively 
and have a sympathetic understanding of the students. Needless to say this 
must be coupled with a complete grasp of the topic taught. 

Here are a few very radical suggestions for later years. From the junior year 
on through graduate work they should be merged into a professional school, 
with teaching, at least in mathematics, of seminar type plus abundant but easy 
contact with faculty on an individual basis. In other words “baby talk” should 
end with sophomore years. 


The guidelines in my research were: Picard-Simart: Fonctions algébriques 
de deux variables (two volumes, mostly Picard); Poincaré’s papers on topology 
(=analysis situs) and on algebraic surfaces; Severi’s two papers on the theory 
of the base; Scorza’s major paper (dated 1915) in Circolo di Palermo on Riemann
matrices. 

Around 1915 and for a long time, a certain result of Picard baffled me. Let 
H be a hyperelliptic surface. Direct calculation yielded: the Betti number 
$R_2(H) = 6$. Picard, however, appeared to give its value as 5. The discovery of
the missing link played a major role for me. Namely, Picard only wanted $R_2$
for the finite part of H, neglecting the curve C at infinity. Hence C was a 2-cycle, 
and so was any algebraic curve! This launched me into Poincaré-type topology, 
the 1919 Bordin Prize of the Paris Academy and in 1924 Princeton! (The trans- 
lated prize paper appeared in the Trans. Amer. Math. Soc., vol. 22, 1921.) 

The immediate effect of the Prize was the Kansas promotion (January 1920) 
to Associate Professor plus a schedule reduction. Also (1923) there came a 
promotion to a Full Professorship. I spent the year 1920-21 in Europe, half in 
Paris, half in Rome. I gathered little mathematical profit in Europe; some from 
the summer of 1921 which I spent in Chicago. 

About Paris I particularly remember an interview with Emile Borel lasting 
five minutes in which I offered to write for his Series my future monograph 
L’Analysis Situs et la Géométrie Algébrique. He accepted at once! (In such
matters our “speedy” country knew no such speed.) Proof sheets, etc., were 
dealt with rapidly and not a syllable was changed. 


I come now to my Princeton period. In 1923 an invitation came from Dean 
Fine, the Chairman of the Department of Mathematics and Dean of the 
Faculty at Princeton, to spend the following year there as Visiting Professor of 
Mathematics. Dean Fine was the long-time head of the department and the 
true founder of what became an outstanding department of the University. 
With reason, upon the construction of the mathematical building it was called 
“Fine Hall.” (Dean Fine was killed in an automobile accident just before 
Christmas 1928 and his lifelong friend, Mr. Thomas D. Jones, immediately 
granted $600,000 as a memorial to Dean Fine for a new mathematical building.) 

Well, upon receiving Dean Fine’s invitation, I accepted. For the following 
year I received a permanent offer to stay at Princeton as Associate Professor.
This was changed 18 months later (January 1927) to a Full Professorship and
January 1932 to a Research Professorship (Fine professorship) as successor to 
Oswald Veblen. In this position I had no assigned duties whatever. 

At Princeton I found myself in a world-renowned University and in one of
its outstanding Departments. Among the great mathematical Professors there 
were: Eisenhart, Veblen, Wedderburn, Alexander, Hille. I was in closest contact 
with Alexander—a top authority in topology. 

My joining the Princeton faculty coincided with a definite change of direc- 
tion in my research from the applications of nascent topology to algebraic 
geometry (vide my prize paper) to a pure topological problem: coincidences and 
fixed points of transformations. For this problem I invented a completely new 
method of attack, which by 1925 culminated in a well-known fixed point
expression $\varphi(f)$, $f$ a mapping of a manifold into itself, that said: $\varphi \ne 0$ implies that
$f$ has fixed points; if $f$ has none, $\varphi = 0$. The preparation and extensions required
occupied me for several years. One of my early graduate students, A. W. Tucker, 
an outstanding Princeton mathematician, found the way to a far simpler method 
than my early one, which I have accepted in toto.


Much of my Princeton teaching, until 1930, was still freshman-sophomore. 
However the students, selected with care at entrance, were much better pre- 
pared than in the midwest. The contrast of the systems was very great. 


Princeton system: A strictly private school, with limited funds and space, 
could not accept all comers. Hence it had, unavoidably, to fix the number of 
admissions, utilize a strict selection, and keep the admitted men practically 
through the four collegiate years. The same system, in some form, was also 
applied to admission to the Graduate School. 


Midwestern system: As I already stated, they had to admit all duly certified
high school students. The freshman entrance flood resulted in teaching mostly 
by graduate students, many of uncertain quality. 

The Princeton system had two important consequences. First, it enabled 
one to organize preferred sections even before entrance. Second, courses could 
be initiated at a more advanced stage and proceed more speedily. Thus algebra 
and trigonometry were done each in two weeks, analytical geometry in five 
weeks, calculus started in the second freshman semester (in Kansas-Nebraska 
in the sophomore year). 

Some years later, good students from strong preparatory schools or high 
grade secondary schools (where they already had these subjects) were allowed 
to skip, even the whole first year. Moreover, such A-1 men (not many) were 
soon treated like graduate students, allowed to participate in advanced seminars 
and thus to become well acquainted with the members of the mathematical 
faculty. 

The Princeton aim was decidedly different from the Nebraska-Kansas aim. 
The latter had to provide for a considerable number of teachers in their states, 
to form moderate level technicians of all kinds, sending a very few of the best
for better training to major eastern institutions. Princeton on the contrary was
planned to form the top echelons, notably in the sciences. This meant aiming 
first for the doctorate. In mathematics it soon became customary to retain the 
best men for at least one year after the Ph.D. on some fellowship, or in some 
teaching position with very light duties. A number of the men so developed 
occupy today major posts in outstanding institutions. 


In 1932 a major change took place through the establishment at Princeton 
of the Institute for Advanced Study, with mathematics as its first and strongest 
group. This resulted in the migration of three of our major members: Veblen, 
Alexander and von Neumann. 

The basic effect on me was regaining the mathematical calm of Nebraska- 
Kansas, which I had so enjoyed without realizing it. Our mathematics chair- 
man, Dean L. P. Eisenhart, with the unstated motto “live and let live” had 
much to do with this return of calm. During this period my mathematical work 
progressed. My first Topology treatise (1930) appeared and was many times 
approved by friendly colleagues. A second Algebraic Topology appeared in 
1942, rather less satisfactory, because too algebraic. Other books came. I was 
editor of the Annals of Mathematics, which grew to occupy an A-1 place in 
mathematics, but did not overwhelm me with work. Then came World War II 
and I turned my attention to Differential Equations. With Office of Naval 
Research backing (1946-1955) I conducted a seminar on the subject from which 
there emanated a number of really capable fellows, also a book: Differential 
Equations, Geometric Theory (1957).

When Dean Eisenhart retired (1945) I succeeded him as Chairman, until 
my own retirement in 1953. 


In 1944 I joined as a part-time connection the Instituto de Mathematicas 
at the National University of Mexico. This continued until 1966. At the 
Instituto I was as free as under my Princeton professorship. I conducted seminars 
in topology and differential equations, gave a couple of times a “volunteer” 
course on “general mathematical concepts” directed at beginners and, thanks to 
a good working library, was able to continue research. Conditions were of course 
quite different from ours, but as I became rapidly fluent in Spanish, it gave me 
many advantages. Through the years I found quite a number of capable young 
men, several of whom I directed to Princeton for further advanced training 
up to the doctorate and later. Among them I may mention Dr. José Adem, 
Chairman of the Department of Mathematics of the newly founded Centro de 
Estudios Avanzados in Mexico City. 

My long connection with Mexico has been the occasion of many side trips 
(especially in connection with meetings of the Mexican Mathematical Society), 
so that I have a fair acquaintance with that wonderful country. 

In 1964 the rarely awarded order of the Aztec Eagle was conferred upon me 
by the government of Mexico. 


My work as Russian reviewer for differential equations had made me aware
of our lag relative to the Soviets in this all important field in all sorts of applica-
tions. The arrival of Sputnik in 1957 convinced me that this lag had to be 
remedied. As I attributed it to our scattered efforts, I came to the conviction 
that the only remedy was to establish a Center for study and research in dif- 
ferential equations. 

From Dr. Robert Bass, formerly a member of my project, I learned of the 
formation in Baltimore, as a division of the Martin Aircraft Company, of a new 
Research Institute for Advanced Study (RIAS) under the direction of Welcome 
Bender, a graduate of MIT and long time Martin engineer. When I approached 
him with my (modest) plans he was enthusiastic. In a few days I was entrusted 
with the formation of a group of say five top men and about ten younger asso- 
ciates, with myself as director. Suffice it to say that I had considerable success. 
I first was able to obtain the cooperation of Prof. Lamberto Cesari of Purdue, 
one of the major specialists anywhere; also of Notre Dame, Prof. J. P. Lasalle 
as my second in command (my best appointment) and complete the group 
with Dr. J. K. Hale of Purdue (Cesari’s best student there) and Dr. Rudolph 
Kalman of Columbia (an electrical engineer coupled with good mathematics). 
My strong basic group was thus complete. 

I demanded (and obtained) from Mr. Bender that my group operate under 
standard university conditions. 

Very shortly we became known. A considerable number of the good dif- 
ferential-equationists visited us, and some few were invited for a year or so. 

After some six years it was necessary to transfer our Center elsewhere. This 
operation, carried by Lasalle, resulted in our becoming part of the Division of 
Applied Mathematics at Brown University as “Center for dynamical systems” 
with Lasalle as Director and myself as (once weekly) Visiting Professor. At 
Brown our general relationship has been excellent. A year or so ago the Director 
of the Division died and was succeeded by Lasalle whose general performance 
could not be excelled. 


In conclusion I must recognize a budget of debts which I may never succeed 
in liquidating to the full. 

The first is my enormous debt to my wife Alice, my Clark companion. 
Without her constant and unfailing encouragement through 59 years, 56 as 
my wife, I would have long since ceased to operate. ... 

Second major debt: to the United States, which through their (however 
imperfectly organized) universities made it possible for me to follow my deep 
bent for mathematics. I should also include here the contribution of the National 
University of Mexico from 1944 to 1965—years after my Princeton retirement, 
and also of RIAS and Brown. 

In this long and agreeable route of 57 years I encountered so many simpáticos
amigos that to name them all would be impossible. May they one and all accept 
my fervent gracias for my debt to them. I hope that they have felt that it was 
not incurred in vain. 


STUDENT ORIENTED TEACHING— 
THE MOORE METHOD 


LUCILLE S. WHYBURN, University of Virginia 


“I just never took Point Set Topology until I graduated and took a Summer 
course from Moore, quite by accident, and liked him. I liked the excitement in 
his class, the fact that we were working. It was different from other classes. I 
started to understand fine details about things, which attracted me.” This 
student view of R. L. Moore’s classes contrasts sharply with Robert S. Morison’s 
statement before the AAAS-BAAS Conference in Boulder, Colorado, during 
April 1969: “My over-all impression, in returning to a university after a lapse 
of twenty years, is one of disappointment that so few students seem to have 
much fun either in their science courses specifically or in university life in 
general.” The loyalty of his current students affords Professor Moore the oppor- 
tunity to continue teaching in 1969 at the University of Texas though he is 
well past the usual age of mandatory retirement. He carries a fifteen hour teach- 
ing load of five courses. His offering consists of a course in Calculus and four 
courses chosen from the following listing: 

Introduction to the Foundations of Analysis; 

Introduction to the Foundations of Geometry; 

Theory of sets; 

Foundations of Mathematics; 

Point Sets and Continuous Transformations; 

Research in Point Set Theory. 

As one of the prerequisites the catalog lists “consent of the instructor,” a 
prerequisite that is not to be taken lightly. “Consent of the instructor” implies 
that the student has attained a certain stage of mathematical maturity, that he 
is interested in “doing” mathematics not in ferreting out what former mathe- 
maticians have done through the reading of theorems and proofs or the applica- 
tion of knowledge thus gained to problems. Passive listening followed by exhibi- 
tion on quizzes and examinations that he has understood what was said is not 
sufficient in this class; each member must want a piece of the action. The dis- 
covery of proofs and definition of concepts unknown to the student under 
strongly competitive stimuli challenge the student to exercise his own honor 
system about not reading related material or cribbing ideas from any source. 

In the popular mind the Moore-Method is the axiomatic method. Certainly 
this is a part of the method and it is a portion that is easily accepted by and has 
wide appeal for mathematicians. The proliferation of books in general topology, 
modern algebra, and set theory address themselves to the axiomatic method. 
Many of them are excellent. Such books have presented new insights into 
mathematical relationships, caused re-examination of curriculums, and intro- 
duced a new spirit of adventure into the teaching of mathematics. The essence 
of the Moore-Method, however, is to use an axiomatic treatment to create in 
the student a spirit of self-confidence and pleasure in personal creative endeavor. 
Much depends upon the handling of the material. The present widespread 
effort to use axiomatic systems at many levels of mathematical instruction has 
met with spotty success. From Professor Moore’s point of view, his first fruitful 
contact with a student is in the calculus. He is a thorough teacher of manipula- 
tive skills commonly emphasized in such courses but in addition he is likely to 
introduce some use of syllogisms and a hard look at Duhamel’s Theorem in 
order to afford the better students an opportunity to reason carefully. This 
challenge is not aimed at, nor does it attract, the indifferent or pedantic student. 

No textbook, no description, can recreate the excitement of a seasoned 
scholar searching for just the right talent for the fashioning of a pure research 
topologist. R. L. Moore is such a scholar in action. “Nobody can possibly teach 
by the Moore-Method other than R. L. Moore himself,” a former student once 
said. “The thing that makes him so successful and causes him to get so much 
work out of his students is that he knows just what to say and when to say it 
and how to say it so that it stimulates the very best effort in each individual 
student. When a good student shows up in one of Moore’s classes, Moore is 
immediately challenged to see whether he can make that student perform. He 
watches the student, watches his reactions, and, upon seeing the first sign of 
understanding, he knows just how to coax that out and turn it into a per- 
formance. Then, you see, he has the student spending time on Moore’s course.” 
All are free to participate, and Dr. Moore’s encouragement, support, and manip- 
ulation of students are of the highest order. The following description by 
R. H. Bing may convey the depth and flavor of the involvement: “Moore used 
a great many techniques for teaching. I think he had a certain charm about his 
personality that attracted students to him. Probably none of Moore’s students 
inspire the loyalty among students that Dr. Moore has engendered among his 
students. I don’t believe that we have the same charm. Our students might like 
us. We may do a good job of training in mathematics but we don’t have the same 
discipleship. He not only taught in the classroom but he taught in the halls. 
He taught through the influence of his wife. ... Also I remember walking to 
school on a number of occasions with other professors and they would say, 
‘Moore has been telling me about the theorem that you have proved in class and 
he says that he is very impressed with you.’ This is one of the techniques he had 
of encouraging his students by talking about them to other people and letting 
the word come back to them.” 

Immediately above the calculus, Introduction to the Foundations of Analysis 
carries the prerequisite: Six hours of Calculus and consent of the instructor. A 
student seeking a large body of information or expecting to be a passive member 
of a student group had best avoid the course. Lectures are not presented. Funda- 
mental assumptions and definitions are introduced as needed. Theorems to be 
proved are stated. No text is used. Professor Moore has never written a text 
based on this course nor is he avid to have others do so. Leading the student 
through unfamiliar material, presented without the aid of printed matter, is the 
procedure. The content of the course is not fixed; the procedure is. Coverage of 
material and its storage as a memory bank is not the objective. At first, only 
one-step reasoning is required to progress from assumptions or previously 
established results to the conclusion in the desired theorem. Usually several 
theorems are stated at once, numbered in the order of increasing difficulty, 
showing interdependence, and perhaps necessitating the discovery of and the 
definition of a property. Progress is made by the class. Someone may see that 
he can prove Theorem 3 provided Theorem 1 is true. His proof and its de- 
pendency on Theorem 1 will be examined. Inevitably false reasoning is presented 
upon occasion; this is turned to constructive use and the class is expected to 
develop a critical sense that will expose any mistakes in proofs presented. Thus 
give and take goes on among the students. 

As the course advances through the year more difficult theorems are stated. 
Multiple step reasoning becomes necessary; the usefulness of lemmas becomes 
apparent. More time, more thought, more searching of previously presented 
material is necessary. Certain theorems stand out in Professor Moore’s mind 
and he may say “I’m pleased you got Theorem 15.” He might point out the 
discovery of a concept, saying, “You have singled out the idea of upper-semi- 
continuity.” Also, from time to time false conjectures are introduced. Thus the 
student’s critical ability is further developed. Search for counterexamples, when 
proofs break down repeatedly, becomes routine procedure. In the more ad- 
vanced courses he will toss in unsolved problems, not making clear to the student 
that they are real research questions. Frequently when he raises such problems 
he does not know if the conjecture is correct or incorrect. One former student 
relates: “The first piece of research that I did under R. L. Moore, he tossed out 
one of these things and I got it. He made me go over the proof a second time in 
class. Finally, he said ‘You come in this afternoon and let’s go over that proof 
again.’ We spent about an hour checking it over carefully and at the end of that 
time he said ‘You write that up.’ ” 

Moore’s use of the axiomatic method differs significantly in intent and in 
consequence from the often encountered “development of the real number 
system.” The introduction of a system of axioms that underlie familiar notions 
and operations is fascinating to mathematicians of some maturity and may 
deepen their insight. Such a set of axioms can often be understood by less able 
and even public school students. The wide use of such material is amply justified 
and if a student is subjected repeatedly to an axiomatic approach to various 
parts of mathematics through textbook presentation he comes eventually to 
accept, appreciate, and understand the fundamental role of axioms. The beauties 
of mathematical structure become more apparent to him, and advantages of an 
orderly approach to even nonmathematical material become better understood, 
leading some people to speak of Systems Analysis in a broad sense. But the 
Moore-Method develops rugged individualism, as one man said. Indeed there 
are business men who attribute their success to the training they received in one 
or two courses with Professor Moore. Others think the desire to participate, the 
need to be unique, self-confident, self-directed, and to communicate, as well 
as the need for spontaneity are met. One Ph.D. recipient said, “I felt a great 
part of it was that you started from scratch and you built up an entity, in 
which you knew every piece.” Under the Moore-Method the student is pre- 
sented with a set of axioms based on a certain number of undefined terms from 
which he must create an understanding of the space or system set forth by the 
axioms. No models or edifying examples are presented. Imagination and con- 
jecture must seek and give meaning to the logical inference of the axioms. This 
is in accord with Moore’s statement: “That student is taught the best who is 
told the least.” 

Freedom for independent action is accorded each student, so long as he does 
not present another’s work as his own in class. Long before unlimited cuts be- 
came a student fetish, members of Professor Moore’s advanced classes were 
advised that if an individual did not wish to hear the proof of a particular 
theorem he was at liberty to leave the room. The presumption was that no stu- 
dent would avoid hearing the presentation of a proof except for the express pur- 
pose of preserving for himself the opportunity of creating his own proof. A 
spontaneous, informal honor system springs up among the students. To para- 
phrase a former student: “The work is a very personal thing. It is put on a level 
that you do not find subjects taught. Subjects are usually taught like mountains 
afar off, but when you study with Dr. Moore it is as a participant in a task at 
hand. When you are in his class you are talking about mathematics with him 
and with the students around. He has a personal enthusiasm about the subject 
that is very catching, everyone seems to have it. He obviously enjoys mathe- 
matics, does mathematics, and teaches mathematics. The students see that 
immediately. It is an enthusiasm that, even though a student may have been 
away from the University of Texas for some years, can still be recalled vividly. 
He gives one the feeling that mathematics is more than just a way of making a 
living; it is a way of life, an orderly fashion in which you will want to consider all 
things. It is a method of self-examination, of self-discovery. In his classes you 
are personally responsible for finding a proof; it is a question of whether or not 
you have the ability; consequently, discussing the problem with someone else 
would not be the way it should be done. It is a challenge he can make; this is 
something about his personality.” 

Imposing your superior knowledge upon others becomes tabu. A former 
student relates the following story. He and another long term student wished 
to join a summer course in the Foundations of Geometry which was being con- 
ducted for high school teachers, using a rigorous set of axioms, to enrich their 
understanding. They were admitted on the condition that they sit on the back 
row and not say anything until they were called on. “This is a classical example 
of the Moore-Method. He gave them a few axioms and some theorems to prove. 
They got to one theorem and one of the high school teachers got up to give a 
proof and it had a flaw in it. I sat there and pressed my lips together and Moore 
looked as if he believed it completely. Then he said, ‘Do we all understand that?’ 
Everyone said yes. Then he said, ‘Well, just to play safe, let’s go through it 
again.’ This man went through it a second time. Again Moore asked if they 
understood it. ‘Yes,’ they said. About that time I couldn’t stand it any longer 
and I called out, ‘Dr. Moore,’ and he jumped right down my throat. Well it 
turned out that it could not be proved on the basis of the axioms already 
stated. He did not want me to spoil his game of making them discover this for 
themselves. You see the beautiful way is to get the class to understand the need 
for a new axiom before you give it to them. Occasionally he would call on us for 
a theorem that they couldn’t prove but otherwise we were to keep our mouths 
shut.” The Moore-Method, then, seems to attain the objectives so often set 
forth of (1) getting the student involved, (2) leading the student into new per- 
sonal discoveries, and (3) letting each student proceed at his own pace. 

“I think that Moore is a very fine teacher but I don’t want to give credit for 
his being a very fine teacher to this method. I think the credit belongs to the 
man,” says Eldon Dyer. Again and again his former students emphasize the 
fact that not every class is filled with the proof of theorems. Logic, its use, and 
its relation to mathematics is explored. Great emphasis is placed on the precise 
use of language. Clippings from newspapers and magazines are sometimes 
brought into the class. The class is asked to analyze what these clippings say 
and what they are meant to say. It may be this precise way of looking at non- 
mathematical ideas wields great influence on a student’s philosophy, on his way 
of attacking things that concern him. It has been pointed out that Dr. Moore 
came in contact with eighty to one hundred students a year over more than a 
forty year period. Thus he came in contact with many nonprofessional mathe- 
maticians and there is considerable evidence he influenced a large number of 
them significantly. Others have pointed out that he is a southern gentleman. 
Students are invited into his home where he shares with them his love of classical 
music and subjects them to mild pranks such as measuring their height with 
an ordinary looking yardstick that is an inch short of three feet. At the univer- 
sity he generously spends time informally with members of his classes but he 
never permits undue familiarity or disrespect. He finds indirect and often 
humorous ways of showing his disapproval of certain kinds of conduct. Repeat- 
edly, former students refer to his patience and assert that they are unable to 
equal him in this respect. Moreover, it has been noted that there is a tendency 
now to bring women into mathematics in greater numbers but R. L. Moore 
accepted Ph.D. candidates when talent seemed evident. Anna Mullikin, who 
received her Ph.D. in 1922, was the first woman to write her thesis with him. 
Among others are Lida K. Barrett, Mary Elizabeth Hamstrom, and Mary Ellen 
Rudin. His unusual physical vigor tends to appeal to the men in his classes. 
Eyewitnesses say that when he was about sixty-seven or sixty-eight years old 
there was a student who seemed to be taking up too much time bragging about 
his physical feats. Professor Moore offered to show this young man what he 
could do; he did some impressive one arm push-ups. He did them with his feet 
up in a chair so that his head was angled down toward the floor; he did seven of 
them and jumped back up onto his feet without losing his wind. The student 
got the message. Professor Moore likes to box. The story persists that the first 
person he looked up when he went to Princeton as the first native American 
mathematician to tour on the American Mathematical Society Visiting Lecture- 
ship was the boxing coach, who was his sparring partner during the early days 
of his teaching career as an instructor at Princeton. Indeed it has been said that 
one of the secrets of his success as a teacher is that he is not afraid of a fight, 
either a physical one or a scholarly one. More to the point perhaps, he seems to 
inspire others with a feeling of their own mental strength, something quite 
different from being led to think that the professor is wonderful. 

Perhaps the fact that Moore threw his students into competition with each 
other in his graduate courses resulted from his being a scrapper. “It was an 
enjoyable kind of competition; there was no bitterness about it; there was this 
feeling that you were being developed, that you were being prepared to be a 
research mathematician. There was a feeling that you were already a participant 
in the making of mathematics.” W. L. Ayres puts it thus: “I have seen student 
after student go through for their degrees and usually the very first original 
work they do is their thesis. How much they do of that is somewhat doubtful at 
times. But under Moore you have had a long hard-knocks trail of research before 
you get to the doctor’s thesis. His students are thus prepared to stand on their 
own feet, which is an advantage. They have done research for three years.” 
Mary Ellen Rudin says: “I met Dr. Moore at age sixteen ... He signed me up 
to take trigonometry from him because that was the course I already knew that 
I wanted to take ... I started absolutely a blank sheet, and I ended up a pro- 
ducing mathematician. I took a course every year with Dr. Moore from the time 
that I entered as a freshman until I graduated with a Ph.D. I had tremendously 
good competition as a graduate student ... The student isn’t reading a book 
every day. He isn’t doing the sort of homework that one does in the usual course. 
His work is to produce mathematics every day and by producing mathematics 
every day he will form a habit. When he gets a Ph.D. he will continue to ‘pro- 
duce mathematics everyday.’ ” Yet Burton Jones has said: “My reaction was 
that it was the problems that Dr. Moore had given me to work. There is such 
challenge in trying to solve these problems or putting together the parts you 
know to see if you can not learn a little something more about the situation. 
I think this is the real power of the method. If you are mechanically inclined, 
you might compare it with having come upon a clock, say, that someone else 
has taken apart and you wonder if you can put it back together. There are 
some people that can’t resist this.” 

When Dr. Moore’s students go to other universities as faculty members and 
seek to talk about mathematics with their colleagues they do not find the same 
paternal encouragement existing among the older members of the faculty. 
This is a shock. Suddenly they see the need for a broad spectrum of known 
mathematical results so that communication among fellow faculty members will 
be fun again. Most of them spend the next few years in self-education, broaden- 
ing their view of the subject. Prime examples of this are R. L. Wilder, G. T. 
Whyburn, and R. H. Bing. One might say they climb those “mountains afar off” 
and having done so introduce new vistas. They have introduced new cross con- 
nections between differing branches of mathematics, and sought new ways of 
attacking old problems. They have stimulated their colleagues through their 
eagerness to learn, to create, and to teach. They have contributed to American 
mathematics not only worthwhile research treatises, Colloquium Lectures of 
the American Mathematical Society, expository papers, books coordinating and 
making available current mathematical research, but they have also served as 
Presidents of the American Mathematical Society and the Mathematical Asso- 
ciation of America, as editors of journals, as members of boards and committees 
for the federal government, and as participants in the moves to improve mathe- 
matics teaching at all levels. They have tried to search out and use the most 
fruitful portions of the Moore-Method and even to introduce this method to all 
mathematicians since the method seems to attract students into mathematics 
and increase the ability of nonmathematicians to apply mathematical methods 
to their problems. 

It has often been said that Moore dominates the Department of Mathe- 
matics at the University of Texas, and that graduate students spend a dispro- 
portionate part of their time on Moore’s courses. One former student has said, 
“The year I took Foundations of Analysis Situs, I took a course in Potential 
Theory in which I fear I showed very little potential.” However, the second and 
third generations of Moore-Method practitioners have not found it necessary 
for their students to neglect their other courses and have functioned well in 
large departments where the Ph.D. in other fields is awarded in considerable 
numbers. Each of these professors modifies the method, adapting it to his im- 
mediate surroundings. At one institution where the beginning graduate course 
is based on a set of axioms either for a semimetric space, a Moore space, or an 
abstract space, the students are asked not to read for the Fall term. It is made 
clear that the objective is to develop the ability to find proofs for one’s self. 
By the end of the Fall term, however, they are encouraged to buy and begin to 
read J. L. Kelley’s book on General Topology and they are warned that half of the 
examination in the Spring will be on this outside reading. Meanwhile, the class 
resumes personal development of proofs through the study of continua, trans- 
formations, or some other area not specifically treated in Kelley. At one univer- 
sity, the beginning graduate course in topology begins with a set of axioms, pro- 
ceeds in the Moore manner accommodating the pace to the particular student 
group until the end of the first semester, then uses a mixed array of theorem 
proving sessions interspersed with lectures. At another university any one of 
several professors may conduct the first course in topology, each using some 
variation of the Moore-Method. During the student’s second year there are 
several courses available in different areas of topology from which he may 
choose one or two classes. The average time for a Ph.D. in this department being 
five years, he is able to acquaint himself with a relatively wide spectrum of 
topology as well as develop his creative ability. 

“The Moore-Method is a geometric method with an adequate set of axioms 
and the theorems which he asks his students to prove are just like the ‘originals’ 
proposed in some geometry texts. The difference is merely a matter of sophisti- 
cation or style.” “The Moore-Method is a modification of the German seminar 
where the basic objective was affording the professor and student the oppor- 
tunity to do mathematics together.” “The Moore-Method is the Socratic 
method.” “The Moore-Method is the man himself.” Many mathematicians 
understand the Moore-Method, but their attempts to put it into a concisely worded 
definition vary. Perhaps we had best take it as our undefined notion and simply 
try to describe some of its characteristics. R. L. Moore was the first mathema- 
tician to write his thesis for his Ph.D. with Oswald Veblen, according to the 
listing in the American Mathematical Society Semicentennial Publications, vol. 1 
(History). A Ph.D. recipient from Princeton University has said of Professor 
Veblen, “Without having too good a grasp of a subject, he could chat about it 
and stimulate others to produce in it.” Getting students to produce mathematics 
is certainly one of Professor Moore’s main objectives. To quote from the AMS 
Semicentennial History, “R. L. Moore’s first publication was a neat redundancy 
proof which he discovered while a graduate student in 1902, and which Halsted 
published.” George Bruce Halsted was Professor of Mathematics at the Uni- 
versity of Texas at the time. Giving a student some chance to consider unre- 
solved mathematical questions and produce new mathematics early in his grad- 
uate experience is habitual with Professor Moore. The above paper put him in 
correspondence with E. H. Moore and he later studied at the University of 
Chicago, where E. H. Moore was the first Chairman of the Mathematics De- 
partment after the founding of the University of Chicago, and where he had 
invited Oskar Bolza and Heinrich Maschke to become members of his depart- 
ment. “Moore was a fiery enthusiast, brilliant, and keenly interested in the 
popular mathematical research movements of his day; Bolza, a product of the 
meticulous German school of analysis led by Weierstrass, was an able and 
widely read research scholar; Maschke was more deliberate than the other two, 
sagacious, brilliant in research, and a most delightful lecturer ... These three 
men supplemented one another remarkably ... During the period of 1892- 
1908 the University of Chicago was unsurpassed in America as an institution for 
the study of mathematics.” Other remarks about E. H. Moore from the AMS 
Semicentennial History reveal: “In the lecture room Professor Moore’s methods 
defied most established rules of pedagogy. Such rules, indeed, meant next to 
nothing to him in the conduct of his advanced courses. He was absorbed in the 
mathematics under discussion to the exclusion of everything else...” Defin- 
itely, this description points to perceptual influence on R. L. Moore by E. H. 
Moore. Further evidence of this might lie in the fact that Professor E. W. Chit- 
tenden was invited to the University of Texas to teach a graduate course in the 
summer and that he used a similar method. 

Under the sponsorship of the Mathematical Association of America some 
effort has been made to demonstrate the Moore-Method in Summer Institutes 
for Teachers and to provide visiting lecturers who sometimes talk about the 
Moore-Method and attempt to widen the understanding of topology. Professor 
Burton Jones of the University of California at Riverside has experimented with 
the method in Australia and lectured on it in various Indian universities. Pro- 
fessor R. H. Bing has generalized the method and is using it to arouse a group 
attack on the Poincaré conjecture. Professor E. E. Moise at Harvard University 
has established a program for teachers which reflects the fact that he worked 
with Professor Moore for his Ph.D. Also, considerable work is being done in the 
School of Education at the University of Wisconsin that is related to the Moore- 
Method. In the opinion of the author, at least, R. L. Moore will not begrudge 
anyone the use of his method even in modified form so long as its use effectively 
develops in a student creative mathematical ability, the correct use of logical 
inference, or careful precise expression of ideas. 


The author wishes to express her gratitude for the ideas, suggestions, and encouragement 
accorded her through letters and personal interviews by the following mathematicians: W. L. Ayres, 
B. J. Ball, Gayle Ball, R. H. Bing, W. L. Duren, Eldon Dyer, H. M. Gehman, George Henderson, 
F. Burton Jones, J. H. Roberts, Mary Ellen Rudin, J. M. Slye, J. M. Thomas, F. P. Welch, W. M. 
Whyburn, J. M. Worrell, and Leo Zippin. She is also indebted to her husband, Gordon T. Why- 
burn, for his kindly criticism and helpfulness. 


AN ISOPERIMETRIC PROPERTY OF SURFACES 
WITH MOVEABLE BOUNDARIES 


J. C. C. NITSCHE, Institute of Technology, University of Minnesota 


Viewing various experiments with soap films, one is led to consider the 
following generalization of Plateau’s problem: 


Determine a surface of a prescribed topological type and of least area which is 
bounded by a configuration of several fixed contours, supporting surfaces, and move- 
able threads of given lengths. 


For a list of pertinent literature see [15], page 239. Soap laminae spanning 
frames with moveable contours—experimentally realized by hairs or thin silk 
threads—have been described, among other places, in [3], p. 48; [4], p. 174; 
[14], p. 584. 

While Plateau’s problem is always solvable, it is a well-known fact (for 
literature see [18]) that the existence of a solution for the general problem 
hinges on certain metric-geometric conditions which must be satisfied by the 
bounding configuration. Otherwise the minimizing sequences on which the exis- 
tence proof is based do not converge or, if they do, converge only to degenerate 
limit structures—disconnected surfaces of lower topological types or one- 
dimensional formations. Two simple examples may illustrate these contin- 
gencies: A pair of distinct Jordan curves at too large a distance is not capable 
of bounding a doubly-connected minimal surface. A frame consisting of a fixed 
Jordan arc in space and a moveable thread of equal length with the same end 
points cannot span a nondegenerate surface of least area, since this surface 
would collapse into the Jordan arc. 

In a number of cases, however, existence proofs are feasible by extensions 
of the classical methods. The boundary behavior of the solution surface is fully 
studied for the fixed contours ([8], [10], [12], [17]) and satisfactorily under- 
stood for the free boundaries, i.e., the boundary portions on supporting manifolds 
([5], [6], pp. 218-223, [9], [13]). Nothing is known yet for the moveable con- 
tours. For well-posed problems it is very likely that the solution surface will be 
analytic up to the moveable parts of the boundary and free of branch points on 
these parts. Anticipating this fact, i.e., assuming the existence of a solution 
surface with sufficient regularity properties, we shall derive here a geometrical 
property which characterizes those contours that are freely moveable, subject 
only to the fixation of their lengths and, possibly, their end points. 


THEOREM. The moveable parts of the boundary must be curves of constant space 
curvature. On the solution surface itself they are asymptotic lines of constant geodesic 
curvature. 
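
The two statements of the theorem can be seen to agree through the standard decomposition of the curvature of a curve lying on a surface. The following is only a brief gloss; the symbols $\kappa$ (space curvature), $\kappa_g$ (geodesic curvature), and $\kappa_n$ (normal curvature) are notation assumed here for illustration and are not taken from the paper.

```latex
\kappa^2 = \kappa_g^2 + \kappa_n^2 ,
\qquad
\kappa_n = 0 \ \text{(asymptotic line)}
\;\Longrightarrow\;
\kappa = |\kappa_g| = \text{const}.
```

Thus, along an asymptotic line, constancy of the geodesic curvature and constancy of the space curvature amount to the same condition.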


Variational problems describing surfaces with variable boundaries have 
been touched upon in the literature before (see, in particular, [2], p. 671). It is 
curious to note, however, that so far no results of a geometric nature seem to 
have been extracted from the formal conditions. There is an interesting con- 
nection to the isoperimetric problem on surfaces which has been extensively 
treated over the years since it was first discussed by F. Minding and J. Steiner. 
For the newer literature on this subject consult the bibliography in [16], for 
the older literature compare [11], pp. 614-620; see further [1], p. 154 and [7], 
pp. 149-153. If a moveable contour were to vary on a fixed surface, its optimal 
position would have to be that of a curve of constant geodesic curvature. For 
the problem at hand, of course, the solution surface itself is unknown at the 
start. Considering that closed curves of constant geodesic curvature very pos- 
sibly may not exist in certain regions of a (fixed) surface, one realizes that our 
general problem still poses many questions and invites interesting comparisons, 
particularly in cases where freely moveable contours are provided. 

We shall now prove our theorem for the concrete case of a frame which consists
of a fixed rectifiable Jordan arc $\Gamma_1$ and a moveable arc $\Gamma_2$ of prescribed length
(smaller than that of $\Gamma_1$) having its end points in common with $\Gamma_1$. Denote by
$P$ the semidisc $\{u, v;\ u^2+v^2<1,\ v>0\}$ and let $S=\{\mathfrak{x}=\mathfrak{x}(u,v);\ (u,v)\in\bar P\}$ be a
surface of least area bounded by this frame. It is assumed here that the position
vector of $S$ possesses the following properties:


1970] AN ISOPERIMETRIC PROPERTY OF SURFACES 361 


(i) $\mathfrak{x}(u,v)$ belongs to class $C^0(\bar P)\cap C^2(P)$ and satisfies in $P$ the relations
$\Delta\mathfrak{x}=0$, $\mathfrak{x}_u^2=\mathfrak{x}_v^2$, $\mathfrak{x}_u\mathfrak{x}_v=0$.

(ii) $\mathfrak{x}(u,v)$ provides a topological mapping of the boundary portions
$\partial_1P=\{u,v;\ u^2+v^2=1,\ v\ge 0\}$ and $\partial_2P=\{u,v;\ -1\le u\le 1,\ v=0\}$ onto the curves $\Gamma_1$
and $\Gamma_2$, respectively.

(iii) $\mathfrak{x}(u,v)$ is analytic (or at least sufficiently regular) in $P$ as well as in the
interior points of $\partial_2P$, and $\mathfrak{x}_u^2>0$ in the latter points. For $0<r<1$ we set
$P_r=\{u,v;\ u^2+v^2<r^2,\ v>0\}$ and we define the boundary portions $\partial_1P_r$ and $\partial_2P_r$
as before. $S_r$ then will denote the portion $\{\mathfrak{x}=\mathfrak{x}(u,v);\ (u,v)\in\bar P_r\}$ of the surface $S$.

Now let $\mathfrak{z}(u,v)$ be an arbitrary vector which is sufficiently regular in $\bar P_r$,
$0<r<1$, vanishing along $\partial_1P_r$, and consider the varied surface

$$S_r(\epsilon)=\{\mathfrak{x}=\mathfrak{y}(u,v;\epsilon);\ (u,v)\in\bar P_r\}$$

whose position vector is $\mathfrak{y}(u,v;\epsilon)=\mathfrak{x}(u,v)+\epsilon\,\mathfrak{z}(u,v)$. In order for $S_r(\epsilon)$ to be
admissible, the length of the image of $\partial_2P_r$ under the mapping by $\mathfrak{y}$ cannot
depend on $\epsilon$. Remembering the minimizing property of $S$ we are thus led to a
variational problem with side conditions: The area $A[S_r(\epsilon)]$ must have a
stationary value for $\epsilon=0$ subject to the condition $\int_{-r}^{r}|\mathfrak{y}_u(u,0;\epsilon)|\,du=\text{const}$.

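Using the relation $\mathfrak{x}_u\mathfrak{z}_u+\mathfrak{x}_v\mathfrak{z}_v=(\partial/\partial u)(\mathfrak{x}_u\mathfrak{z})+(\partial/\partial v)(\mathfrak{x}_v\mathfrak{z})$ and applying Green's
theorem we find

$$\begin{aligned}
A[S_r(\epsilon)] &= \frac{1}{2}\iint_{P_r}(\mathfrak{y}_u^2+\mathfrak{y}_v^2)\,du\,dv\\
&= A[S_r(0)]+\epsilon\iint_{P_r}(\mathfrak{x}_u\mathfrak{z}_u+\mathfrak{x}_v\mathfrak{z}_v)\,du\,dv+\epsilon^2[\cdots]\\
&= A[S_r(0)]-\epsilon\int_{-r}^{r}\mathfrak{x}_v(u,0)\,\mathfrak{z}(u,0)\,du+\epsilon^2[\cdots].
\end{aligned}$$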


Furthermore

$$|\mathfrak{y}_u(u,0;\epsilon)|=|\mathfrak{x}_u|+\epsilon\,\frac{\mathfrak{x}_u\mathfrak{z}_u}{|\mathfrak{x}_u|}+\epsilon^2[\cdots].$$


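By the rules of the calculus of variations (see, for instance, [2], pp. 457-465) we
can conclude that

$$\int_{-r}^{r}\Big[\mathfrak{x}_v\mathfrak{z}+\lambda\,\frac{\mathfrak{x}_u\mathfrak{z}_u}{|\mathfrak{x}_u|}\Big]_{(u,0)}du=\int_{-r}^{r}\Big[\mathfrak{x}_v-\lambda\,\frac{\partial}{\partial u}\Big(\frac{\mathfrak{x}_u}{|\mathfrak{x}_u|}\Big)\Big]_{(u,0)}\mathfrak{z}(u,0)\,du=0.$$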


Here $\lambda$ is a (constant) multiplier. Since $\mathfrak{z}(u,v)$ was an arbitrary vector the relation

$$\lambda\,\frac{\partial}{\partial u}\Big(\frac{\mathfrak{x}_u(u,0)}{|\mathfrak{x}_u(u,0)|}\Big)=\mathfrak{x}_v(u,0)$$

follows. Introducing the arc length on $\Gamma_2$, $s=s(u)=\int_0^u|\mathfrak{x}_u(u,0)|\,du$, and denoting
the differentiation with respect to $s$ by a prime, this relation becomes

$$\lambda\,\mathfrak{x}''(u,0)=\frac{\mathfrak{x}_v(u,0)}{|\mathfrak{x}_u(u,0)|}\quad\text{for } -r\le u\le r.$$
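
The passage to arc length can be checked directly. The following is only a sketch, writing $\mathfrak{x}$ for the position vector and $\lambda$ for the multiplier, assuming that $s$ increases with $u$ and using $\mathfrak{x}_u^2>0$ from (iii) so that division by $|\mathfrak{x}_u|$ is permitted:

$$\frac{d}{ds}=\frac{1}{|\mathfrak{x}_u|}\frac{\partial}{\partial u},\qquad \mathfrak{x}'=\frac{\mathfrak{x}_u}{|\mathfrak{x}_u|},\qquad \mathfrak{x}''=\frac{1}{|\mathfrak{x}_u|}\frac{\partial}{\partial u}\Big(\frac{\mathfrak{x}_u}{|\mathfrak{x}_u|}\Big).$$

Dividing the relation $\lambda\,(\partial/\partial u)(\mathfrak{x}_u/|\mathfrak{x}_u|)=\mathfrak{x}_v$ on $v=0$ by $|\mathfrak{x}_u|$ therefore yields the arc-length form displayed above.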

Since the vector on the right hand side is of unit length (by (i), $|\mathfrak{x}_v|=|\mathfrak{x}_u|$), while
$|\mathfrak{x}''|$ denotes the curvature of $\Gamma_2$, the first part of our theorem follows. The
second part of the theorem is a consequence of the fact that $\mathfrak{x}_v$ is a tangent
vector on $S$. Thus, if $\Gamma_2$ is considered as a curve on $S$, its normal curvature
must vanish. Such a curve is an asymptotic line.

AN APPROACH TO EMPIRICAL LOGIC
C. H. RANDALL AND D. J. FOULIS, University of Massachusetts 


1. Introduction. The aim of an empirical science is to order, explain, and 
predict the observable consequences of certain physical operations. The mathe- 
matical apparatus for such an empirical science, therefore, ought to be based 
on a formal “empirical logic” capable of describing these events. Such logics, 
in turn, ought to be erected on a refinement of the conventional notion of a 
physical operation. A rigorous development, based on this desideratum, leads 
to a nonclassical symbolic logic that bears a striking resemblance to the so- 
called logic of quantum mechanics [2, 3, 6, 8, 15, 18, 19, 20, 25, 27, 32, 34, 35, 36, 
37, 39, 44, 45, 48, 49, 50]. 

By an operation we shall mean a collection of instructions that describes a
well-defined physically realizable (reproducible) procedure, and that in addition
specifies what must be observed and recorded. In particular, an operation must
require that, as a result of every execution of the instructions, one and only one 
symbol from a specified set R be recorded as the outcome of that realization of 
the operation. As a consequence of this definition, R is an exhaustive and mu- 
tually exclusive set of outcomes for the corresponding operation. Carefully note, 
if we delete or add details to such a set of instructions, and in particular if we 
modify the outcome set R in any way, we thereby define a new operation. 

Here, we are by no means restricting our attention to operations involving 
only traditional laboratory procedures. In fact, test procedures on an assembly 
line, data gathering procedures in general (such as opinion polling), pencil-and- 
paper procedures (such as executing computational algorithms), and even pro- 
cedures involving subjective approvals or disapprovals are all admissible pro- 
vided that they satisfy the above criteria. 

This definition appears to be the only tractable one, since the only means of 
establishing that two individuals performed the same operation is with a de- 
scription. Niels Bohr repeatedly made this point in connection with quantum 
physics. He said [4, p. 3]: 

“The decisive point is to recognize that the description of the experimental arrangement and
the recording of observations must be given in plain language, suitably refined by the usual
physical terminology. This is a simple logical demand, since by the word ‘experiment’ we can only
mean a procedure regarding which we are able to communicate to others what we have done and
what we have learnt.”


2. Compositions of operations. In practice, operations are normally synthesized
from sequences of more primitive procedures and arrangement approvals,
however informal or subtle. Our first task will be to obtain a formal
representation for such a synthesis. By a primitive operation, we mean roughly 
an operation which we cannot, or do not choose to, factor into sequences of 
more basic operations. (In the final analysis, the decision to regard an operation 
as being primitive must be largely subjective.) 

Let $\mathcal{P}$ be a set of primitive operations. It will be mathematically convenient
to assume that, for each operation $\omega\in\mathcal{P}$, the outcome set $R_\omega$ for $\omega$ contains at least
two distinct elements. This is no real restriction, since we can always modify an 
operation in an essentially inconsequential way by appending a fictitious addi- 
tional outcome to its outcome set. 

There is no guarantee that the various outcome sets $R_\omega$ are mutually disjoint.
In order to obviate this difficulty, we shall simply replace each $R_\omega$ by the labeled
outcome set $X_\omega=\{\omega\}\times R_\omega$. Clearly, this is an inconsequential modification, for
we are only agreeing to record an outcome of the primitive operation $\omega$ as $(\omega,b)$
rather than simply as $b$. We define $X=\bigcup\{X_\omega\mid\omega\in\mathcal{P}\}$, and we call $X$ the labeled
outcome space for $\mathcal{P}$. If $x=(\alpha,a)$, $y=(\beta,b)\in X$, we say that $x$ and $y$ are orthogonal
and we write $x\mathrel{\#}y$ if and only if $\alpha=\beta$ and $a\neq b$. Thus, two labeled outcomes in $X$
are orthogonal precisely when they are distinct outcomes for the same primitive
operation in $\mathcal{P}$.
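
The labeling step and the orthogonality relation on the labeled outcome space can be sketched in a few lines of Python. This is only an illustration: the operation names ("coin", "dial") and their outcome sets are hypothetical, and a labeled outcome is modeled as an (operation, result) pair.

```python
def labeled_outcome_set(op, results):
    """Return the labeled outcome set {op} x R_op as (operation, result) pairs."""
    return {(op, r) for r in results}


def orthogonal(x, y):
    """Two labeled outcomes are orthogonal when they belong to the same
    primitive operation but record different results."""
    return x[0] == y[0] and x[1] != y[1]


# Hypothetical primitive operations whose raw outcome sets overlap ("tails").
R = {"coin": {"heads", "tails"}, "dial": {"left", "right", "tails"}}

# X is the union of the labeled outcome sets; labeling makes them disjoint.
X = set()
for op, results in R.items():
    X |= labeled_outcome_set(op, results)
```

Although the raw outcome sets share the symbol "tails", the labeled outcomes ("coin", "tails") and ("dial", "tails") are distinct and not orthogonal, since they belong to different operations.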


FIG. 1. Operational diagram for a primitive operation $\omega$ with labeled outcomes $x_1, x_2, \ldots, x_n$.


A given primitive operation $\omega\in\mathcal{P}$, together with its associated labeled outcome
set $X_\omega$, can be conveniently depicted as shown in Figure 1, in which the
labeled circle represents the operation $\omega$ and the labeled “legs” represent the
outcomes $x_1, x_2, \ldots, x_n$ comprising $X_\omega$. The diagram in Figure 1 will be called
the operational diagram for $\omega$.

Compound operations, synthesized from sequences of the primitive operations
in $\mathcal{P}$, can now be depicted by a natural generalization of Figure 1. For instance,
see Figure 2, which depicts a compound operation $E$ for which the primitive
operation $\alpha$ is the initial step. In the event that the labeled outcome $x_1$
is obtained when this first step $\alpha$ is executed, the primitive operation $\beta$ is the
required second step. In the event that the labeled outcome $x_2$ is obtained when
$\alpha$ is executed, the primitive operation $\gamma$ is the required second step, etc., etc.;
and likewise for the third steps. Notice, however, that if $\gamma$ is the required second
step, no third step is specified and the operation $E$ is terminated. The diagram
in Figure 2 will be called the operational diagram for the compound operation $E$.

FIG. 2. Operational diagram for the compound operation $E$.

Suppose that the compound operation $E$ of Figure 2 is executed and that $x_1$
is the labeled outcome obtained when the first step $\alpha$ is executed, $y_1$ is the labeled
outcome obtained when the second step $\beta$ is executed, and $w_1$ is the labeled outcome
obtained when the third step $\delta$ is executed. We shall then agree to record
the outcome of this execution of $E$ as the formal sequence $x_1y_1w_1$. For the compound
operation $E$, there are altogether fourteen possible outcomes, so that the
outcome set $R_E$ is given by

$$R_E=\{x_1y_1w_1,\ x_1y_1w_2,\ x_1y_1w_3,\ x_1y_2w_1,\ x_1y_2w_2,\ x_1y_2w_3,\ x_2z_1,\ x_2z_2,\ x_2z_3,\ x_2z_4,\ x_3x_1,\ x_3x_2,\ x_3x_3v_1,\ x_3x_3v_2\}.$$


Evidently, the operational diagram for the compound operation $E$ can be
reconstructed merely from a knowledge of the outcome set $R_E$. For example, all
of the formal sequences in the set $R_E$ initiate with $x_1$, with $x_2$ or with $x_3$, making it
clear that the initial operation for $E$ is the unique primitive operation $\alpha$ for
which $X_\alpha=\{x_1, x_2, x_3\}$. An obvious inductive procedure yields the complete
operational diagram for $E$.

Let $S$ denote the set of all finite formal sequences of elements of the labeled
outcome space $X$. A typical element $a\in S$ will have the form $a=u_1u_2\cdots u_n$,
with $u_1, u_2, \ldots, u_n\in X$. We define the length of $a$, in this case, by length$(a)=n$.
If we define a “multiplication” operation on $S$ by

$$(u_1u_2\cdots u_n)(v_1v_2\cdots v_m)=u_1u_2\cdots u_nv_1v_2\cdots v_m,$$

then $S$ becomes a semigroup; in fact, $S$ is just the free semigroup over the set $X$.
We identify the elements of length 1 in $S$ with the corresponding elements of $X$,
so that $X$ is a subset of $S$. It will be mathematically convenient to consider the
monoid (semigroup with unit) obtained from the semigroup $S$ by formally
adjoining a unit element 1 to $S$ and defining $a\cdot 1=1\cdot a=a$ and $1\cdot 1=1$ for all
$a\in S$. We denote the resulting monoid by $\Gamma=\Gamma(\mathcal{P})=S\cup\{1\}$, and we define
length$(1)=0$.

A subset $M$ of $\Gamma$ will be said to be bounded if there exists a nonnegative integer
$n$ such that length$(a)\le n$ for every $a\in M$. If $M$ is a nonempty bounded subset
of $\Gamma$, we define length$(M)=\max\{\text{length}(a)\mid a\in M\}$. The outcome set $R_E$
corresponding to a compound operation $E$ depicted by an operational diagram (such
as Figure 2) is evidently always a bounded subset of $\Gamma$ and length$(R_E)$ gives the
maximum number of sequential executions of primitive operations that might
be required during an execution of the compound operation $E$.

Not every bounded subset of $\Gamma$ is the outcome set for a compound operation
$E$. In order to characterize those subsets of $\Gamma$ that have the form $R_E$, we must
introduce a new definition. We shall say that two elements $a, b\in\Gamma$ are
(lexicographically) orthogonal, in symbols $a\perp b$, if and only if there exist elements
$c, d, e\in\Gamma$ and elements $x, y\in X$ such that $x\mathrel{\#}y$, $a=cxd$ and $b=cye$. If $M$ is a subset
of $\Gamma$, then we call $M$ an orthogonal set if and only if $a, b\in M$ with $a\neq b$ implies
that $a\perp b$.
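
Lexicographic orthogonality is easy to test mechanically. The sketch below is an illustration rather than part of the original development: a word is assumed to be a Python tuple of labeled outcomes, each an (operation, result) pair, with the empty tuple standing for the unit 1. Because the two letters in the definition must differ, the common prefix is forced to be the longest common prefix, so it suffices to inspect the first position at which the two words disagree.

```python
def letters_orthogonal(x, y):
    """Orthogonality of labeled outcomes: same operation, different result."""
    return x[0] == y[0] and x[1] != y[1]


def lex_orthogonal(a, b):
    """Lexicographic orthogonality of two words (tuples of labeled outcomes).

    The words must share a prefix and then continue with two orthogonal
    letters; only the first disagreement can play that role.
    """
    for x, y in zip(a, b):
        if x != y:
            return letters_orthogonal(x, y)
    # No disagreement found: the words are equal or one is a prefix of the other.
    return False


def is_orthogonal_set(words):
    """True when every two distinct words in the collection are orthogonal."""
    words = list(set(words))
    return all(lex_orthogonal(a, b)
               for i, a in enumerate(words) for b in words[i + 1:])


# Hypothetical labeled outcomes for two primitive operations.
x1, x2 = ("alpha", "x1"), ("alpha", "x2")
y1, y2 = ("beta", "y1"), ("beta", "y2")
```

With these names, the words (x1, y1) and (x1, y2) are orthogonal, whereas (x1,) and (x1, y1) are not, since the first is a prefix of the second.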

It is not difficult to prove that a subset $M$ of $\Gamma$ is of the form $R_E$ for some
compound operation $E$ if and only if $M\neq\{1\}$ and $M$ is a bounded maximal
orthogonal subset of $\Gamma$. In what follows, we shall regard the set $\{1\}$ as the outcome
set of a trivial operation J requiring only that the symbol 1 be recorded.
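
For finite sets of words, maximal orthogonality can be checked recursively, mirroring the way an operational diagram is read from the top down. The sketch below rests on assumptions that are not in the text: words are tuples of (operation, result) pairs, the empty tuple stands for 1, every primitive operation has a finite outcome set supplied in a dictionary, and the recursive criterion (all words start with the same operation, every outcome of that operation occurs as a first letter, and each branch is again maximal orthogonal) is offered here as a plausible reformulation, not as the authors' proof.

```python
def is_operation(words, outcome_sets):
    """Check whether a finite set of words is a maximal orthogonal set.

    outcome_sets maps each primitive operation to its finite set of results
    (each assumed to contain at least two elements).
    """
    words = set(words)
    if not words:
        return False              # the empty set can always be enlarged by 1
    if () in words:
        return words == {()}      # 1 is orthogonal to no other word
    first_ops = {w[0][0] for w in words}
    if len(first_ops) != 1:
        return False              # first steps come from different operations
    (op,) = first_ops
    results = outcome_sets[op]
    if any(w[0][1] not in results for w in words):
        return False              # a first letter is not an outcome of op
    # Every outcome of the first operation must lead to a complete branch.
    return all(
        is_operation({w[1:] for w in words if w[0] == (op, r)}, outcome_sets)
        for r in results
    )


# Hypothetical example: alpha first; after x1 perform beta; after x2 or x3 stop.
OUTCOMES = {"alpha": {"x1", "x2", "x3"}, "beta": {"y1", "y2"}}
x1, x2, x3 = (("alpha", n) for n in ("x1", "x2", "x3"))
y1, y2 = (("beta", n) for n in ("y1", "y2"))
E = {(x1, y1), (x1, y2), (x2,), (x3,)}
```

Here `is_operation(E, OUTCOMES)` holds, while dropping the word (x3,) leaves an orthogonal set that is no longer maximal. The function also accepts the trivial set containing only the empty word, in line with the convention just stated.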

An unbounded maximal orthogonal subset of $\Gamma$ can be regarded as the outcome
set of an “in principle” compound operation which might not be physically
realizable because of the possibility that an infinite number of steps could be 
required for its execution. Despite the fact that “in principle” compound opera- 
tions might fail to satisfy our requirement of physical realizability, they can be 
considered as “limiting cases” of physically realizable compound operations, 
and it is therefore reasonable to expect our mathematical formalism to be able 
to represent them. 

As it has been interpreted above, an operational diagram can be viewed as a
set of instructions. Since the operational diagram for a compound operation $E$ is
completely determined by its set of outcomes $R_E$, this set itself can be regarded
as a complete description of the required procedure. Recalling that by definition
an operation is precisely such a set of instructions, we propose to identify compound
operations with their outcome sets. This is formally accomplished by the
following definition: A compound $\mathcal{P}$-operation is a maximal orthogonal subset of
$\Gamma(\mathcal{P})$. The set of all compound $\mathcal{P}$-operations is denoted by $\mathcal{E}=\mathcal{E}(\mathcal{P})$. The set of
all bounded compound $\mathcal{P}$-operations is denoted by $\mathcal{E}_0=\mathcal{E}_0(\mathcal{P})$.

If $E\in\mathcal{E}$, it is natural to regard $D\subseteq R_E$ as a proposition asserting that $E$ was
executed and in this realization of $E$ one of the outcomes $d\in D$ was recorded.
This we recognize as precisely the Kolmogorov [23] point of view on which
much of modern statistics has been founded. Following Kolmogorov, we shall
also refer to such sets as events. It is not difficult to see that these events are
precisely the orthogonal subsets of $\Gamma$. The set of all events is denoted by $\mathcal{O}=\mathcal{O}(\mathcal{P})$
and the set of all bounded events by $\mathcal{O}_0=\mathcal{O}_0(\mathcal{P})$.

Notice that, in general, a given $D\in\mathcal{O}$ will be a subset of many operations
$E\in\mathcal{E}$; hence, unlike Kolmogorov, we cannot associate such an event with only
a single operation. Kolmogorov explicitly considered only one operation at a
time, while we are simultaneously considering all of the operations in $\mathcal{E}$. For
example, let $\omega, \lambda, \gamma, \delta\in\mathcal{P}$ with $X_\omega=\{u, u'\}$, $X_\lambda=\{v, v'\}$, $X_\gamma=\{w, w'\}$ and
$X_\delta=\{x, x'\}$. Then $E_1=\{xu, xu', x'v, x'v'\}$ and $E_2=\{xu, xu', x'w, x'w'\}$ are in
$\mathcal{E}_0(\mathcal{P})$ and $xu$ is an outcome of both operations. Furthermore, $\{xu, xu'\}$ is an
event for both operations.

Our inability to assign a unique operation to each outcome in $\Gamma$, and consequently
to each event in $\mathcal{O}$, is less of a predicament than it appears to be. In the
above example, if we execute $\delta$ and record $x$ and then execute $\omega$ and record $u$,
we have in fact followed the instructions for both $E_1$ and $E_2$. It was carefully
noted earlier that we would regard as a realization of an operation any execution
of a physical procedure that satisfied the explicit specifications in the instructions;
consequently, the recording of $xu$ was simultaneously an outcome for
both $E_1$ and $E_2$. The same argument applies to events. Thus, we need to modify
the suggested Kolmogorov interpretation of an event by deleting any reference
to a specific operation.
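
The example of two operations sharing an outcome can be replayed concretely. In this sketch the operation labels ("omega", "lam", "gamma", "delta") are stand-ins chosen for illustration, words are tuples of (operation, result) pairs, and the orthogonality test looks at the first position where two words differ.

```python
def lex_orthogonal(a, b):
    """Lexicographic orthogonality: the first differing letters must come
    from the same primitive operation."""
    for p, q in zip(a, b):
        if p != q:
            return p[0] == q[0]
    return False  # equal words, or one is a prefix of the other


def is_orthogonal_set(words):
    """True when every two distinct words in the collection are orthogonal."""
    words = list(set(words))
    return all(lex_orthogonal(a, b)
               for i, a in enumerate(words) for b in words[i + 1:])


# Labeled outcomes of four two-outcome primitive operations (labels assumed).
u, u_ = ("omega", "u"), ("omega", "u'")
v, v_ = ("lam", "v"), ("lam", "v'")
w, w_ = ("gamma", "w"), ("gamma", "w'")
x, x_ = ("delta", "x"), ("delta", "x'")

E1 = {(x, u), (x, u_), (x_, v), (x_, v_)}
E2 = {(x, u), (x, u_), (x_, w), (x_, w_)}
D = {(x, u), (x, u_)}          # an event contained in both operations

shared = E1 & E2
```

The word (x, u) lies in both operations, so recording it does not reveal which of the two was "intended"; the union of the two operations is not an orthogonal set, so they are genuinely different procedures.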


3. The calculus of events. The structure of our empirical logic will be
crucially affected by our decisions concerning the assignment of truth values to
the propositions represented by the events $D\in\mathcal{O}$. In the absence of a dogmatic
enforcement of suitable stipulations concerning the verification of such propositions,
we are left vulnerable to an insidious difficulty: the validity of evidence.
Data is frequently challenged on the grounds that the experimental procedures
that produced it did not properly account for ________. In practice, it is never
difficult to fill in such a blank; the possibilities are literally inexhaustible. An
objective line must be drawn somewhere, and in this section, we make our choice!

Notice that our failure to associate a unique operation $E\in\mathcal{E}$ with an outcome
$b\in\Gamma$ amounts to rejecting, as admissible information, any knowledge of the
operations that would have been executed if outcomes other than those recorded
had resulted. In other words, any intention to execute one of these potential
operations, and any unrecorded consequences of such an intention, are regarded
as being inadmissible data. Only the realizations of the outcomes of compound
$\mathcal{P}$-operations are admissible as valid evidence in the empirical universe of discourse
erected on $\mathcal{P}$. The events $D\in\mathcal{O}$ are to represent the propositions that are validated
or refuted by such evidence. We emphasize that, in our view, such a proposition
is well-defined if and only if the exact conditions under which it is to be regarded
as true, as well as those under which it is to be regarded as false, are stipulated in
terms of valid evidence. Thus, we must assign to each $D\in\mathcal{O}$ a subset $D^+\subseteq\Gamma$
of outcomes for which $D$ is regarded as validated and a subset $D^-\subseteq\Gamma$ of outcomes
for which $D$ is regarded as refuted.

A strict classical interpretation of an event $D$ would require that we put
$D^+=D$ and $D^-=\Gamma\setminus D$. For reasons which we can only outline briefly here, we
shall, however, define $D^+$ and $D^-$ differently. We wish to place three requirements
on the map $D\mapsto D^+$, namely:


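(1) $ab\in\{a\}^+$ for $a, b\in\Gamma$.
(2) For $A, B\in\mathcal{O}$, $A\subseteq B\Rightarrow A^+\subseteq B^+$.
(3) For $a\in\Gamma$, $E\in\mathcal{E}$, $\{a\}^+\subseteq(aE)^+$, where $aE=\{ae\mid e\in E\}$.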


The motivation for requirements (1) and (2) is plain enough, but requirement
(3) might be somewhat unexpected. In (3), we suggest that the event $aE$
be regarded as asserting the validation of the event $\{a\}$ and a possible subsequent
intention to carry out the operation $E$. Since we do not care to rule out the
potential execution of any of the operations $E\in\mathcal{E}$, we are obliged to impose condition
(3). Requirements (1), (2) and (3) generate the following, which is our official
formal definition of $D^+$, for $D\in\mathcal{O}$: $D^+=\{a\in\Gamma\mid\exists\, b, c\in\Gamma \text{ and } \exists\, E\in\mathcal{E} \text{ with } a=bc,\ bE\subseteq D\}$.

Our candidate for $D^-$ is perhaps more transparent. We propose to view a
realization of an outcome $e\in\Gamma$ as refuting any event claiming the occurrence of
outcomes all of which are orthogonal to $e$. Hence, for $D\in\mathcal{O}$, we define
$D^-=\{e\in\Gamma\mid\forall d\in D,\ e\perp d\}$.
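
For a finite event the two membership tests can be sketched in Python. Several assumptions are made here and are not taken from the text: words are tuples of (operation, result) pairs with the empty tuple as 1; outcome sets are finite and given in a dictionary; and validation is tested through the observation that, for an orthogonal set $D$, a prefix $b$ of $a$ satisfies $bE\subseteq D$ for some operation $E$ exactly when the set of continuations of $b$ inside $D$ is itself a maximal orthogonal set.

```python
def lex_orthogonal(a, b):
    """First differing letters must belong to the same primitive operation."""
    for p, q in zip(a, b):
        if p != q:
            return p[0] == q[0]
    return False


def is_operation(words, outcome_sets):
    """Recursive test for a finite maximal orthogonal set (see assumptions)."""
    words = set(words)
    if not words:
        return False
    if () in words:
        return words == {()}
    first_ops = {w[0][0] for w in words}
    if len(first_ops) != 1:
        return False
    (op,) = first_ops
    if any(w[0][1] not in outcome_sets[op] for w in words):
        return False
    return all(
        is_operation({w[1:] for w in words if w[0] == (op, r)}, outcome_sets)
        for r in outcome_sets[op]
    )


def refutes(e, D):
    """e refutes D when e is orthogonal to every outcome in D."""
    return all(lex_orthogonal(e, d) for d in D)


def validates(a, D, outcome_sets):
    """a validates D when some prefix b of a is followed, inside D, by a
    complete operation (possibly the trivial one)."""
    for k in range(len(a) + 1):
        b = a[:k]
        continuations = {d[k:] for d in D if d[:k] == b}
        if is_operation(continuations, outcome_sets):
            return True
    return False


# Hypothetical two-outcome operations.
OUTCOMES = {"alpha": {"x1", "x2"}, "beta": {"y1", "y2"}}
x1, x2 = ("alpha", "x1"), ("alpha", "x2")
y1, y2 = ("beta", "y1"), ("beta", "y2")
D1 = {(x1,)}                 # the event {x1}
D2 = {(x1, y1), (x1, y2)}    # the event x1 followed by the operation beta
```

In this toy setting the outcome x1 validates both D1 and D2, in the spirit of requirement (3), while x2 refutes both.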

Let $E\in\mathcal{E}$ and let $D\in\mathcal{O}$. If $D\subseteq E$, we shall say that the operation $E$ is a
direct test for the event $D$, since in this case each execution of $E$ produces an
outcome $d\in E$ for which we can directly test the validity of $D$ (without reference
to intended or potential operations). If $E\subseteq(D^+\cup D^-)$, we shall say that the
operation $E$ tests $D$, since in this case each execution of $E$ produces an outcome
$d\in E$ for which we can still test the validity of $D$ by checking whether or not $d$
belongs to $D^+$.

The above considerations lead us to define the binary relation $\leq$ of strict
implication on $\mathcal{O}$ as follows: For $C, D\in\mathcal{O}$, $C\leq D$ if and only if $C^+\subseteq D^+$. As one
might hope, it can be shown that $C\leq D$ if and only if $D^-\subseteq C^-$. The relation $\leq$
is reflexive and transitive on $\mathcal{O}$, but it is not in general antisymmetric. Hence,
we are led to define the equivalence relation $\equiv$ of strict equivalence on $\mathcal{O}$ by
$C\equiv D$ if and only if $C\leq D$ and $D\leq C$.

Notice that, for $b\in\Gamma$, $E\in\mathcal{E}$, we have $\{b\}\equiv bE$; but, in general, $\{b\}\not\equiv Eb$,
where $Eb=\{eb\mid e\in E\}$. This “temporal asymmetry” is a consequence of our
definition of lexicographic orthogonality and our decisions regarding valid evidence.
An event $bE$ has been viewed as asserting that $\{b\}$ has been validated,
and essentially only that. On the other hand, $Eb$ asserts more, since with regard
to $b$ the operation $E$ is no longer potential, but realized. In a crude sense, the
execution of $E$ is viewed as exercising a possible influence on the outcomes of
subsequent operations; whereas, in drawing our “objective line,” we consciously
failed to take into account the possibility of intended operations as exercising
such an influence.

In addition to the relations of strict implication and strict equivalence, there
are three other binary relations on $\mathcal{O}$ that are germane to our calculus of events.
Let $C, D\in\mathcal{O}$. We define $C\leftrightarrow D$ if and only if $C\cup D\in\mathcal{O}$. We also define $C\perp D$ if
and only if $C\leftrightarrow D$ and $C\cap D=\emptyset$. Finally, we define $C\perp_m D$ if and only if
$C\perp D$ and $C\cup D\in\mathcal{E}$.

Notice that, for $C, D\in\mathcal{O}$, $C\leftrightarrow D$ if and only if there exists an operation $E\in\mathcal{E}$
such that $C\subseteq E$ and $D\subseteq E$; that is, $C\leftrightarrow D$ if and only if the two events $C$ and $D$
admit a simultaneous direct test operation $E\in\mathcal{E}$. There is, of course, a more
general possibility; namely, that there exists an operation $E\in\mathcal{E}$ which simultaneously
tests both $C$ and $D$ (but not necessarily directly). It is pleasant to be
able to report that $C$ and $D$ are simultaneously testable in the latter sense if and
only if there exist events $C_1, D_1\in\mathcal{O}$, such that $C_1\equiv C$, $D_1\equiv D$ and $C_1\leftrightarrow D_1$.

If $C\perp D$, we shall say that the event $C$ is orthogonal to the event $D$. It is a
theorem that $C\perp D$ if and only if $C^+\subseteq D^-$. Consequently, if $C\perp D$, then any
validation of $C$ constitutes at the same time an automatic refutation of $D$.

If $C\perp_m D$, we shall say that the event $C$ is maximally orthogonal to the event
$D$. It is a theorem that $C\perp_m D$ if and only if $C^+=D^-$. Consequently, if $C\perp_m D$,
then an outcome validates $C$ if and only if it refutes $D$. Maximal orthogonality
and strict equivalence are related by the following fact: The event $C$ is strictly
equivalent to the event $D$ if and only if there exists an event $B$ that is maximally
orthogonal to both $C$ and $D$.

In Kolmogorov’s work [23], in which only one basic operation is under discussion,
the events are represented by subsets of the sample space and their
denials (or negations) are accordingly represented by the set theoretic complements
of these subsets in the sample space. In our event calculus, a given event
$D\in\mathcal{O}$ admits complements $E\setminus D$ in any of the operations $E\in\mathcal{E}$ for which $D\subseteq E$;
hence, we can associate (in general) a multitude of relative negations $C=E\setminus D\in\mathcal{O}$
with our given $D$. Evidently, $C$ is a relative negation for $D$ in this sense if and
only if $C\perp_m D$.

It is a theorem that the set of all events $C\in\mathcal{O}$ such that $C\perp_m D$ constitutes
an equivalence class in $\mathcal{O}$ with respect to the relation of strict equivalence. This
suggests that the equivalence class $\{C\in\mathcal{O}\mid C\perp_m D\}$ should be regarded, in some
sense, as representing the absolute negation of the event $D$. This, in turn, suggests
that we shift our attention from the event calculus $(\mathcal{O};\ \leq, \equiv, \leftrightarrow, \perp, \perp_m)$
to the quotient structure $\mathcal{L}=\mathcal{O}/\!\equiv$ obtained from the event calculus by identification
of strictly equivalent events.


4. The operational logic. If $D\in\mathcal{O}$, we define $[D]=\{C\in\mathcal{O}\mid C\equiv D\}$. Such
an equivalence class will be referred to as an experimental proposition. We agree
that the procedure for verifying $[D]$ is the same as that described above for
verifying $D$. Evidently, this procedure is independent of the choice of the representing
event $D$. We define $\mathcal{L}$ to be the set of all experimental propositions. For
$[C], [D]\in\mathcal{L}$, we shall say that $[C]$ strictly implies $[D]$, and write $[C]\leq[D]$,
if and only if $C\leq D$. Clearly, $\leq$ is a well-defined partial order relation on $\mathcal{L}$.
Note that, for all $[D]\in\mathcal{L}$, we have $\{\emptyset\}=[\emptyset]\leq[D]\leq[\{1\}]=\mathcal{E}$; hence, $\{\emptyset\}$
acts as an order zero and $\mathcal{E}$ acts as an order unit for the partially ordered set $\mathcal{L}$.

For $[D]\in\mathcal{L}$, we can now form an absolute negation $[D]'=\{C\in\mathcal{O}\mid C\perp_m D\}$
as suggested above. It readily follows from the theorems stated in the previous
section that the mapping $'\colon\mathcal{L}\to\mathcal{L}$ is a well-defined orthocomplementation on $\mathcal{L}$;
that is, for $[D], [C]\in\mathcal{L}$,

(i) $[D]\leq[C]$ if and only if $[C]'\leq[D]'$,
(ii) $([D]')'=[D]$ and
(iii) $[D]\wedge[D]'=\{\emptyset\}$, $[D]\vee[D]'=\mathcal{E}$.


Henceforth, as above, we shall use the symbols $\wedge$ and $\vee$ to denote infimum
(greatest lower bound) and supremum (least upper bound) respectively in $\mathcal{L}$.

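It is noteworthy that $\mathcal{L}$ is actually a complete lattice; that is, if $\{[D_\alpha]\mid\alpha\in I\}$
is any family of elements of $\mathcal{L}$, then the infimum $\bigwedge_\alpha[D_\alpha]$ and the supremum
$\bigvee_\alpha[D_\alpha]$ exist and are given by the following convenient formulas:

(1) $\bigwedge_\alpha[D_\alpha]=[C]$, where $C$ is any maximal orthogonal subset of $\bigcap_\alpha D_\alpha^+$.
(2) $\bigvee_\alpha[D_\alpha]=(\bigwedge_\alpha[D_\alpha]')'$.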


It is significant that $\mathcal{L}$ is a distributive lattice (and, hence, a Boolean algebra)
if and only if $\mathcal{P}$ contains exactly one primitive operation. However, a weak form
of the distributive law, the so-called orthomodular identity, always obtains [41]:

(3) $[C]\leq[D]\Rightarrow[D]=[C]\vee([D]\wedge[C]')$, for $[C], [D]\in\mathcal{L}$.


Thus, $\mathcal{L}$ forms what is referred to in the literature as an orthomodular lattice,
a type of lattice that has been extensively studied in recent years because of its
connection with quantum theory and von Neumann algebras [7, 9, 10, 11, 13, 
14, 16, 17, 18, 21, 22, 24, 28, 29, 30, 31, 33, 38, 41, 42, 43, 47]. 

Specifically, an orthomodular lattice is a lattice $L$ with an order zero 0 and
an order unit 1 and with an orthocomplementation $'\colon L\to L$ such that the orthomodular
identity $e\leq f\Rightarrow f=e\vee(f\wedge e')$ holds for all $e, f\in L$. For $e, f\in L$, it is
customary to say that $e$ and $f$ are orthogonal, and to write $e\perp f$, if and only if
$e\leq f'$. If every set of pairwise orthogonal elements of $L$ is countable, then $L$ is
said to be separable. It is also common practice to say that $e$ is compatible with $f$,
and to write $e\leftrightarrow f$, if and only if there exist mutually orthogonal elements $e_1$, $f_1$,
$d\in L$ such that $e=e_1\vee d$ and $f=f_1\vee d$. It is a fact that any maximal set of pairwise
compatible elements of $L$ is closed under the operations of $\wedge$, $\vee$ and $'$ and
forms a Boolean algebra under these operations. These Boolean algebras are
known as the blocks of $L$. Thus, the structure of $L$ is determined by the structure
of its blocks and the manner in which they “intertwine.”
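
The abstract definition can be tried out on a very small case. The sketch below uses the six-element lattice with elements 0, a, a', b, b', 1 (often called MO2), in which a, a', b, b' are pairwise incomparable. It is chosen only because it is the smallest non-Boolean example; it is not one of the operational logics discussed in this paper. The code checks the orthomodular identity by brute force and confirms that the distributive law fails.

```python
from itertools import product

ELEMS = ["0", "a", "a'", "b", "b'", "1"]
COMP = {"0": "1", "1": "0", "a": "a'", "a'": "a", "b": "b'", "b'": "b"}


def leq(e, f):
    """Order of the lattice: 0 is below everything, 1 is above everything,
    and the four middle elements are pairwise incomparable."""
    return e == f or e == "0" or f == "1"


def meet(e, f):
    """Greatest lower bound, found by searching the common lower bounds."""
    lower = [g for g in ELEMS if leq(g, e) and leq(g, f)]
    return next(g for g in lower if all(leq(h, g) for h in lower))


def join(e, f):
    """Least upper bound, found by searching the common upper bounds."""
    upper = [g for g in ELEMS if leq(e, g) and leq(f, g)]
    return next(g for g in upper if all(leq(g, h) for h in upper))


# Orthomodular identity: e <= f implies f = e v (f ^ e').
orthomodular = all(f == join(e, meet(f, COMP[e]))
                   for e, f in product(ELEMS, repeat=2) if leq(e, f))

# Distributive law: e ^ (f v g) = (e ^ f) v (e ^ g) for all e, f, g.
distributive = all(meet(e, join(f, g)) == join(meet(e, f), meet(e, g))
                   for e, f, g in product(ELEMS, repeat=3))
```

In this lattice the blocks are the two four-element Boolean algebras {0, a, a', 1} and {0, b, b', 1}, which intertwine only at 0 and 1.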

The intersection of all the blocks of an orthomodular lattice $L$ is called the
center of $L$ and is denoted by $C(L)$. An element $e\in C(L)$ is called a central element
and an element $e\in L$ is central if and only if $e\leftrightarrow f$ for every $f\in L$. One says that
$L$ is irreducible if and only if 0 and 1 are the only central elements in $L$. An
orthomodular lattice $L$ is irreducible if and only if it cannot be factored (nontrivially)
as a cartesian product. For the additional lattice theoretic terminology
used below, see [1, 12].

In $\mathcal{L}$, we already have seen that $[C]=[D]'$ if and only if there exist representative
elements $C_1\in[C]$ and $D_1\in[D]$ such that $C_1\perp_m D_1$. Similarly, $[C]\leftrightarrow[D]$
and $[C]\perp[D]$ in $\mathcal{L}$ are respectively equivalent to $C_1\leftrightarrow D_1$ and $C_1\perp D_1$ in $\mathcal{O}$.

It can be shown that the calculus of events can be recaptured (up to an
isomorphism) from the structure of $\mathcal{L}$ as an orthomodular lattice. As a consequence,
a lattice automorphism of $\mathcal{L}$ automatically preserves orthocomplementation
and is necessarily induced by a “change of variables”: permutations of
outcomes in outcome sets and permutations of outcome sets of the same cardinality.

We shall say that an element $[D]\in\mathcal{L}$ is bounded if and only if there exists a
bounded event $D_1\in\mathcal{O}_0$ such that $D_1\in[D]$. We denote by $\mathcal{L}_0$ the set of all bounded
elements of $\mathcal{L}$. It is significant that $\mathcal{L}_0$ is a sublattice of $\mathcal{L}$ closed under the
orthocomplementation (hence, $\mathcal{L}_0$ is an orthomodular lattice in its own right). The
elements of $\mathcal{L}_0$ are in fact the experimental propositions that are in practice
operationally accessible. We shall call $\mathcal{L}_0$ the operational logic over $\mathcal{P}$ and we
shall call $\mathcal{L}$ the complete operational logic over $\mathcal{P}$.


THEOREM. $\mathcal{L}$ is a complete orthomodular lattice, while $\mathcal{L}_0$ is an orthomodular
lattice that is not $\sigma$-complete. The outcome set for every primitive operation $\omega\in\mathcal{P}$
is countable if and only if both $\mathcal{L}$ and $\mathcal{L}_0$ are separable. The orthomodular lattices $\mathcal{L}$
and $\mathcal{L}_0$ are simple (hence irreducible) if and only if $\mathcal{P}$ contains two or more primitive
operations; otherwise, they are both Boolean algebras. Neither $\mathcal{L}$ nor $\mathcal{L}_0$ contains
atoms, and consequently both are order dense in themselves. The lattices $\mathcal{L}$ and $\mathcal{L}_0$ are
modular if and only if they are Boolean algebras. Every element of $\mathcal{L}$ is a supremum
of an orthogonal subset of $\mathcal{L}_0$; in fact, $\mathcal{L}$ is the completion by cuts of $\mathcal{L}_0$. If $\mathcal{P}$ contains
two or more primitive operations, then every automorphism of $\mathcal{L}$ or of $\mathcal{L}_0$ is induced
by a “change of variables.”


5. Conclusion. The above results only provide a glimpse of the exceedingly
rich and descriptive structure of $\mathcal{L}$. We have not even considered the obvious
question of the character of the sublogics that could be generated by various
sets of operations.

Implicit in the structure of the logic $\mathcal{L}$, there is a complete description of the
physical operations of concern and the relations between them. In the interest
of brevity, however, we must postpone such details to later papers.

The so-called logic of quantum mechanics [3, 32] is also a nonclassical orthomodular
lattice; as a consequence, one might naively suppose that $\mathcal{L}$ would also
be such a logic if the generating primitive operations were pertinent to quantum
mechanical systems. Although this conjecture is demonstrably false, it nevertheless
is so that there exist connections between $\mathcal{L}$ and quantum logic. The exact
nature of these connections will be discussed in a subsequent paper. Since the
elements of $\mathcal{L}$ are explicitly defined in terms of physical operations, and this is
not the case in quantum logic, it is important that these connections be
understood.

It is noteworthy, for example, that in quantum logic the interpretation of the
lattice theoretic infimum and supremum is a delicate matter [2, 3, 27], which
many authors feel is still unresolved; whereas in $\mathcal{L}$ this issue can readily be
settled. We need only consider the infimum, since the interpretation of $[D]'$ is
transparent and $\bigvee_\alpha[D_\alpha]=(\bigwedge_\alpha[D_\alpha]')'$. Recall that $\bigwedge_\alpha[D_\alpha]=[C]$, where $C$ is any
maximal orthogonal subset of $\bigcap_\alpha D_\alpha^+$ and hence that $C^+=\bigcap_\alpha D_\alpha^+$. Thus it is
evident that the experimental proposition $\bigwedge_\alpha[D_\alpha]$ is validated by an outcome
$b\in\Gamma$ if and only if $b$ validates every $[D_\alpha]$. On the other hand, if an outcome
$b\in\Gamma$ refutes any one of the experimental propositions $[D_\alpha]$, then it will refute
$\bigwedge_\alpha[D_\alpha]$. This much would be expected; what might not be supposed is the fact
that there exist, in general, $b\in\Gamma$ that refute $\bigwedge_\alpha[D_\alpha]$, but fail to refute any one
of the experimental propositions $[D_\alpha]$. We also have that $\bigwedge_\alpha[D_\alpha]=\{\emptyset\}$ if
and only if there does not exist an outcome $b\in\Gamma$ that will simultaneously validate
every $[D_\alpha]$. Although this “interpretation” leaves something to be desired
in the way of classical insight, it does specify precisely and operationally the
conditions under which the experimental proposition $\bigwedge_\alpha[D_\alpha]$ is to be regarded
as true or false, and this is all that is required.

The interpretation difficulty mentioned above should have come as no sur- 
prise, for, after all, the logic of quantum mechanics has been a consequence of a 
model of “reality” rather than a foundation for it. This state of affairs is by no 
means unique; in all of the empirical sciences, deterministic or stochastic models 
of nature are characteristically established first, and then—if at all—the implicit 
logics are investigated. In brief, these posterior logics depend not on reality, the 
purported ultimate authority of empirical science, but on models of reality. This 
curious practice, that leaves the conventional wisdom so vulnerable to crucial 
physical tests, such as the Michelson-Morley experiment, was noted by Mach 
[26] among others. Scientists [5, pp. 3-4], [46] have sought “self-healing” sys- 
tems with which to escape the havoc caused by these periodic upheavals. It 
would seem that a large step in this direction will have been taken when we 
heed the advice of Garrett Birkhoff, offered in connection with quantum me- 
chanics [2, p. 159]: 

“Scientifically, quantum logic should draw its authority directly from experiments. This
approach is not only scientific; it has the mathematical advantage of making the lattice theory of
the quantum logic autonomous.”


The emphasis on the operational approach described above should not be 
construed as an adoption of radical empiricism. The development of a purely 
operational logic does not entail the simultaneous rejection of subjective meth- 
ods; in particular, explicitly identifying the observables of an experimental 
science does not automatically deny the unifying and explanatory power of 
idealized models. The operational logics described here are only intended to 
serve as foundations on which to erect the more familiar and utilitarian machinery
of science. In subsequent papers a nonclassical statistics and an essentially
subjective logic will be constructed on these foundations. Some of the details 
and a discussion of this program can be found in [39], [40]. 


This article is based in part on an address to the Northeastern Section of the MAA, November 
30, 1968, by D. J. Foulis. 
HELLY TYPE THEOREMS DERIVED FROM BASIC
SINGULAR HOMOLOGY


H. E. DEBRUNNER, University of Bern, Switzerland 


In the first part of this paper the famous Helly intersection theorem is pre- 
sented as a simple application of singular homology in a manner suitable for 
classroom use. In Section 6 the same proof is exploited to yield a partially new,
stronger result on intersections in n-manifolds.


1. Introduction. The tedious work needed to verify the basic properties of
singular homology theory calls, in introductory courses, for a wealth of relevant
applications. Restricting themselves to the very basic facts and to simple spaces
such as spheres and cells, recent textbooks present as applications the invariance of
dimension, the Brouwer fixed point theorem, the nonexistence of unit tangent
vector fields on spheres of even dimension, the Jordan-Brouwer separation
theorem, the invariance of domain, and the fundamental theorem of algebra.
As another application of the same restricted tools we shall derive the following
Helly intersection theorem:


THEOREM 1 (Helly [5]). Let $\{X_j\}_{j \in J}$ be a finite family of open convex subsets
of euclidean n-space $R^n$ such that each $n+1$ members of the family have a point in
common. Then all the sets of the family have a point in common.
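
As a quick sanity check of Theorem 1 in the simplest case n = 1, the following minimal sketch tests a family of open intervals: if every n+1 = 2 of them meet, then all of them meet. The function names and the sample family are illustrative assumptions, not part of the paper.

```python
from itertools import combinations

def intersect(intervals):
    """Intersection of open intervals (a, b); returns (lo, hi), or None if empty."""
    intervals = list(intervals)
    lo = max(a for a, _ in intervals)
    hi = min(b for _, b in intervals)
    return (lo, hi) if lo < hi else None

def every_k_meet(intervals, k):
    """True if each choice of k intervals has a point in common."""
    return all(intersect(c) is not None for c in combinations(intervals, k))

# Hypothetical sample family of open convex subsets of R^1.
family = [(0.0, 3.0), (1.0, 4.0), (2.0, 5.0), (2.5, 6.0)]

if every_k_meet(family, 2):          # hypothesis of Theorem 1 with n = 1
    common = intersect(family)       # conclusion: the whole family meets
    assert common is not None
```

For intervals the conclusion is elementary (every left endpoint lies below every right endpoint), so the sketch only illustrates the statement; it says nothing about the homological proof.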


This statement has numerous variants and applications; for an excellent
survey on the ramifications and proofs of Helly’s theorem the reader is referred
to the Danzer-Grünbaum-Klee report [3]. This flexibility is related to the fact
that the theorem has a purely topological background. In fact, Helly also
proved a more general but much less known topological version of this theorem.
We shall present this topological intersection theorem here. The available proofs,
Helly’s in [6] and the simpler one in Alexandroff-Hopf [1], do not rest on the
basics of homology theory as presented in introductory courses.

We shall use singular homology with integer coefficients. Any other nonzero 
coefficient group would do as well, for in the course of the proof we need only 
check whether some abelian groups are zero groups or not. The needed part of 
homology theory consists of the functorial properties, the fact that deformation 
retractions induce isomorphisms in homology, and the exactness of the Mayer- 
Vietoris sequence for two subspaces covering a space by their relative interiors; 
in addition we shall use the following weak continuity property: 


(C). If $\{U_\alpha\}$ is a family of subspaces of the topological space Y such that each
compact subspace C of Y is contained in some space $U_\alpha$ of the family and such that
$H_q(U_\alpha) = 0$ for each space of the family, then $H_q(Y) = 0$.


For singular homology, (C) is clearly true, since the carrier of each singular
chain c is a compact set C, and if a cycle c of Y is bounding in some $U_\alpha$, the same
holds in Y.

For simplicity we use for nonempty spaces Y also the reduced homology
groups $\tilde H_q(Y)$, thus saving extra considerations for q = 0.


2. Statements. Let a homology cell be a space having singular homology
groups isomorphic to those of a point or cell, i.e., a homology cell X is a nonempty
space with $\tilde H_q(X) = 0$ for all integers q. Since nonempty convex subsets of
$R^n$ are homology cells and intersect in convex subsets (possibly empty),
Theorem 1 is contained in


THEOREM 2 (Helly [6]). Let $\{X_j\}_{j \in J}$ be a finite family of open subsets of
euclidean n-space $R^n$ such that the intersection $X_{j_1} \cap \cdots \cap X_{j_r}$ of each r sets of
this family is nonempty for $r \le n+1$ and is even a homology cell for $r \le n$. Then
$\bigcap_{j \in J} X_j$ is a homology cell.


The proof of Theorem 2 is based on two lemmata. The first tells us something 
about the homology structure of spaces built up from open homology cells. 


LEMMA $A_m$. Let $\{X_1, \ldots, X_m\}$ be a finite family of homology cells imbedded
as open subsets in some topological space M in such a way that the intersection of
each r members of the family is a homology cell whenever $r = 1, \ldots, m-1$.

(a) If $X_1 \cap \cdots \cap X_m$ is empty, then $\tilde H_q(X_1 \cup \cdots \cup X_m)$ is nonzero for
$q = m-2$ and zero for $q \ne m-2$.

(b) If $X_1 \cap \cdots \cap X_m$ is nonempty, then $\tilde H_q(X_1 \cup \cdots \cup X_m) \cong
\tilde H_{q-m+1}(X_1 \cap \cdots \cap X_m)$ for each integer q. In particular $X_1 \cup \cdots \cup X_m$ is a
homology cell if and only if $X_1 \cap \cdots \cap X_m$ is a homology cell.
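
A small worked instance may help fix the indices in Lemma $A_m$. The example below (open arcs on a circle) is an illustration chosen here, not one taken from the paper, and it uses the standard fact that $\tilde H_1(S^1) \cong Z$ while all other reduced groups of $S^1$ vanish.

```latex
% Case (a), m = 3: cover S^1 by three open arcs X_1, X_2, X_3, each slightly
% longer than a third of the circle. Each X_i and each X_i \cap X_j is an
% open arc, hence a homology cell, while X_1 \cap X_2 \cap X_3 = \emptyset.
\[
  \tilde H_q(X_1 \cup X_2 \cup X_3) = \tilde H_q(S^1) \cong
  \begin{cases} Z, & q = 1 = m-2,\\ 0, & q \neq m-2. \end{cases}
\]
% Case (b), m = 2: cover S^1 by two open arcs X_1, X_2 whose intersection
% consists of two disjoint short arcs, so \tilde H_0(X_1 \cap X_2) \cong Z.
\[
  \tilde H_1(X_1 \cup X_2) = \tilde H_1(S^1) \cong Z
  \cong \tilde H_{1-2+1}(X_1 \cap X_2) = \tilde H_0(X_1 \cap X_2).
\]
```

In both cases the degree shift $q \mapsto q-m+1$ of part (b), and the critical degree $m-2$ of part (a), can be read off directly.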


The second lemma exhibits a significant property of dimension.

LEMMA $B_n$. If Y is a nonempty open subset of $R^n$, then $\tilde H_q(Y) = 0$ for $q \ge n$.


Note that the corresponding statement for closed subsets of $R^n$ is not valid
in singular homology theory, as shown in [2].


3. Proof of Theorem 2. Assume there exist families satisfying the hypotheses
of Theorem 2 but not the conclusion. Let $\{X_1, \ldots, X_m\}$ among such
families be one with the minimal number m of members. We may apply Theorem
2 to any proper subfamily of $\{X_1, \ldots, X_m\}$, for the hypotheses pass to
subfamilies, and due to the minimality of m we conclude that the intersection of all
sets of any proper subfamily is a homology cell. This means that Lemma $A_m$
applies to the chosen family $\{X_1, \ldots, X_m\}$.

Consider first the case $X_1 \cap \cdots \cap X_m = \emptyset$, which under the assumptions
of Theorem 2 can happen only if $m > n+1$. Lemma $A_m$ (a) gives the result
(i) $\tilde H_{m-2}(X_1 \cup \cdots \cup X_m) \ne 0$. On the other hand, since $m-2 \ge n$,
Lemma $B_n$ shows (ii) $\tilde H_{m-2}(X_1 \cup \cdots \cup X_m) = 0$.
The statements (i) and (ii) contradict each other.

Consider now the case $X_1 \cap \cdots \cap X_m \ne \emptyset$. Our choice of the family was
such that $X_1 \cap \cdots \cap X_m$ is not a homology cell, and under the hypotheses of
Theorem 2 we thus must have $m > n$. There exists $p \ge 0$ such that
$\tilde H_p(X_1 \cap \cdots \cap X_m) \ne 0$, and application of Lemma $A_m$ (b) gives
(i) $\tilde H_{p+m-1}(X_1 \cup \cdots \cup X_m) \ne 0$. On the other hand, $m > n$
guarantees $p+m-1 \ge n$, so Lemma $B_n$ yields (ii) $\tilde H_{p+m-1}(X_1 \cup \cdots \cup X_m) = 0$.
Again (i) and (ii) contradict each other, and the proof is reduced to the lemmata.


4. Proof of Lemma $A_m$. We proceed by induction on m. Suppose $m > 2$ and
the statement $A_{m-1}$ is valid. Let $\{X_1, \ldots, X_m\}$ be a family satisfying the common
hypothesis of $A_m$. The subfamily $\{X_1, \ldots, X_{m-1}\}$ then clearly satisfies
the assumptions of $A_{m-1}$, and the particular case of $A_{m-1}$ (b) guarantees that
$X_1 \cup \cdots \cup X_{m-1}$ is a homology cell $U_1$. There is a reduced Mayer-Vietoris
sequence for the pair $\{X_1 \cup \cdots \cup X_{m-1}, X_m\}$ of open homology cells, namely

(1) $\cdots \to \tilde H_q(U_1) \oplus \tilde H_q(U_2) \to \tilde H_q(U_1 \cup U_2) \xrightarrow{\partial_q} \tilde H_{q-1}(U_1 \cap U_2) \to \tilde H_{q-1}(U_1) \oplus \tilde H_{q-1}(U_2) \to \cdots$

where $U_1 = X_1 \cup \cdots \cup X_{m-1}$, $U_2 = X_m$, $U_1 \cap U_2 = (X_1 \cap X_m) \cup \cdots \cup (X_{m-1} \cap X_m)$.
We know that $U_1$ and $U_2$ are homology cells, so the direct sum terms are zero
and by exactness we have $\tilde H_q(U_1 \cup U_2) \cong \tilde H_{q-1}(U_1 \cap U_2)$, or explicitly

(2) $\partial_q : \tilde H_q(X_1 \cup \cdots \cup X_m) \cong \tilde H_{q-1}((X_1 \cap X_m) \cup \cdots \cup (X_{m-1} \cap X_m))$.

Now apply $A_{m-1}$ to the family $\{X_1 \cap X_m, \ldots, X_{m-1} \cap X_m\}$; note that intersections
of r members in this family are intersections of $r+1$ members in the given
family $\{X_1, \ldots, X_m\}$; in particular the intersection of all the members is equal
to $X_1 \cap \cdots \cap X_m$. If this is empty, $A_{m-1}$ (a) shows that the right hand side of
(2) is nonzero for $q-1 = (m-1)-2$ and zero otherwise, i.e., for $q = m-2$ and
$q \ne m-2$ respectively. If $X_1 \cap \cdots \cap X_m$ is nonempty, the right hand side of
(2) is by $A_{m-1}$ (b) isomorphic to $\tilde H_{(q-1)-(m-1)+1}(X_1 \cap \cdots \cap X_m)$, so

$\tilde H_q(X_1 \cup \cdots \cup X_m) \cong \tilde H_{q-m+1}(X_1 \cap \cdots \cap X_m)$.

Thus $A_m$ is valid if $A_{m-1}$ is.

It remains to start the induction with the case $m = 2$. Let $X_1$, $X_2$ be open
homology cells in M. If $X_1 \cap X_2$ is empty as in case $A_2$ (a), then $X_1 \cup X_2$ is the
disjoint union of two homology cells $X_1$, $X_2$; thus $\tilde H_0(X_1 \cup X_2) \ne 0$ and
$\tilde H_q(X_1 \cup X_2) = 0$ for $q \ne 0$, as asserted.

If $X_1 \cap X_2$ is nonempty, the sequence (1) with the homology cells $X_i$ in place
of $U_i$ again shows that $\partial_q : \tilde H_q(X_1 \cup X_2) \cong \tilde H_{q-1}(X_1 \cap X_2)$, which is statement $A_2$ (b).

5. Proof of Lemma $B_n$. The statement $B_n$ being trivial for $n = 0$, we suppose
$n \ge 1$. Divide $R^n$ by $(n-1)$-planes into a tessellation by congruent cubes and
define S to be the family of all these closed nonoverlapping n-cubes filling up $R^n$.
A closed subset of $R^n$ which is the union of k n-cubes of S is called n-cubical of
order k. To each open subset Y of $R^n$ and each compact $C \subset Y$ there is a sufficiently
fine such subdivision of $R^n$ and with respect to it an n-cubical set U of
finite order satisfying $C \subset U \subset Y$. In view of the continuity property (C) it
suffices to verify the statement

D(n, k): If $U \subset R^n$ is n-cubical of order $k \ge 1$, then $H_q(U) = 0$ for $q \ge n \ge 1$.

Of course this is trivial if the isomorphism between singular and simplicial
homology is available. Avoiding it, we prove D(n, k) by simultaneous induction
on n and k. The start of the induction is trivial: in the case D(1, k), U is a finite
union of disjoint intervals; in the case D(n, 1), U is a cube; thus both times
$H_q(U) = 0$ for $q \ge n \ge 1$. Now we consider with respect to a tessellation S an
n-cubical set U of order k, where $n > 1$ and $k > 1$, and we assume D(n', k') is
valid if $n' < n$ or $k' < k$. If U is not pathconnected, it is the union of two disjoint
n-cubical sets $U_1$, $U_2$ of order $< k$ and

$H_q(U) \cong H_q(U_1) \oplus H_q(U_2) = 0$ for $q \ge n \ge 1$

by the inductive hypothesis applied to $U_1$ and $U_2$, and we are done. So assume
U is pathconnected. Let $E_t$ denote the $(n-1)$-plane given by $x_1 = t$ in a cartesian
coordinate system with axes and unit such that the cube

$\{x \in R^n \mid 0 \le x_i \le 1 \text{ for } i = 1, \ldots, n\}$

is a cube of our tessellation S. Define $t_1$ and $t_2$ to be the infimum and the supremum,
respectively, of $\{t \in R \mid E_t \cap U \ne \emptyset\}$. Pick out $E_t$ for $t = \tfrac12(t_1 + t_2)$ and write
$E_0$ for this $(n-1)$-plane and $H_1$, $H_2$ for the two closed half-spaces determined by $E_0$.

If $U_0$, $U_1$, $U_2$ respectively denote the union of all those n-cubes of S which
are contained in U and meet, respectively, the plane $E_0$, the halfspace $H_1$, the
halfspace $H_2$, then obviously $U = U_1 \cup U_2$ and $U_0 = U_1 \cap U_2 \ne \emptyset$ (since U is connected).
In the case $|t_2 - t_1| \le 2$ there are at most two layers of cubes and we have
even $U = U_1 = U_2 = U_0$. But the orthogonal projection of $U_0$ onto the $(n-1)$-cubical
set $U_0 \cap E_0$ is a deformation retraction, so (i) $H_q(U_1 \cap U_2) = H_q(U_0)
\cong H_q(U_0 \cap E_0) = 0$ for $q \ge n-1 \ge 1$ by induction hypothesis. In the case
$|t_2 - t_1| \le 2$ we are done, since $U = U_0$. In the case $|t_2 - t_1| > 2$ we insert (i) in the
Mayer-Vietoris sequence (1) set up for our sets $U_1$ and $U_2$ and by exactness we
get (ii) $H_q(U) = H_q(U_1 \cup U_2) \cong H_q(U_1) \oplus H_q(U_2)$ for $q \ge n$; but both $U_1$ and $U_2$
are n-cubical of order $< k$ and so the inductive hypothesis and (ii) yield $H_q(U) = 0$
for $q \ge n$. This completes the proof.


6. Generalizations. Helly’s Theorem 1 on convex sets has a well-known
analogue for convex subsets of the metrical n-sphere $S^n$, but in this variant the
critical number is $n+2$ rather than $n+1$. In fact, in all the statements and proofs
of Sections 2 and 3 we may obviously substitute $S^{n-1}$ in place of $R^n$, and a shift
of the dimension number yields our assertion. Here a subset Y of the metrical
sphere S is said to be convex if between any two points of Y there exists in S
exactly one shortest geodesic arc and it is entirely contained in Y. The same definition
applies to sets Y in any Riemannian manifold S; we shall see that Theorem
1 also holds with $R^n$ replaced by any Riemannian n-manifold S; only when S
has the homology of an n-sphere must an additional condition be imposed on the
family, or else the critical number will be $n+2$ rather than $n+1$. In this sense
homology-n-spheres (i.e., n-manifolds having the homology of an n-sphere) are the only
manifolds playing an exceptional role in the context of the Helly Intersection
Theorem; by an additional argument involving the fundamental group it would
be possible to restrict the exceptions to homotopy spheres. These statements on
intersections of convex sets again stem from a topological intersection theorem
which we shall now deduce by combining a full exploitation of Lemma $A_m$ with
the analogue of Lemma $B_n$ for n-manifolds.

Manifolds will be without boundary and connected. A space X is said to be
p-acyclic if it is nonempty and if $\tilde H_q(X) = 0$ for all $q \le p$; in particular X is
$(-1)$-acyclic if it is nonempty, and it is p-acyclic for all integers p if and only
if it is a homology cell.


LEMMA $B_n'$ ($n \ge 1$). If Y is an n-manifold, then $H_q(Y) = 0$ for $q > n$; furthermore
$H_n(Y) \cong Z$ or $H_n(Y) = 0$ according to whether Y is compact and orientable
or not.


For a proof cf. [4, p. 121]. We now state our result, which contains theorems 
of Molnar [7, 8, 9] and of Berstein (see [3, p. 125]) as special cases. 


THEOREM 3. Let M denote an n-manifold which is not a homology-n-sphere and
let $\{X_j\}_{j \in J}$ be a finite family of open subsets of M such that, for each choice of r
members of this family, whenever $r \le n+1$ the intersection $X_{j_1} \cap \cdots \cap X_{j_r}$ is
$(n-r)$-acyclic. Then $\bigcap_{j \in J} X_j$ is a homology cell. If M is a homology-n-sphere the
same statement holds under the additional condition that no $n+2$ sets of the family
cover M.


Proof. Assume there exist families in M satisfying the hypotheses of Theorem
3 but not the conclusion. Let $\{X_1, \ldots, X_m\}$ among such families be one
with the minimal number m of members. Theorem 3 applies to any proper subfamily
of $\{X_1, \ldots, X_m\}$, for the hypotheses pass to subfamilies, and due to the
minimality of m we conclude that the intersection of the sets of any proper subfamily
of $\{X_1, \ldots, X_m\}$ is a homology cell. This means that Lemma $A_m$
applies to the chosen family.

Assume $X_1 \cap \cdots \cap X_m = \emptyset$. Under the assumptions of Theorem 3 this
happens only if (i) $m \ge n+2$. Write $Y = X_1 \cup \cdots \cup X_m$. Lemma $A_m$ (a) assures
(ii) $\tilde H_{m-2}(Y) \ne 0$ and (iii) $\tilde H_q(Y) = 0$ for $q \ne m-2$. But Y is itself an n-submanifold
of M, so Lemma $B_n'$ contrasted with (ii) yields $m-2 \le n$, and comparison with
(i) shows $m-2 = n$. But then (ii) together with Lemma $B_n'$ yields that $Y = M$,
and by (ii) and (iii) M must be a homology-n-sphere. Yet in this case, since
$m = n+2$, the additional condition imposed on $\{X_1, \ldots, X_m\}$ in Theorem 3
assures $Y \ne M$ and we arrive at a contradiction.

Now assume $X_1 \cap \cdots \cap X_m \ne \emptyset$. By the choice of our family $X_1 \cap \cdots \cap X_m$
is not a homology cell, so there exists a smallest integer q such that
(iv) $\tilde H_{q-m+1}(X_1 \cap \cdots \cap X_m) \ne 0$ and consequently, by Lemma $A_m$, also (v)
$\tilde H_q(X_1 \cup \cdots \cup X_m) \ne 0$. Since $X_1 \cup \cdots \cup X_m$ is an n-manifold it follows that
$0 \le q-m+1 \le q \le n$ or equivalently (vi) $m \le q+1 \le n+1$. Therefore, on the
assumptions of Theorem 3, $X_1 \cap \cdots \cap X_m$ is $(n-m)$-acyclic, hence $q-m+1 > n-m$
by (iv). This and (vi) imply that $q = n$. But then (v) and the minimality
of q show that $X_1 \cup \cdots \cup X_m$ has the homology of an n-sphere and therefore
M is a homology-n-sphere covered by the sets $X_1, \ldots, X_m$. Since $m \le n+1$
again we arrive at a contradiction to the additional condition that M is not covered
by less than $n+3$ sets of the given family.
Reader’s Comments on Some Papers in this MONTHLY
R. A. ROSENBAUM, Wesleyan University


A. F. Horadam, A Generalized Fibonacci Sequence, 68 (1961) 455-460. 
Miss Sue Dunfee of Hollins College points out that equations (13) and (17) 
should read as follows: 


Hy + Hyg = 2HpHnys + (—1)" ef, 

where f=q for n=1, and f=H,-1 for n>1; 
Hasty + (—1)'H,_- 

Hf, 
C. F. Harding, Matrix of Finite Groups, 74 (1967) 430-432. 

Professor J. L. Berggren of Simon Fraser University remarks that, on pp.
32-34 of “Representation Theory of Finite Groups and Associative Algebras”
by C. W. Curtis and Irving Reiner, the authors describe a method by which the
regular matrix representation can be read off from a suitably arranged group
table, thus contradicting a statement in the article.


=e +O’. 


M. L. Tikoo, Location of the Zeroes of a Polynomial, 74 (1967) 688-690. 
Professor Marcia Ascher of Ithaca College notes that the corollary of 
Theorem 1 should contain the inequality 


[Z| smax(| Coal /[ Ca] 4 [Coal mE A/| Ca) 0) + (Gal /| Gal), 


t=] 
4=1,---,n-—2, where ¢>0 is arbitrary, rather than the stated bound on |Z|,
the latter being correct only for a certain class of ?’s.


J. C. Parnami, On Iterates of Euler's φ-Function, 74 (1967) 967-968.

Professor George K. White of the University of British Columbia and
Professor M. G. Beumer of the Technological University of Delft call attention
to some prior work:

Pillai, S. S., “On a function connected with φ(n),” BAMS, 35 (1929) 837-841.

Shapiro, H. N., “On an arithmetic function arising from the φ-function,”
this MONTHLY, 50 (1943) 18-30.


A. A. Gioia, and A. M. Vaidya, Amicable Numbers with Opposite Parity, 
74 (1967) 969-973. 

Professor Thomas E. Elsner of the General Motors Institute writes that the 
last of the inequalities preceding (2.1) should read: 


art] Draiagts > 22-1. 


Fortunately, this change does not alter the results of the paper.
AN APPLICATION OF SIMPLICIAL HOMOLOGY THEORY
GAIL H. ATNEOSEN, Western Washington State College


The purpose of this note is to illustrate an elementary application of simpli- 
cial homology theory to obtain a geometric result. 


THEOREM. Every polygonal knot in Euclidean 3-dimensional space $E^3$ spans a
surface in $E^3$.


A surface is a compact connected 2-manifold; a knot spans a surface if it is
the manifold boundary of the surface. The result is not new. It is well known
[6, p. 572] that every oriented polygonal knot in $E^3$ spans an orientable surface
in $E^3$ (see also [1, p. 140] and [3]). In fact, J. Levine has given a proof of the
generalization to locally flat n-spheres in $R^{n+2}$ for $n \ge 2$ using transversality
[5, p. 11]. He has further used this to give a classification of knots in codimension
2 for $n \ge 4$; generalizations to $n = 3$ may be obtained using results of J. L.
Shaneson [7].

The proof we give of the above theorem could be presented immediately
after the development of the simplicial homology groups. This would provide
the student with an opportunity to work with the newly acquired notions in a
strongly geometric setting. Some details of the proofs are omitted, but they can
be readily supplied.

First, we observe that all polygonal knots in $E^3$ can be carried by a space
homeomorphism into a polygonal knot that lies in a 3-book. A 3-book is a subset
of $E^3$ that is the union of three geometric 2-simplexes such that each pair of
2-simplexes meets precisely on a single geometric 1-simplex B on the face of each.
The 2-simplexes are called the leaves of the 3-book and B is its back. A polygonal
knot K is said to be nicely embedded in a 3-book if: (i) K intersected with the
back of the 3-book does not contain an arc; (ii) if p is a point of K contained in
the back of the 3-book and s and t are straight line segments of K with endpoint
p, then s and t are contained in distinct leaves of the 3-book. A figure-eight knot
that is nicely embedded in a 3-book is pictured in Figure 1.


FIG. 1


LEMMA 1. Let K be a polygonal knot in $E^3$. Then there is a space homeomorphism
mapping $E^3$ onto $E^3$ such that the image of K is a polygonal knot that is nicely
embedded in a 3-book.


Proof. We may assume that K is in regular position [2, p. 6] and that all the 
double points of a parallel projection of K onto the xy plane are contained in the 
x-axis. There is a space homeomorphism which carries the over-crossings into 
the upper half of the xz plane and is such that the entire image of K is contained 
in the union of the xy plane and the upper half of the xz plane. It then follows 
that there is a 3-book containing the image of K. A slight modification of the 
above homeomorphism, if necessary, will yield a nice embedding of the image 
of K in a 3-book. 
LEMMA 2. Let K be a polygonal knot nicely embedded in a 3-book. Then K spans
a surface in the 3-book.


Proof. We may assume that the 3-book $B^3$ is triangulated in such a manner
that K is the carrier of a subcomplex of the triangulation. This triangulation will
be fixed in the following discussion, and it will be used in computing all simplicial
homology groups. For the coefficient group we shall use the group $Z_2$ of integers
modulo two. Then $H_1(B^3, Z_2)$, the first simplicial homology group of $B^3$ with
coefficients in $Z_2$, is trivial. Thus, every 1-cycle modulo two is also a boundary.
In particular, if $c_1$ is the 1-cycle modulo two that has value one on each 1-simplex
in K and value zero on all other 1-simplexes, then there exists a 2-chain modulo
two $c_2$ such that $\partial c_2 = c_1$ ($\partial$ denotes the boundary operator). Let M be the subset
of $B^3$ that is the union of all 2-simplexes in the triangulation of $B^3$ on which
$c_2$ has nonzero value.
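
The step "find a 2-chain $c_2$ modulo two with $\partial c_2 = c_1$" can be tried out on a toy complex. The sketch below is an illustration under stated assumptions, not the paper's construction: the "3-book" is the coarsest possible one (three triangles sharing the back edge (0, 1)), the quadrilateral cycle is a hypothetical unknotted example, and the brute-force search is only sensible for very small complexes.

```python
from itertools import combinations

def boundary_mod2(triangles):
    """Boundary of a 2-chain over Z_2: the edges lying on an odd number of the triangles."""
    edges = set()
    for tri in triangles:
        for e in combinations(sorted(tri), 2):
            edges ^= {e}          # adding mod 2 = symmetric difference
    return edges

def spanning_chain(all_triangles, cycle):
    """Search all 2-chains mod 2 for one whose boundary equals the given 1-cycle."""
    for r in range(len(all_triangles) + 1):
        for chain in combinations(all_triangles, r):
            if boundary_mod2(chain) == cycle:
                return set(chain)
    return None

# Hypothetical toy 3-book: three leaves glued along the back edge (0, 1).
leaves = [(0, 1, 2), (0, 1, 3), (0, 1, 4)]
# A closed polygon 0-2-1-3-0 meeting the back only in the vertices 0 and 1.
cycle = {(0, 2), (1, 2), (0, 3), (1, 3)}

c2 = spanning_chain(leaves, cycle)   # the 2-simplexes making up M
```

Here the back edge (0, 1) lies on two triangles of the chain and so cancels modulo two, which is the same cancellation the proof uses for 1-simplexes in M − K.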


FIG. 2


We next prove that M is a surface with manifold boundary K. Since $\partial c_2 = c_1$,
every 1-simplex in $M - K$ must be the face of precisely two 2-simplexes in M.
Thus if $x \in M - K$, then x has a neighborhood in M homeomorphic to an open
2-cell. If $x \in K$ and x is not a vertex in the triangulation of $B^3$, then, since K is
nicely embedded and $\partial c_2 = c_1$, it follows that x is contained in the interior of a
1-simplex that is the face of exactly one 2-simplex in M. If $x \in K$ and x is a
vertex, then straightforward arguments considering various cases yield that x
has a neighborhood in M whose closure in M is homeomorphic to a closed 2-cell.
Thus M is a surface in $B^3$ with manifold boundary K.

The theorem then follows from the above two lemmas. Figure 2 shows a
surface spanned by a figure-eight knot that is nicely embedded in a 3-book.
This surface is nonorientable.
ON LOCALLY RECURRENT FUNCTIONS
TIBOR ŠALÁT, University of Komenský, Czechoslovakia

A real function f defined on an interval [a, b] is said to be locally recurrent
on this interval if for every $x \in [a, b]$ and every $\varepsilon > 0$ there exists y with $f(y) = f(x)$
and $0 < |y - x| < \varepsilon$ (cf. [1]).


THEOREM. Let f be a nonconstant continuous locally recurrent function on
[a, b]. Then f is a function of unbounded variation on [a, b].


This has been proved by Marcus [3] but we establish it as a consequence of 
a theorem on the Banach indicatrix. 


Proof. Let $\tau$ denote the Banach indicatrix of f on [a, b]. Thus if y is a real
number let $\tau(y)$ be the number of elements of the set

(1) $\{x \in [a, b];\ f(x) = y\}$

if this set is finite and $\tau(y) = \infty$ if the set (1) is infinite. We also denote by $V_a^b(f)$
the variation of f on [a, b].

Since f is a nonconstant continuous function on [a, b] we have

$m = \min_{x \in [a,b]} f(x) < \max_{x \in [a,b]} f(x) = M$.

It follows from the well-known Darbouxian property of the continuous
functions that for every $y \in [m, M]$ there exists an $x \in [a, b]$ such that $f(x) = y$.
So we have $\tau(y) \ge 1$ for every $y \in [m, M]$. From this and from the fact that f
is a locally recurrent function it is easy to verify that

(2) $\tau(y) = +\infty$

for every $y \in [m, M]$.

We use the relation

$V_a^b(f) = \int_{-\infty}^{\infty} \tau(y)\,dy = \int_m^M \tau(y)\,dy$

(cf. [5] p. 374-375 or [6] p. 246) with (2) and the theorem follows at once.
HAMEL VERSUS SCHAUDER DIMENSION
J. W. EVANS, University of California, and R. A. TAPIA, University of Wisconsin


In this note we define the notion of Schauder dimension of a Banach space
and show that the Schauder dimension is less than the Hamel dimension, as is
known to be true for separable Banach spaces, if and only if the Schauder dimension
possesses a property which the cardinal $\aleph_0$ possesses. We shall denote the
cardinality of a set A by |A|. Let X be a real Banach space and M a subset of X.


DEFINITION 1. The set M is weakly (strongly) linearly independent if it is not
contained in the linear (closed linear) span of any proper subset.


DEFINITION 2. A maximal weakly (strongly) linearly independent subset of X 
is called a Hamel (extended Schauder) basis for X. 


DEFINITION 3. The cardinality of any Hamel (extended Schauder) basis is 
called the Hamel (Schauder) dimension of X. 


REMARK. A set which is strongly linearly independent is often called topo- 
logically free, while extended Schauder bases are often referred to as minimal 
generating sets. It is extremely difficult to extend the concept of a Schauder basis 
from separable Banach spaces to arbitrary Banach spaces if every element in 
the space is still required to have a unique representation in terms of the basis, 
as is true in the separable case. This difficulty arises from the subtleties involved 
in attempting to formulate a workable definition of conditional convergence of 
series summed over an arbitrary countable set. The obvious generalization of 
merely introducing a linear ordering or well-ordering on the index set, as is sug- 
gested in [1] and [7], will not work. Markouchevitch [5] very cleverly drops 
this requirement and gives the following generalization of a Schauder basis. 


DEFINITION 4. The set $\{x_\alpha \mid \alpha \in A\} \subset X$ is called a Markouchevitch basis for
X if the following conditions, referred to collectively as (1), hold:
(i) (biorthogonality) there exists a family of continuous linear functionals
$\{f_\alpha \mid \alpha \in A\}$ such that $f_\alpha(x_\beta) = \delta_{\alpha\beta}$ (Kronecker delta);
(ii) (totality of $x_\alpha$) for any continuous linear functional f, if $f(x_\alpha) = 0$ for all
$\alpha$ then $f = 0$; and
(iii) (totality of $f_\alpha$) for any $x \in X$, if $f_\alpha(x) = 0$ for all $\alpha$ then $x = 0$.

REMARK. It is clear that any Schauder basis and (in Hilbert space) any complete
orthonormal set satisfy these conditions. In fact, in Hilbert space conditions
(ii) and (iii) are equivalent. The following proposition shows that a
Markouchevitch basis is an extended Schauder basis.


PROPOSITION 1. The set $\{x_\alpha \mid \alpha \in A\} \subset X$ is an extended Schauder basis for X
if and only if conditions (i) and (ii) of (1) are satisfied.


The proof is an immediate application of the Hahn-Banach theorem 
[3, p. 62]. 


REMARK. It is interesting to note that if continuity is not required in condi- 
tions (i) and (ii) of (1), then Proposition 1 holds with extended Schauder re- 
placed by Hamel. Our definition of extended Schauder basis amounts to drop- 
ping requirement (iii) in (1). 

Some authors follow Arsove and Edwards [1] and drop requirement (ii) 
from (1). In doing this certain paradoxes may arise, e.g., nonseparable spaces 
may have countable bases, see [1]. It seems very unnatural to allow bases whose 
linear span is not dense in the space, i.e., do not satisfy (ii) of (1). 


PROPOSITION 2. Any two Hamel (extended Schauder) bases for X have the same
cardinalities.


Proof. Let $\{x_\alpha \mid \alpha \in A\}$ and $\{y_\beta \mid \beta \in B\}$ be two Hamel (extended Schauder)
bases for X. Assume $|A| < |B|$. To each $x_\alpha$ corresponds a finite (countable) set
$B_\alpha \subset B$ such that $x_\alpha$ is contained in the linear (closed linear) span of $\{y_\beta \mid \beta \in B_\alpha\}$.
Let $D = \bigcup_{\alpha \in A} B_\alpha$. We have $|D| \le \aleph_0 \cdot |A| = |A| < |B|$, therefore $\{y_\beta \mid \beta \in D\}$ is
a proper subset of $\{y_\beta \mid \beta \in B\}$ whose linear (closed linear) span is all of X. We
arrive at a similar contradiction by assuming $|A| > |B|$.


PROPOSITION 3. If B is any cardinal, then there exist Banach spaces with
Schauder dimension B.


Proof. Let $\Omega$ be any point set with $|\Omega| = B$. It is well known that $L_p(\Omega)$
($p \ge 1$), with the discrete measure in $\Omega$, is a Banach space, and it follows that
$\{I_\omega \mid \omega \in \Omega\}$, where $I_\alpha(\beta) = \delta_{\alpha\beta}$, $\alpha, \beta \in \Omega$, is an extended Schauder basis for $L_p(\Omega)$.


DEFINITION 5. A cardinal is of type I if it is the cardinality of a set which is the
union of an infinite sequence of sets with strictly increasing cardinalities.


THEOREM 1. If H is a Hamel basis and S an extended Schauder basis for X,
then

(i) $|H| = 2^{|S|}$ if $|S|$ is of type I and

(ii) $|H| = |S|$ if $|S|$ is not of type I.

Proof. If $|S|$ is finite, then $|S|$ is clearly not of type I and the two notions
of bases coincide. When $|S| \ge \aleph_0$, we know from [3, p. 56 and 75] that
$|X| = |H|$ and that the number of finite subsets of S is $|S|$, while the number
of countable subsets of S is $|S|^{\aleph_0}$. Each $x \in X$ is the limit of a sequence with
terms in the linear span of S. There are at most $|S|^{\aleph_0}$ such sequences, so
$|X| \le |S|^{\aleph_0}$; but using (i) of (1) we see that to each distinct countable subset
$\{x_i \mid i = 1, 2, \ldots\} \subset S$ we may associate distinct

$x = \sum_{i=1}^{\infty} \frac{x_i}{2^i \|x_i\|} \in X$,

so that $|S|^{\aleph_0} \le |X|$, or $|X| = |S|^{\aleph_0}$. This gives us $|S| \le |S|^{\aleph_0} = |X| = |H| \le 2^{|S|}$.
The theorem now follows, upon accepting the generalized continuum hypothesis,
from the fact that $|S| < |S|^{\aleph_0}$ if and only if $|S|$ is of type I [6].
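
The two facts about the series used in the proof, that it converges in X and that distinct countable subsets give distinct sums, can be spelled out as follows. This is a sketch filling in the routine details under the paper's hypotheses; the functionals $f_\alpha$ are those of condition (i) of (1), and the symbols $T$, $T'$, $x'$ are introduced here for illustration only.

```latex
% Convergence: the series is absolutely convergent, and X is complete.
\[
  \sum_{i=1}^{\infty} \left\| \frac{x_i}{2^i \|x_i\|} \right\|
  = \sum_{i=1}^{\infty} \frac{1}{2^i} = 1 < \infty .
\]
% Distinctness: let T = \{x_i\} and T' be distinct countable subsets of S with
% sums x and x', and suppose x_\alpha = x_k \in T but x_\alpha \notin T'.
% Each f_\alpha is continuous and linear, so it may be applied term by term:
\[
  f_\alpha(x) = \sum_{i=1}^{\infty} \frac{f_\alpha(x_i)}{2^i \|x_i\|}
  = \frac{1}{2^k \|x_k\|} \neq 0,
  \qquad
  f_\alpha(x') = 0,
\]
% hence x \neq x'.
```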


COROLLARY 1. There exist real Banach spaces of equal Hamel dimension, but
unequal Schauder dimension.


Proof. If B is any cardinal of type I, then by Proposition 3 there exist Banach
spaces with Schauder dimension B and Schauder dimension $2^B$, but from the
theorem both these spaces have Hamel dimension $2^B$.


COROLLARY 2 (Goffman). If B is any cardinal, then there exist Banach spaces
with Hamel dimension B if and only if B is not of type I.


Proof. If B is not of type I, then by the theorem the Banach space in Proposition
3 also has Hamel dimension B. Now let X be a real Banach space with
Hamel basis

$\{x_\alpha \mid \alpha \in D\}$ and $D = \bigcup_{i=1}^{\infty} A_i$, where $|A_{i+1}| > |A_i|$.

If $D_n$ denotes the closed linear span of $\{x_\alpha \mid \alpha \in \bigcup_{i=1}^{n} A_i\}$, then $D_n$ is a closed
proper subspace of X and therefore nowhere dense. Also $X = \bigcup_{n=1}^{\infty} D_n$; but this
contradicts the Baire category theorem [3, p. 121].


REMARK. The cardinal $\aleph_0$ is of type I, while the cardinality of the continuum
is not.
RESEARCH PROBLEMS
EDITED BY VICTOR KLEE


In this Department the Monthly presents easily stated research problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied 
by relevant references (if any are known to the author) and by a brief description of known 
partial results. Material should be sent to Victor Klee, Department of Mathematics, University 
of Washington, Seattle, WA 98105. 


TO WHAT LIMITS DO COMPLEX ITERATED RADICALS CONVERGE?
C. S. OGILVY, Hamilton College


Let $f(x) = \lim_{n\to\infty} f_n(x)$ be defined for all positive real x for which this limit
exists, where $f_1(x) = \sqrt{x}$ and $f_n(x) = \sqrt{x + f_{n-1}(x)}$, $n > 1$. That is, f(x) is the limit
of the partial radicals of the iterated radical

$R(x) = \sqrt{x + \sqrt{x + \sqrt{x + \cdots}}}$.

If f(x) exists then it is also equal to $\lim_{n\to\infty} f_{n+1}(x)$, and we can write $y = f(x)
= \sqrt{x + y}$, $x > 0$. Solving this quadratic for y yields $y = \tfrac12(1 \pm \sqrt{1 + 4x})$, the
parabola whose axis is $y = \tfrac12$ with vertex at $(-\tfrac14, \tfrac12)$ and focus at $(0, \tfrac12)$. We can
separate this into two functions, $y_1 = \tfrac12(1 + \sqrt{1 + 4x})$ and $y_2 = \tfrac12(1 - \sqrt{1 + 4x})$, the
upper and lower “halves” of the parabola respectively.

It is well known that R(x) converges to y:(x) for all x>0, [1]. But note that 
R(0) =0=42(0). 

The first problem is to determine the convergence of R(x) for $-\tfrac14 \le x < 0$,
where the partial radicals $f_n(x)$ are all complex. We seek to prove (or disprove)
the following conjecture, based on numerical calculation and plotting of the
first few $f_n$:


1. For $-\tfrac14 \le x < 0$, R(x) converges to the real value $y_1(x)$, no matter whether
the square root of a complex number is consistently taken as the one with the
least positive amplitude or consistently taken as the other one.
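
The numerical experiment behind a conjecture of this kind can be sketched as follows. This is only an illustration, not the author's calculation: the function names are hypothetical, and `cmath.sqrt` always returns the principal square root, so only one of the two branch conventions in the conjecture is explored. Agreement at a sample point is evidence, not proof.

```python
import cmath


def partial_radicals(x, n_terms):
    """Return [f_1(x), ..., f_n(x)] with f_1 = sqrt(x), f_n = sqrt(x + f_{n-1}).

    cmath.sqrt is the principal branch; this choice is an assumption of the
    sketch, not something fixed by the text.
    """
    values = []
    f = cmath.sqrt(x)
    values.append(f)
    for _ in range(n_terms - 1):
        f = cmath.sqrt(x + f)
        values.append(f)
    return values


def y1(x):
    """Upper half of the parabola: y_1 = (1 + sqrt(1 + 4x)) / 2."""
    return (1 + cmath.sqrt(1 + 4 * x)) / 2


# Distance between the 200th partial radical and y_1 at a sample point
# in the interval -1/4 <= x < 0.
sample_x = -0.2
gap = abs(partial_radicals(sample_x, 200)[-1] - y1(sample_x))
```

Near the endpoint $x = -\tfrac14$ the iteration contracts very slowly, so many more terms would be needed there before the gap becomes small.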

Is there any iterated radical that converges to values given by $y_2(x)$, $x \ne 0$?
A logical candidate is

$$Q(x) = -\sqrt{x - \sqrt{x - \sqrt{x - \cdots}}},$$

because if Q has a limit y, then $y = -\sqrt{x + y}$, which leads to the same quadratic
as before. It is clear that Q(1) diverges.
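
The divergence of Q(1) can be seen by listing the partial radicals. The sketch below assumes they are defined by $q_1 = -\sqrt{x}$ and $q_n = -\sqrt{x + q_{n-1}}$ (the recurrence implied by $y = -\sqrt{x+y}$); the function name is hypothetical and the principal square root is used throughout.

```python
import cmath


def q_partials(x, n_terms):
    """Return [q_1, ..., q_n] with q_1 = -sqrt(x), q_n = -sqrt(x + q_{n-1})."""
    values = []
    q = -cmath.sqrt(x)
    values.append(q)
    for _ in range(n_terms - 1):
        q = -cmath.sqrt(x + q)
        values.append(q)
    return values


# At x = 1 the partial radicals alternate between -1 and 0, so no limit exists.
at_one = [round(v.real) for v in q_partials(1.0, 6)]

# At x = 2 the same recurrence settles near -1, a root of y^2 = x + y.
at_two = q_partials(2.0, 80)[-1]
```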
Our next conjecture is


2. Q(x) converges to $y_2(x)$ for all x > 1.

It is easy to prove by induction that if for any x < 0, $R(x) \to z$, then $Q(x) \to \bar z$,
the complex conjugate of z. Thus if conjecture 1 is correct, then we know that
the values of Q(x) for $-\tfrac14 < x < 0$ are given not by $y_2$ but, surprisingly, by $y_1$. In
the range 0 < x < 1, the partial radicals associated with Q are complex for all
$n \ge 2$, whereas the corresponding $f_n(x)$ are real, and the induction proof cannot
be carried over. Nevertheless, it would still appear that


3. For all 0 < x < 1, $Q(x) \to y_1(x)$.

One may also ask:


4. What iterated radical, if any, converges to $y_2(x)$ for $-\tfrac14 < x < 0$ and 0 < x < 1?

Finally, a broader objective would be:


5. Develop procedures by which to investigate the convergence of more general
iterated radicals, such as

$$\sqrt{a_1 + \sqrt{a_2 + \sqrt{a_3 + \cdots}}},$$

where the $a_i$ are negative or complex. Herschfeld [1] remarks only that
"Convergence questions appear to become very difficult in such cases."
ARE THERE AN INFINITY OF UNITARY PERFECT NUMBERS?
M. V. SUBBARAO, University of Alberta


A divisor d of n is said to be unitary if d and n/d are relatively prime. We
shall denote the sum of the unitary divisors of n by $\sigma^*(n)$. Let n be called a
unitary perfect number provided $\sigma^*(n) = 2n$.

The inevitable question arises: are there an infinity of unitary perfect
numbers? P. Erdős, to whom the writer mentioned this problem in 1965, expressed
the opinion that it might be a very difficult one, comparable to the problem of
odd perfect numbers, and he readily offered a prize of $10 for the first complete
solution, to which the writer offers a supplemental prize of an equal amount.

It is trivial to show that there are no odd unitary perfect numbers. Suppose
n is an even unitary perfect number and is of the form $n = 2^a m$, where m is odd
and has r distinct prime divisors. In a paper in 1965, L. J. Warren and the
writer showed, by elementary methods, the following theorem:


THEOREM A.

(i) If r = 1, then n = 6.
(ii) If a = 1, then n = 6 or 90.
(iii) If a = 2, then n = 60.
(iv) If r = 2, then n = 60 or 90.
(v) a cannot be 3, 4, 5 or 7.
(vi) r cannot be 3 or 5.
(vii) If a = 6, then n = 87,360.
(viii) If r = 4, then n = 87,360.
A year ago, the writer together with three undergraduate students, T. J.
Cook, R. S. Newberry and J. M. Weber, (participants in the Undergraduate 
Research Participation Program under National Science Foundation Grant No. 
GY 4599 to the University of Missouri) obtained further results in this direction 
including the following: 


THEOREM B. Let $n = 2^a m$ be unitary perfect. With the same notation as in
Theorem A, (i) it is not possible for a = 8, 9, 10; (ii) it is not possible for r = 6.


The proof involves extensive and exhausting calculations using a desk calcu- 
lator. The details are too long to be shown here. 

These theorems can be used to show, for example, that after 87,360, there 
can be no unitary perfect number with less than 20 digits. The writer was re- 
cently informed, however, that a graduate student at the University of Tennes- 
see, Mr. Charles R. Wall, found by accident another unitary perfect number, 
namely 


$$2^{18} \cdot 3 \cdot 5^4 \cdot 7 \cdot 11 \cdot 13 \cdot 19 \cdot 37 \cdot 79 \cdot 109 \cdot 157 \cdot 313,$$


a number with 24 digits! 
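
The definition is easy to check by machine. The sketch below is an illustration only (function names are hypothetical): it uses the fact that $\sigma^*$ is multiplicative with $\sigma^*(p^e) = 1 + p^e$, factors n by trial division, and tests $\sigma^*(n) = 2n$ for the numbers mentioned above.

```python
def sigma_star(n):
    """Sum of the unitary divisors of n: the product of (1 + p**e) over the
    prime powers p**e that exactly divide n."""
    total = 1
    d = 2
    while d * d <= n:
        if n % d == 0:
            prime_power = 1
            while n % d == 0:
                n //= d
                prime_power *= d
            total *= 1 + prime_power
        d += 1
    if n > 1:  # a single prime factor is left over
        total *= 1 + n
    return total


def is_unitary_perfect(n):
    """True when sigma*(n) = 2n."""
    return sigma_star(n) == 2 * n


def unitary_perfect_up_to(limit):
    """Brute-force list of unitary perfect numbers not exceeding limit."""
    return [n for n in range(1, limit + 1) if is_unitary_perfect(n)]


# The 24-digit number reported above.
WALL = 2**18 * 3 * 5**4 * 7 * 11 * 13 * 19 * 37 * 79 * 109 * 157 * 313
small_cases = unitary_perfect_up_to(1000)
```

A brute-force search of this kind is practical only for small limits; it says nothing about the gap between 87,360 and the 24-digit example, which is what Theorems A and B address.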

Therefore it seems rather unsafe to make the conjecture that there are only a 
finite number of unitary perfect numbers, but the writer is still inclined to make 
it! 

Some other results involving these numbers will be given elsewhere. 
ISOLATION OF ZEROS IN THE SECOND ORDER LINEAR
DIFFERENTIAL EQUATION


M. R. CULLEN, Louisiana State University

Let $\psi(x)$ be a solution ($\not\equiv 0$) to the second order linear differential equation

$$y'' + a_1(x)y' + a_2(x)y = \beta(x),$$

where $a_1(x)$, $a_2(x)$, and $\beta(x)$ are continuous on (a, b). When $\beta(x) \equiv 0$, it is well
known that the zeros of $\psi(x)$ are isolated, and this appears in many texts as an
application of the fundamental uniqueness theorem [1, p. 105]. We wish to
show that in case $\beta(x)$ has isolated zeros on (a, b), the same conclusion holds for
$\psi(x)$. This result is already known. In the more general case of a nonhomogeneous
nth order linear differential equation, the fact that $\psi(x)$ has isolated zeros may
be deduced from a mean value theorem of Polya ([2], page 313, Theorem I)
or from a related result of Widder ([3], page 394, Theorem III). The proof of
this mean value theorem, although quite elegant, involves several manipulations
with nth order Wronskians, and the theorem of Widder requires considerable
preparation. The proof which will be presented (for the case n = 2) is elementary,
and should give the student in an intermediate differential equations course
(such as in [1]) an opportunity to put to use some of the key theorems regarding
second order linear differential equations (namely, the uniqueness theorem,
Abel’s formula, and the variation of parameters formula).


LEMMA 1. Suppose $\beta(x) \equiv 0$ and that $\phi(x)$ is a nontrivial solution. Then the
zeros of $\phi(x)$ are isolated.


Proof. Suppose not. Then there exists an $x_0 \in (a, b)$ and a sequence $x_n$ in
(a, b) such that $\phi(x_n) = 0$ and $x_n \to x_0$. Hence, $\phi(x_0) = 0$ and

$$\phi'(x_0) = \lim_{n\to\infty} [\phi(x_n) - \phi(x_0)]/(x_n - x_0) = 0.$$

By the uniqueness theorem [1, p. 105], $\phi$ is identically zero, a contradiction.


LEMMA 2. There exist linearly independent solutions $\phi_1(x)$ and $\phi_2(x)$ to the
homogeneous differential equation such that $\phi_1(x)/\phi_2(x)$ is increasing between
successive zeros of $\phi_2(x)$.


Proof. Let $\phi_1(x)$ and $\phi_2(x)$ be two linearly independent solutions. For
$\phi_2(x) \ne 0$,

$$[\phi_1(x)/\phi_2(x)]' = W(\phi_2, \phi_1; x)/[\phi_2(x)]^2.$$

By Abel’s formula [1, p. 113], the Wronskian $W(\phi_2, \phi_1; x)$ is of constant sign
on (a, b). Hence, replacing $\phi_1$ by $-\phi_1$, if necessary, we may assume that the
sign of the Wronskian is positive, and the result follows.


THEOREM. Suppose $\psi(x)$ is a solution to the nonhomogeneous second order linear
differential equation $y'' + a_1(x)y' + a_2(x)y = \beta(x)$, where $\beta(x)$ has isolated zeros on
(a, b). Then $\psi(x)$ has isolated zeros.


Proof. Suppose not. Then, as in Lemma 1, there is an $x_0$ in (a, b) such that
$\psi(x_0) = 0$ and $\psi'(x_0) = 0$. By the uniqueness theorem [1, p. 90, exercise 6], and
the variation of parameters formula,

$$\psi(x) = \int_{x_0}^{x} \beta(t)\,[\phi_1(t)\phi_2(x) - \phi_2(t)\phi_1(x)]/W(t)\,dt,$$

where $W(t) = W(\phi_1, \phi_2; t)$ and $\phi_1$ and $\phi_2$ are as in Lemma 2. Since the zeros of
$\beta(x)$ and $\phi_2(x)$ are isolated, we may choose r > 0 such that $\beta(x)$ and $\phi_2(x)$ have
no zeros on $(x_0 - r, x_0)$ and $(x_0, x_0 + r)$. First, consider $x \in (x_0, x_0 + r)$. For
$x_0 < t < x$, we have $\phi_1(x)/\phi_2(x) > \phi_1(t)/\phi_2(t)$; and, multiplying by the positive
number $\phi_2(x)\phi_2(t)$, we have

$$\phi_1(x)\phi_2(t) - \phi_2(x)\phi_1(t) > 0.$$

Hence the integrand is of one sign for $x_0 < t < x$; and, therefore, $\psi(x)$ is nonzero.
Similarly, for $x \in (x_0 - r, x_0)$ and $x < t < x_0$, the integrand is of one sign, and hence
$\psi(x)$ is nonzero. But we assumed $x_0$ was a limit point of zeros, a contradiction.

It is easy to see that the above theorem is false if $\beta(x)$ is not identically zero,
yet has nonisolated zeros. Consider $y'' = \beta(x)$, where $\beta(x) = x(20x^2 - 1)\sin(1/x)
- 8x^2\cos(1/x)$. $\beta(x)$ changes sign infinitely often about zero (consider $1/n\pi$),
and $\psi(x) = x^5 \sin(1/x)$ is a solution with x = 0 as a nonisolated zero.
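
The counterexample can be checked numerically. The sketch below is only a sanity check under stated assumptions: it compares a central finite-difference approximation of $\psi''$ with $\beta$ at a few sample points, and evaluates both functions at the points $1/n\pi$. The function names and the step size are choices made for this illustration.

```python
import math


def psi(x):
    """psi(x) = x^5 sin(1/x), extended by psi(0) = 0."""
    return x**5 * math.sin(1.0 / x) if x != 0 else 0.0


def beta(x):
    """Right-hand side beta(x) = x(20x^2 - 1) sin(1/x) - 8x^2 cos(1/x)."""
    return x * (20 * x**2 - 1) * math.sin(1.0 / x) - 8 * x**2 * math.cos(1.0 / x)


def second_difference(f, x, h=1e-4):
    """Central finite-difference approximation of f''(x)."""
    return (f(x + h) - 2 * f(x) + f(x - h)) / (h * h)


# psi'' should agree with beta up to discretization error.
residuals = [abs(second_difference(psi, x) - beta(x)) for x in (0.3, 0.5, 1.0, 2.0)]

# At x = 1/(n*pi), psi vanishes (up to rounding) and beta alternates in sign.
zeros = [1.0 / (n * math.pi) for n in range(1, 21)]
```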

Considerably more can be said about the zeros of nontrivial solutions. For 
many other theorems of this type, the reader is referred to [2] and [3], where 
questions such as the number of zeros on an interval and interpolation by solu- 
tions are discussed. 
THE GENERALIZED JORDAN CANONICAL FORM
D. W. ROBINSON, Brigham Young University


One of the topics usually considered in a course in linear algebra is the study 
of various matrix representations of a linear transformation on a vector space 
over a field. If the field is algebraically closed, then the most useful and well- 
known representation is the Jordan canonical form. However, it is not as well 
known that, by means of a slight extension, this form may essentially be used 
over a larger class of fields. The purpose of this note is to suggest a way to bring 
this “generalized Jordan canonical form” into the classroom. 

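Let X be a vector space of finite dimension over a field K, and let $\tau$ be a
linear transformation on X. First, suppose $\tau$ is cyclic with minimum polynomial
$\pi^r$, where

$$\pi = \lambda^s - a_{s-1}\lambda^{s-1} - \cdots - a_1\lambda - a_0$$

is irreducible over K. Since $\tau$ is cyclic, there is a vector x in X such that

$$\{x, x\tau, \ldots, x\tau^{s-1}, x\pi(\tau), x\pi(\tau)\tau, \ldots, x\pi(\tau)\tau^{s-1}, \ldots, x\pi(\tau)^{r-1}\tau^{s-1}\}$$

is a basis of X. The matrix of $\tau$ relative to this basis is

$$H(\pi^r) = \begin{pmatrix} C & N & 0 & \cdots & 0 \\ 0 & C & N & \cdots & 0 \\ \vdots & & & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & C \end{pmatrix},$$

where

$$C = \begin{pmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & 0 \\ \vdots & & & & \vdots \\ 0 & 0 & 0 & \cdots & 1 \\ a_0 & a_1 & a_2 & \cdots & a_{s-1} \end{pmatrix}, \qquad N = \begin{pmatrix} 0 & 0 & \cdots & 0 \\ \vdots & & & \vdots \\ 0 & 0 & \cdots & 0 \\ 1 & 0 & \cdots & 0 \end{pmatrix}.$$

$H(\pi^r)$ is called the hypercompanion matrix of $\pi^r$.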

More generally, suppose $\tau$ has elementary divisors $\pi_i^{r_{ij}}$, $j = 1, \ldots, m_i$,
$i = 1, \ldots, m$. It may be shown that X is decomposable into a direct sum of
subspaces $X_{ij}$, where $X_{ij}$ is invariant under $\tau$ and the restriction of $\tau$ to $X_{ij}$ is
cyclic with minimum polynomial $\pi_i^{r_{ij}}$, $j = 1, \ldots, m_i$, $i = 1, \ldots, m$. Thus, since
a basis of X may be obtained by stringing together bases of the subspaces $X_{ij}$,
it follows that $\tau$ may be represented by the direct sum of the hypercompanion
matrices $H(\pi_i^{r_{ij}})$, $j = 1, \ldots, m_i$, $i = 1, \ldots, m$. (For further details see, for
example, [2], p. 161-162.)

In case s = 1, the hypercompanion matrix reduces to the usual Jordan matrix
with $C = a_0$ and N = 1. In this case the matrix representation enjoys a number
of convenient computational advantages. For example, it is very easy to express
the value of any power of the Jordan matrix. However, if s > 1, then computational
difficulties arise. Indeed, since C and N do not in general commute, it is
awkward to even express the specific form of a power of $H(\pi^r)$.

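On the other hand, if I is the s-by-s identity matrix, then

$$G(\pi^r) = \begin{pmatrix} C & I & 0 & \cdots & 0 \\ 0 & C & I & \cdots & 0 \\ \vdots & & & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & C \end{pmatrix}$$

behaves in many ways as the usual Jordan matrix. For example, if t is a positive
integer, then

$$G(\pi^r)^t = \begin{pmatrix} C^t & \binom{t}{1}C^{t-1} & \binom{t}{2}C^{t-2} & \cdots & \binom{t}{r-1}C^{t-r+1} \\ 0 & C^t & \binom{t}{1}C^{t-1} & \cdots & \binom{t}{r-2}C^{t-r+2} \\ \vdots & & & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & C^t \end{pmatrix}.$$

We call the rs-by-rs matrix $G(\pi^r)$ the generalized Jordan matrix corresponding
to $\pi^r$. (The reader is referred to [3], where specific use is made of this particular
matrix.)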

The question arises, under what conditions are the matrices $H(\pi^r)$ and
$G(\pi^r)$ similar? In other words, when is it legitimate to use $G(\pi^r)$ as the matrix
representation of a cyclic linear transformation with minimum polynomial $\pi^r$?
Since a matrix is cyclic if and only if the degree of its minimum polynomial is the
order of the matrix, this question is answered by the following lemma, where $\pi'$
denotes the derivative of $\pi$.


LEMMA. If $\pi$ is irreducible and r > 1, then the generalized Jordan matrix $G(\pi^r)$
has minimum polynomial $\pi^r$ if and only if $\pi' \ne 0$.


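Proof. Let $G = G(\pi^r)$ and note from the expression of $G^t$ above that

$$\pi(G) = \begin{pmatrix} \pi(C) & \pi'(C) & * & \cdots & * \\ 0 & \pi(C) & \pi'(C) & \cdots & * \\ \vdots & & & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & \pi(C) \end{pmatrix} = \begin{pmatrix} 0 & \pi'(C) & * & \cdots & * \\ 0 & 0 & \pi'(C) & \cdots & * \\ \vdots & & & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & \pi'(C) \\ 0 & 0 & 0 & \cdots & 0 \end{pmatrix}.$$

Now suppose $\pi' = 0$. Then $(\pi(G))^{r-1} = 0$ and $\pi^r$ is not the minimum polynomial
of G. Conversely, suppose $\pi' \ne 0$. Since $\pi$ is irreducible, this means that $\pi$ and $\pi'$
are relatively prime, and it follows that $\pi'(C)$ is nonsingular. Thus, $(\pi(G))^{r-1} \ne 0$,
$(\pi(G))^r = 0$, and $\pi^r$ is the minimum polynomial of G.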
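
A small worked instance may help. The sketch below is an illustration with hypothetical helper names, not part of the note: it takes $\pi = \lambda^2 + 1$ over the rationals (so $a_0 = -1$, $a_1 = 0$, s = 2) with r = 2, builds $G(\pi^2)$ from the companion block C, and checks that $\pi(G)$ is nonzero with $\pi'(C) = 2C$ in its upper right block while $\pi(G)^2 = 0$.

```python
def mat_mul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def companion(a):
    """Block C for pi = x^s - a[s-1] x^(s-1) - ... - a[0]; last row is a."""
    s = len(a)
    C = [[1 if j == i + 1 else 0 for j in range(s)] for i in range(s - 1)]
    C.append(list(a))
    return C


def generalized_jordan(a, r):
    """rs-by-rs matrix with C on the block diagonal and I just above it."""
    s = len(a)
    C = companion(a)
    G = [[0] * (r * s) for _ in range(r * s)]
    for b in range(r):
        for i in range(s):
            for j in range(s):
                G[b * s + i][b * s + j] = C[i][j]
            if b + 1 < r:
                G[b * s + i][(b + 1) * s + i] = 1
    return G


def poly_at(coeffs, M):
    """Evaluate sum coeffs[k] * M^k by Horner's rule (lowest degree first)."""
    n = len(M)
    result = [[0] * n for _ in range(n)]
    for c in reversed(coeffs):
        result = mat_mul(result, M)
        for i in range(n):
            result[i][i] += c
    return result


def is_zero(M):
    return all(v == 0 for row in M for v in row)


G = generalized_jordan([-1, 0], 2)      # pi = x^2 + 1, r = 2
P = poly_at([1, 0, 1], G)               # pi(G) = G^2 + I
G_cubed = mat_mul(mat_mul(G, G), G)     # top-right block should be 3*C^2
```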

An irreducible polynomial with a nonzero derivative is said to be separable 
([5], p. 65). Thus, as a consequence of this lemma and the observations above,
we have the following (compare [1], p. 115): 


THEOREM. If $\tau$ is a linear transformation on a finite-dimensional vector space
over a field, then $\tau$ may be represented by the direct sum of the generalized Jordan
matrices corresponding to its elementary divisors if and only if every irreducible
factor with multiplicity greater than one in the minimum polynomial of $\tau$ is
separable.


An arbitrary polynomial is said to be separable provided each of its irre- 
ducible factors is separable, and is said to be semisimple provided it is the 
product of distinct irreducible factors. If we call a linear transformation
separable whenever its minimum polynomial is separable, and semisimple
whenever its minimum polynomial is semisimple ([4], p. 678), then we have the
following


COROLLARY. Let $\tau$ be a linear transformation on a finite-dimensional vector
space X. Then $\tau$ is separable if and only if there exist linear transformations $\sigma$ and $\rho$
on X such that $\sigma$ is separable and semisimple, $\rho$ is nilpotent, $\tau = \sigma + \rho$, and $\rho\sigma = \sigma\rho$.


Proof. Suppose that each of the irreducible factors of the minimum polynomial
of $\tau$ is separable. Then, by the theorem above, we may represent $\tau$ by
the direct sum of generalized Jordan matrices. But, it is clear from the form of
$G = G(\pi^r)$ that if $S = \mathrm{diag}(C, C, \ldots, C)$ and $R = G - S$, then S has minimum
polynomial $\pi$, R is nilpotent, $G = S + R$, and $RS = SR$. Hence, by piecing these
parts together, we may readily identify the required linear transformations
$\sigma$ and $\rho$.

Conversely, if such $\sigma$ and $\rho$ exist, then let $\mu = \pi_1 \cdots \pi_m$ be the minimum
polynomial of $\sigma$, where each $\pi_i$ is irreducible and separable. Since $\mu(\sigma) = 0$,
$\tau = \sigma + \rho$, and $\rho\sigma = \sigma\rho$, it follows that $\mu(\tau) = \mu(\sigma + \rho) = \mu(\sigma) + \omega(\rho, \sigma)\rho = \omega(\rho, \sigma)\rho$
for some $\omega(\rho, \sigma)$. Hence, since $\rho$ is nilpotent, $\mu(\tau)$ is nilpotent and the minimum
polynomial of $\tau$ divides a power of $\mu$. In particular, since each $\pi_i$ is separable,
so also is each of the irreducible factors of the minimum polynomial of $\tau$. That is,
$\tau$ is separable.

The $\sigma$ and $\rho$ of this corollary are unique and are called, respectively, the
semisimple and nilpotent parts of $\tau$. (See, for example, [4], p. 679.) They may,
of course, be constructed independent of the discussion above. This in turn
provides an alternative approach to the discussion of the generalized Jordan
form. Indeed, given this decomposition of $\tau$, it is a simple matter to construct a
basis of X, so that the matrix representation of $\tau$ is in the generalized Jordan
form.

To illustrate this fact, we conclude this note by constructing such a basis in
case $\tau$ is again cyclic with minimum polynomial $\pi^r$, where $\pi$ is irreducible,
separable, and of degree s. In this case, the semisimple part $\sigma$ has minimum
polynomial $\pi$, and the nilpotent part $\rho$ is nilpotent of index r. Moreover, since $\sigma$ and
$\rho$ commute and $\tau = \sigma + \rho$, if x is such that X is spanned by $\{x\tau^i\}$, then X is also
spanned by $\{x\sigma^i\rho^j\}$. Consequently,

$$\{x, x\sigma, \ldots, x\sigma^{s-1}, x\rho, x\sigma\rho, \ldots, x\sigma^{s-1}\rho, \ldots, x\sigma^{s-1}\rho^{r-1}\}$$

is a basis of X. Finally, it is easily checked that $G(\pi^r)$ is the matrix of $\tau$ relative to
this basis.
AN ELEMENTARY PROOF THAT PRIMES ARE SCARCE
E. L. SPITZNAGEL, JR., Washington University


In teaching basic number theory to a freshman mathematics class, I became
interested in finding a proof that the density of primes is o(1) which would be
intelligible to most of my students. I believe the proof sketched below meets
this criterion. It starts from first principles and appears to be simpler than the
one given by Hardy and Wright. As usual, we let $\pi(n)$ denote the number of
primes which are $\le n$.


THEOREM. Let C be any number > 0. Then there is a natural number N such
that $\pi(n)/n < C$ for all $n \ge N$.


Proof. For each natural number n, let i be the natural number such that
$2^{2i-2} < n \le 2^{2i}$. Then

$$\pi(n)/n \le \pi(2^{2i})/n < \pi(2^{2i})/2^{2i-2} = 4\pi(2^{2i})/2^{2i}.$$

Let j be any natural number. Each prime p, $2^{j-1} < p \le 2^j$, is a divisor of the
binomial coefficient

$$\binom{2^j}{2^{j-1}}.$$

From this fact we deduce the following series of inequalities:

$$2^{(j-1)(\pi(2^j) - \pi(2^{j-1}))} \le \prod_{2^{j-1} < p \le 2^j} p \le \binom{2^j}{2^{j-1}} \le 2^{2^j}.$$

Now if we take the exponent of 2 on each end of this series, we find, for j > 1,

$$(*)\qquad \pi(2^j) - \pi(2^{j-1}) \le 2^j/(j-1).$$

Upon replacing the denominator j − 1 in (*) with the number 1, we find that
the number of primes $p \le 2^i$ is less than or equal to $2^i + 2^{i-1} + \cdots + 2^2 + 1
< 2^{i+1}$. Also, if the denominator j − 1 in (*) is replaced by the number i, it follows
that the number of primes p, $2^i < p \le 2^{2i}$, is less than or equal to

$$(2^{2i} + 2^{2i-1} + \cdots + 2^{i+1})/i < 2^{2i+1}/i.$$

Therefore $\pi(2^{2i}) < 2^{i+1} + (2^{2i+1}/i)$, and thus

$$\pi(2^{2i})/2^{2i} < 1/2^{i-1} + 2/i.$$

Pick k to be the smallest natural number such that $1/2^{k-1} + 2/k < C/4$. Then for
all $i \ge k$, $1/2^{i-1} + 2/i < C/4$. Finally, set $N = 2^{2k}$. If $n \ge N$, then $i \ge k$, and so

$$\pi(n)/n < 4\pi(2^{2i})/2^{2i} < 4C/4 = C,$$

as was to be shown.
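
The three facts used in the proof can be spot-checked for small exponents. The sketch below is an illustration only, with hypothetical names and an arbitrary cutoff: it sieves the primes up to $2^{16}$ and tests the divisibility claim, inequality (*), and the final bound on $\pi(2^{2i})/2^{2i}$. A finite check of this kind does not replace the argument.

```python
from math import comb


def prime_flags(limit):
    """Sieve of Eratosthenes: flags[n] is True exactly when n is prime."""
    flags = [False, False] + [True] * (limit - 1)
    for p in range(2, int(limit ** 0.5) + 1):
        if flags[p]:
            for multiple in range(p * p, limit + 1, p):
                flags[multiple] = False
    return flags


MAX_I = 8                                  # arbitrary cutoff for this sketch
FLAGS = prime_flags(2 ** (2 * MAX_I))
COUNT = []                                 # COUNT[n] = number of primes <= n
running = 0
for flag in FLAGS:
    running += flag
    COUNT.append(running)


def primes_divide_binomial(j):
    """Every prime p with 2^(j-1) < p <= 2^j divides C(2^j, 2^(j-1))."""
    b = comb(2 ** j, 2 ** (j - 1))
    return all(b % p == 0
               for p in range(2 ** (j - 1) + 1, 2 ** j + 1) if FLAGS[p])


def star_holds(j):
    """Inequality (*), cleared of its denominator, for j > 1."""
    return (j - 1) * (COUNT[2 ** j] - COUNT[2 ** (j - 1)]) <= 2 ** j


def density_bound_holds(i):
    """pi(2^(2i)) / 2^(2i) < 1 / 2^(i-1) + 2 / i."""
    return COUNT[4 ** i] / 4 ** i < 1 / 2 ** (i - 1) + 2 / i
```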
Note that the proof makes no use of the concept of logarithm, with which, in 
the writer’s experience, many high school graduates claim they are not familiar. 




However, to someone who is familiar with logarithms, the proof actually yields
$\pi(n)/n = O(1/\log n)$, though with a larger bounding constant than the one
usually obtained in a proof of Tchebychev’s Theorem.


I am indebted to Larry Dornhoff, who read an early version of this proof and suggested several 
improvements. In particular, his idea of using the two telescoping sums made the argument much 
more straightforward. 


A NOTE CONCERNING THE LAW OF QUADRATIC RECIPROCITY
PETER HAGIS, JR., Temple University


If p and q are distinct odd primes we shall write pRq if p is a quadratic
residue of q, and pNq if p is a quadratic nonresidue of q. For a given odd prime p
we consider the problem of determining all the odd primes q such that pRq.
Such problems are considered for special values of p in most texts on elementary
number theory, and the results to be obtained here are implicit in the discussions
of the applications of the Quadratic Reciprocity Law to be found in these books.
I have not, however, seen an explicit statement of the following


THEOREM 1. Let p and q be distinct odd primes and write $q = 2pk \pm r$, $0 < r < p$,
if $p \equiv 1 \pmod 4$; and $q = 4pk \pm r$, $0 < r < 4p$, $r \equiv 1 \pmod 4$, if $p \equiv 3 \pmod 4$. Then
pRq if and only if rRp.


Proof. If (r/p) is the Legendre symbol we recall that (r/p) = (s/p) if
$r \equiv s \pmod p$, and $(-r/p) = \pm(r/p)$ according as $p \equiv \pm 1 \pmod 4$. Therefore,
if $p \equiv 1 \pmod 4$ we have from the Law of Quadratic Reciprocity $(p/q) = (\pm r/p)
= (r/p)$, from which the conclusion of the theorem is immediate. If $p \equiv 3 \pmod 4$
then, since $r \equiv 1 \pmod 4$, $(p/(4pk + r)) = (r/p)$. Also, since $-r \equiv 3 \pmod 4$ we
have $(p/(4pk - r)) = -(-r/p) = (r/p)$. This completes the proof.

We give two examples. If p = 13 then the least positive odd quadratic residues
of 13 are 1, 3, 9. Therefore, 13Rq if q is of the form $26k \pm r$ where r = 1, 3, 9,
and 13Nq otherwise. If p = 7 then the positive odd quadratic residues of 7 less
than 28 and congruent to 1 modulo 4 are 1, 9, 25. Therefore, 7Rq if and only
if $q = 28k \pm r$, r = 1, 9, 25.

A similar result holds for —p. For we have 


THEOREM 2. Let p and q be distinct odd primes and write $q = 2pk + r$, $0 < r < 2p$,
if $p \equiv 3 \pmod 4$; and $q = 4pk + r$, $0 < r < 4p$, if $p \equiv 1 \pmod 4$. If $p \equiv 3 \pmod 4$ then
−pRq if and only if rRp. If $p \equiv 1 \pmod 4$ then −pRq if and only if $r \equiv 1 \pmod 4$
and rRp, or $r \equiv 3 \pmod 4$ and rNp.


Proof. We again employ the Law of Quadratic Reciprocity. Assume first
that $p \equiv 3 \pmod 4$. If $q \equiv 1 \pmod 4$ then $(-p/q) = (p/q) = (r/p)$, while if
$q \equiv 3 \pmod 4$ then $(-p/q) = -(p/q) = (r/p)$. If $p \equiv r \equiv 1 \pmod 4$ then $(-p/q)
= (p/q) = (r/p)$. If $p \equiv 1 \pmod 4$ and $r \equiv 3 \pmod 4$ then $(-p/q) = -(p/q)
= -(r/p)$.

For example, if p = 7 then the positive odd quadratic residues of 7 less than
14 are 1, 9, 11. Therefore, −7Rq if and only if $q = 14k + r$ where r = 1, 9, 11. If
p = 5 then the odd quadratic residues of 5 less than 20 and congruent to 1
modulo 4 are 1, 9, while the odd nonresidues less than 20 and congruent to 3
modulo 4 are 3, 7. Therefore, −5Rq if and only if $q = 20k + r$, r = 1, 3, 7, 9.
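
Both theorems can be compared against a direct computation. The sketch below is an illustration with hypothetical function names: it decides residuacity by Euler's criterion, computes the remainder r described in each theorem, and leaves the comparison over a range of small primes to the reader (or to a test harness).

```python
def is_residue(a, q):
    """Euler's criterion: a is a quadratic residue of the odd prime q.
    Assumes q does not divide a."""
    return pow(a % q, (q - 1) // 2, q) == 1


def theorem1_predicts(p, q):
    """Prediction of Theorem 1 for pRq, via the remainder r it describes."""
    if p % 4 == 1:
        r = q % (2 * p)          # q = 2pk + r or 2pk - r with 0 < r < p
        if r > p:
            r = 2 * p - r
    else:
        r = q % (4 * p)          # q = 4pk + r or 4pk - r with r = 1 (mod 4)
        if r % 4 != 1:
            r = 4 * p - r
    return is_residue(r, p)


def theorem2_predicts(p, q):
    """Prediction of Theorem 2 for -pRq."""
    if p % 4 == 3:
        return is_residue(q % (2 * p), p)
    r = q % (4 * p)
    return is_residue(r, p) if r % 4 == 1 else not is_residue(r, p)


def odd_primes(limit):
    """Odd primes below limit, by trial division."""
    return [n for n in range(3, limit, 2)
            if all(n % d for d in range(3, int(n ** 0.5) + 1, 2))]


# Residue classes from the worked examples, recovered by direct computation.
classes_minus5 = sorted({q % 20 for q in odd_primes(2000)
                         if q != 5 and is_residue(-5, q)})
```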
PANEL DISCUSSION ON ASSISTANCE TO DEVELOPING COLLEGES
LLAYRON L. CLARKSON, Texas Southern University


The purposes of this presentation are to give an overview of some of the 
major problems in the undergraduate mathematics programs in developing in- 
stitutions, with particular emphasis on the so-called Black institutions, and to 
make some recommendations to the Association. 


Brief narrative. It is obvious that the problems and programs in the predomi- 
nantly Black institutions are not fully understood by the major organizations. 
For example, the problem of what to do about the mathematics education of 
the nonscience major continues to cause great concern in the Black institutions. 
To my knowledge, none of these institutions has decided not to teach mathe- 
matics to these students. Yet we recognize that, to a large extent, the remedial 
and compensatory programs we conduct either duplicate or replace the high 
schools’ mathematics instructional programs. But what are we to do? Many 
students do not have the basic mathematical skills necessary to complete success- 
fully nonmathematics courses where mathematics is used. Many want to do 
graduate work in fields requiring some use of mathematics. Many accept teach- 
ing positions and many others take examinations requiring some elementary 
mathematics. 

Many programs seeking better ways to communicate mathematics to this 
segment of the student population are now being conducted. The Association 
should call a series of conferences for the purposes of cataloguing, analyzing, and 
evaluating the present remedial and compensatory type programs. Such a con- 
ference might want to recommend to the Association that it should form an ad 
hoc curriculum committee for the specific purpose of considering curriculum 
materials for nonscience majors in developing institutions; indeed all
institutions that accept students who are academically deficient.

Of course, there are problems in designing efficient programs for majors in 
the various sciences and mathematics. Although many of these students are rela- 
tively competent in the so-called basic skills, they have developed many anti- 
quated patterns and they have poor ideas about the true nature of mathematics 
and about the value and use of mathematics in today’s society. As a conse- 
quence, the well-prepared and well-motivated freshman has to be counseled care- 
fully and frequently. Otherwise, his first real mathematical experience at the 
college level could cause him extreme frustration, and it could cause the science 
community to lose forever a potential professional mathematician. 

For further revelations into these problems I recommend that the Association
form a task force with strong Black representation to catalogue the needs in
the mathematics education programs in Black institutions for the general
enlightenment of the membership at large and to be used specifically in the
formulation of policy and the designation of priorities in the Association.

If the major organizations are really interested in assisting the Black colleges, 
they can provide a real service by demanding that the Black schools be properly 
represented when legislation and guidelines for support programs for higher 
education are being drafted. Upon implementing such demands, measures should 
be taken to assure that well-informed spokesmen are participating, so that a 
reasonable portion of the funds will be allocated for training the disadvantaged, 
including Blacks, and for meeting other concerns of the developing institutions. 
Let me cite some examples of what I call the misallocation of funds. Many in- 
stitutions that were legally segregated just a few years ago or that are physically 
remote from major Black communities are receiving large grants to study 
Blacks. In my opinion, this practice is not a product of sound judgment; rather 
it is a product of political manipulations. Many of these institutions have no 
feeling for the importance of such undertakings, nor the expertise and personnel 
for such investigations. Indeed, many of these grantee institutions must employ 
Blacks to implement their programs. It seems to me that if a real impact is to be 
made on many of the Black community problems plaguing the nation, the Black 
institutions that are close to these problems must be awarded a share of the 
leadership and the support necessary to do the research, to design new programs, 
and to supervise the implementation of these programs. It seems that the Black 
institutions are in a much better position to provide a good answer to, “Relevant 
to What?” 

These are general remarks but there are significant implications to the mathe- 
matical community. I think that it is time for the mathematical community to 
broaden its scope and give rewards to its members who design humanistic appli- 
cations of the subject matter. For example, current statistics indicate that only 
one Black out of 200 goes to college. At Texas Southern, only about 3 out of 
every 10 who enter finish with some kind of bachelor’s degree. Can the
mathematical community stimulate more youngsters into higher education? Can we
provide positive, valuable experiences so as to make better citizens of those who 
drop out? And how can we improve the quality of those who finish? These are all 
questions that must be answered, and they can be answered by the Black insti- 
tutions provided they gain adequate support to do the necessary work. 

The problem of training the next generation of Black mathematics professors 
must be considered. If the present trend continues, the call for Blacks for prac- 
tically all universities and colleges will expand. I am guessing that the real talent 
among Blacks has only been partially tapped. I know that the necessary recruit- 
ing, and at least the initial training of many Blacks, can best be done in an 
atmosphere requiring the least amount of personal adjustment for these stu- 
dents. The Black institutions should be given the leadership in providing the 
undergraduate training for the next generation of Black teachers. There are 
significant implications here for the involvement of the mathematical com- 
munity. 

The final point I wish to make before I summarize my recommendations is 
that one day, very soon I hope, many veterans are going to return from Vietnam
and other foreign lands. A significant number of these will be Black. Indications 
are that they will be seeking better lives for themselves and their families. I 
understand that the federal government has already considered making some 
special eligibility considerations for these veterans. Many of these veterans will 
need some precollege-level work in an adult setting. I am convinced that the 
Black schools are better equipped to serve the veterans. I am also convinced that 
the major universities will be awarded the significant grants to serve the country 
in this special way, unless major organizations such as the Association express 
realistic views on the proper utilization of all the national resources. 

In addition to the problems briefly considered above, many predominantly 
Black institutions are experiencing significant changes that will affect their in- 
structional programs. For example, many well-trained Black professors are 
leaving Black institutions to join the staffs of well-established institutions; 
many bright young students who would probably attend Black schools are at- 
tending large white institutions; a large number of well-trained nonblacks are 
actually seeking teaching positions in Black institutions; and many Black insti- 
tutions are expressing commitments to deal directly with many community 
problems. There are numerous other examples. One can only speculate on the 
implications of these changes. 


Recommendations. The following recommendations to the Association have 

been discussed with some of my colleagues: 

1. We recommend that the Association call a series of two-day regional con- 
ferences for the specific purpose of cataloguing the problems and programs 
of Black institutions, including special programs such as those of the In- 
stitute for Services to Education and The Southern Education Founda- 
tion. 
2. We recommend that the Association appoint an ad hoc curriculum
committee to recommend to developing institutions meaningful alternatives to
the current Committee on the Undergraduate Program in Mathematics 
undergraduate curriculum recommendations. 

3. We recommend that the Association adopt as policy a motion stating that
adequate representation from Black institutions, and indeed from de- 
veloping institutions, be included in initial planning sessions and subse- 
quent sessions when important issues concerning the obligations of the 
Association are to be considered. 

4. We recommend that the Association use its power to influence congress- 
men, funding agencies, etc., to include representation from Black institu- 
tions in the initial sessions where legislation and guidelines for various 
programs concerning mathematics education are being considered. 

5. We recommend that the Association survey present efforts for training the
next generation of Black mathematics professors and then appoint an ad 
hoc committee with the specific charge of recommending to government, 
industry, funding agencies, etc., new ways to get more young Blacks in- 
volved. 

6. We recommend that the Association research the needs in developing in- 
stitutions concerning a variety of instruction-related problems, such as 
adequate graduate faculty, graduate fellowships, staff and student re- 
cruitment, travel expenses for professional activities, upgrading libraries, 
student assistants, visiting professors and lecturers, development of ap- 
plied mathematics offerings, computers in education, precollege centers, 
curriculum development and other professional activities, research in 
learning and teaching mathematics, proper physical facilities, research 
instructorships, etc. 

7. We recommend that the Association do what it can to stop the negative 
publicity the developing institutions, particularly the Black institutions, 
are getting; instead announce that these institutions are valuable assets to 
the national education network, that they do serve the communities, that 
they do conduct some basic research, that they do serve to provide edu- 
cational opportunities to a significant segment of the population who 
would otherwise be denied such an opportunity and that they do turn out 
educated citizens. What more can one ask? 


This article is based on a presentation to the Mathematical Association of America, Eugene, 
Oregon, August 1969. 
E 2208. [1970, 78]. Correction. Proposed by D. Rameswar Rao, Secunderabad,
India
Let $n \ge 5$ and $2 \le b \le n$. Prove


oma = 0 (mod b — 1). 


E 2228. Proposed by C. C. Lindner, Emory University 


Call a latin square of order $n$ an $X_n$-latin square provided that each of the $n$
symbols on which it is based occurs on each of the two diagonals. Show that if
$n=2^k$, $k \ge 2$, there exist $n-2$ mutually orthogonal $X_n$-latin squares.


E 2229. Proposed by J. V. Michalowicz, The Catholic University of America 


Any two positive integers a, b have a g.c.d. (a, b) and an l.c.m. [a, b], where
[a, b] = ab/(a, b). Can this be generalized to express the l.c.m. of any finite
number of elements in terms of g.c.d.'s only?


E 2230. Proposed by B. E. Rodden, Defence Research Board, Toronto, Canada 

(A) Urns I and II each contain exactly $n$ balls numbered consecutively from
1 to $n$. One ball is drawn from urn I. Balls are then drawn one at a time from urn
II until the same-numbered ball is found. No balls are replaced in the urns. The
process is repeated until urn II is empty. Find $p_{r,n}$, the probability of precisely $r$
matches.
(B) For a more complex problem, m balls are drawn from urn I. Again balls
are drawn one at a time from urn II and compared with the balls from urn I.
When a match is made, one new ball is drawn from urn I to replace the matched
ball, and the process is continued. When urn I is empty, the process continues
with the remaining unmatched balls until urn II is empty. Find $;,, the probabil-
ity of precisely r matches in the case m=2.


E 2231.* Proposed by M. S. Klamkin, Ford Scientific Laboratory 

It is a known result that if the centroid of the vertices and the centroid of the 
area (both uniformly weighted) of a quadrilateral coincide, then the figure is a 
parallelogram. If the centroids of the vertices, of the edges, and of the area (all 
uniformly weighted) of a pentagon all coincide, must the figure be a regular 
pentagon? 


E 2232. Proposed by H. D. Ruderman, Hunter College High School, New York 
City 

Let $U_n$ be the smallest number of different unit fractions totalling 1 where the
largest unit fraction is $1/n$. For example, $U_1=1$, $U_2=3$, and $U_3=5$ because

$$1 = \frac13 + \frac14 + \frac15 + \frac16 + \frac1{20}.$$

It is well known that for every $n$ there is a finite number of terms giving the sum
1. Find an upper bound on $U_n$.
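
The small values quoted in the problem can be confirmed by exhaustive search. The following Python sketch is an illustration only and is not part of the problem: it assumes the reading that the largest fraction is exactly 1/n, and the names `can_finish`, `smallest_count` and the cutoff `max_terms` are hypothetical. It tries 1, 2, 3, ... terms in turn and asks whether the remaining sum can be filled by distinct unit fractions with denominators larger than n.

```python
from fractions import Fraction
from math import ceil, floor


def can_finish(remaining, terms_left, min_denominator):
    """True if `remaining` is a sum of exactly `terms_left` distinct unit
    fractions whose denominators are all >= min_denominator."""
    if terms_left == 0:
        return remaining == 0
    if remaining <= 0:
        return False
    # The next fraction 1/d may not exceed what is left to fill.
    low = max(min_denominator, ceil(1 / remaining))
    # terms_left fractions, each at most 1/d, must still reach `remaining`.
    high = floor(terms_left / remaining)
    for d in range(low, high + 1):
        if can_finish(remaining - Fraction(1, d), terms_left - 1, d + 1):
            return True
    return False


def smallest_count(n, max_terms=8):
    """Smallest number of distinct unit fractions summing to 1 with largest
    term 1/n, or None if more than max_terms terms would be needed."""
    for terms in range(1, max_terms + 1):
        if can_finish(1 - Fraction(1, n), terms - 1, n + 1):
            return terms
    return None
```

Exact rational arithmetic avoids any rounding question. The search is exponential in the number of terms, so it only illustrates the definition for small n and says nothing about the upper bound the problem asks for.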


E 2233. Proposed by E. J. Cockayne, University of Victoria, Canada 

Let A, B, C be three distinct points on the surface of a sphere, not all on the
same great circle. The closed curve $\Gamma$ formed by the minor great circle arcs AB,
BC, CA divides the surface into two unequal areas. Suppose Z is the set of points
which comprise the smaller area including the boundary $\Gamma$. Prove that any
point P of the surface minimizing the sum of the minor great circle arcs PA + PB
+ PC is a point of Z.


E 2234. Proposed by P. M. Gibson, University of Alabama at Huntsville.
Prove or disprove: If $A=(a_{ij})$ is a nonsingular $n \times n$ complex matrix, if also
$A^{-1}=(b_{ij})$, and each $a_{ij}$ and $b_{ij}$ is nonzero, then the matrices of reciprocals
$(a_{ij}^{-1})$ and $(b_{ij}^{-1})$ are singular or nonsingular together. H. Flanders [this MONTHLY,
(1966) 270-272] proved this for $n=3$.


SOLUTIONS OF ELEMENTARY PROBLEMS 
A Number Theory Problem 


E 2142 [1969, 82; 1070]. Proposed by Erwin Just, Bronx (N. Y.) Community 
College 

Let k, b and r be fixed integers. Call an integer n special if each member of
$\{kb^n+i\}$, $i=1, 2, \dots, r$, is composite. Prove that the number of special integers
is infinite.
II. Notes by the proposer. (A) In the original solution, if $kb+i$ does not have
an odd prime divisor, then Fermat's theorem, which is used in the proof, may
not be applied. In this case, however, $kb+i$ is a power of 2 and $kb^n+i$ is also
even, hence composite.

(B) The result may be somewhat generalized. For arbitrary positive integers
$r$ and $s$, let $\{k_j\}$, $\{b_j\}$, $\{d_i\}$ $(i=1, 2, \dots, r)$, $(j=1, 2, \dots, s)$, be sets of fixed
nonzero integers with $b_j \ne 1$. Call an integer $n$ special if each member of
$\{d_i+\sum_{j=1}^{s} k_j b_j^n\}$ $(i=1, 2, \dots, r)$ is composite. The number of special integers
is infinite.

The proof employs the techniques of the original solution. By choosing $p_i$ to
be a prime divisor of $d_i+\sum_{j=1}^{s} k_j b_j$ we see that, for any positive integer $m$, since

$$d_i+\sum_{j=1}^{s} k_j b_j^{1+m(p_i-1)} = \Bigl(d_i+\sum_{j=1}^{s} k_j b_j\Bigr)+\sum_{j=1}^{s} k_j b_j\bigl(b_j^{m(p_i-1)}-1\bigr),$$

it must be the case that $d_i+\sum_{j=1}^{s} k_j b_j^{1+m(p_i-1)}$ is divisible by $p_i$. The required
conclusion is obtained by choosing $n=1+m\prod_{i=1}^{r}(p_i-1)$.


III. Comment by M. J. DeLeon, Florida Atlantic University. The first sen-
tence in parentheses on the second line of the solution needs modification. Re-
place “This is possible for any $i$ unless $b \mid i$.” by “This is possible for any $i$ unless
$(b, i) \ne 1$.” For a counterexample to the original statement consider $k=1$, $b=45$,
$i=1830$.


E 2143 [1969, 82, 413, 825; 1970, 79]. Addendum. Solutions to this problem 
were also contributed by C. S. Karuppan Chatty (India), G. R. Padmanaban 
(India), P. Nagasundaram (India), and H. R. van der Vaart. 


A Basic Digital Inequality 


E 2179 [1969, 690]. Proposed by M. G. Beumer, Technological University, 
Delft, Netherlands 

If $n \ge 0$, $r>1$, $0<a \le r$ ($n$, $r$, $a$ integers), show that $n$ has, in the scale of $r$,
exactly

$$\sum_{k=1}^{\infty}\left\{\left[nr^{-k}+ar^{-1}\right]-\left[nr^{-k}\right]\right\}$$

digits which are $\ge r-a$.


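Solution by Leonard Carlitz, Duke University. Put

$$n=n_1+n_2r+n_3r^2+\cdots \qquad (0 \le n_k<r).$$

Then

$$n_1=n-\left[\frac{n}{r}\right]r, \qquad n_2=\left[\frac{n}{r}\right]-\left[\frac{n}{r^2}\right]r,$$

and generally $n_k=[n/r^{k-1}]-[n/r^k]\,r$ $(k=1, 2, 3, \dots)$. Clearly, for fixed $k$,
$n_k \ge r-a$ if and only if $[n/r^{k-1}]-[n/r^k]\,r \ge r-a$, that is,

$$\left[\frac{n}{r^{k-1}}\right]+a \ge \left(\left[\frac{n}{r^k}\right]+1\right)r.$$

But, if $n=mr^{k-1}+s$, $0 \le s<r^{k-1}$, then

$$\left[\frac{n}{r^k}+\frac{a}{r}\right]=\left[\frac{m+a}{r}+\frac{s}{r^k}\right]=\left[\frac{m+a}{r}\right].$$

Thus $n_k \ge r-a$ if and only if

$$\left[\frac{n}{r^k}+\frac{a}{r}\right] \ge \left[\frac{n}{r^k}\right]+1.$$

On the other hand

$$\left[\frac{n}{r^k}+\frac{a}{r}\right] \le \left[\frac{n}{r^k}\right]+1.$$

Hence

$$\left[\frac{n}{r^k}+\frac{a}{r}\right]-\left[\frac{n}{r^k}\right]=\begin{cases}1 & \text{if } n_k \ge r-a,\\ 0 & \text{otherwise,}\end{cases}$$

and so

$$\sum_{k=1}^{\infty}\left\{\left[\frac{n}{r^k}+\frac{a}{r}\right]-\left[\frac{n}{r^k}\right]\right\}$$

is equal to the number of digits $n_k$ such that $n_k \ge r-a$.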


REMARK. The above argument evidently yields the following more general
result. Let $0<a_k \le r$ $(k=1, 2, 3, \dots)$, and

$$n=n_1+n_2r+n_3r^2+\cdots \qquad (0 \le n_k<r).$$

Then the number of digits $n_k$ such that $n_k \ge r-a_k$ $(k=1, 2, \dots)$ is equal to

$$\sum_{k=1}^{\infty}\left\{\left[\frac{n}{r^k}+\frac{a_k}{r}\right]-\left[\frac{n}{r^k}\right]\right\}.$$
Also solved by Anders Bager (Denmark), W. E. Donovan, Neal Felsinger, M. G. Greening
(Australia), D. C. B. Marsh, Simeon Reich (Israel), E. F. Schmeichel, Charles Wexler, and the
proposer.
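
The digit count of E 2179 is easy to test by machine. The Python sketch below is an illustration added here, not part of the published solution; the function names are hypothetical. It compares a direct count of the digits that are at least r − a with the bracket sum, using the integer form [n/r^k + a/r] = floor((n + a·r^(k−1)) / r^k). It assumes a < r so that the sum has only finitely many nonzero terms; for a = r every digit, leading zeros included, would be counted.

```python
def digits_in_scale(n, r):
    """Digits n_1, n_2, ... of n in the scale of r, least significant first."""
    digits = []
    while n > 0:
        digits.append(n % r)
        n //= r
    return digits


def count_directly(n, r, a):
    """Number of digits of n (scale r) that are >= r - a."""
    return sum(1 for d in digits_in_scale(n, r) if d >= r - a)


def count_by_brackets(n, r, a):
    """Sum over k of [n/r^k + a/r] - [n/r^k], assuming 0 < a < r.

    With a < r the k-th term is zero as soon as r^(k-1) > n, so the loop
    stops there.
    """
    total = 0
    k = 1
    while r ** (k - 1) <= n:
        total += (n + a * r ** (k - 1)) // r ** k - n // r ** k
        k += 1
    return total
```

For instance, with n = 99, r = 10, a = 1 both functions count the two digits equal to 9.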


A Trigonometric Identity 


E 2180 [1969, 691]. Proposed by Norman Schaumberger, Bronx Community 
College, New York 

Show that $\sum_{j=0}^{n-1}(-1)^j\cos^n(\pi j/n)=n/2^{n-1}$.

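Solution by Henry Ricardo, Yeshiva University and Manhattan College. Setting
$\omega=\exp(\pi i/n)$, we have

$$\begin{aligned}
\sum_{j=0}^{n-1}(-1)^j\cos^n(\pi j/n) &= \sum_{j=0}^{n-1}\omega^{nj}\left(\frac{\omega^{j}+\omega^{-j}}{2}\right)^{n} \\
&= \frac{1}{2^n}\sum_{j=0}^{n-1}\omega^{nj}\sum_{k=0}^{n}\binom{n}{k}\omega^{j(n-2k)} \\
&= \frac{1}{2^n}\sum_{k=0}^{n}\binom{n}{k}\sum_{j=0}^{n-1}\omega^{j(2n-2k)}.
\end{aligned}$$

Now $\sum_{j=0}^{n-1}\omega^{j(2n-2k)}=0$ unless $k=0$ or $k=n$, in which cases this sum is equal to $n$.
Thus the original sum becomes

$$\frac{1}{2^n}\left[\binom{n}{0}+\binom{n}{n}\right]n=\frac{n}{2^{n-1}}.$$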


Also solved by Günter Bach (Germany), Anders Bager (Denmark), M. T. Bird, L. Carlitz,
John Cerullo, M. S. Demos, G. C. Dodds, W. E. Donovan, W. O. Egerland, N. Ersec, O. H.
Fraser, R. Garfield, I. J. Good, M. G. Greening (Australia), J. E. Hafstrom, Steve Hartman,
Philip Haverstick, Dennis Henkel, Robert Heller, Sidney Heller, A. C. Hindmarsh, Stephen
Hoffman, John Horvath, W. C. Huffman, C. M. Jensen, I. N. Katz, P. G. Kirmser, Lew Kowarski,
J. R. Kuttler, O. P. Lossers (Netherlands), Beatriz Margolis (Argentina), D. C. B. Marsh,
D. E. Nixon, Tj. Plomp (Netherlands), E. A. Power (Australia), Simeon Reich (Israel), Steve
Rohde, Perry Scheinok, E. F. Schmeichel, J. S. Shipman, C. V. L. Smith, Dragutin Svrtan (Yugo-
slavia), M. W. Wilson, M. R. Wise, Chung-liu Wong, and the proposer.

Several solvers gave an analogous identity involving the sine function. Good proved the
following generalization

$$\sum_{j=0}^{n-1}(-1)^j\cos^n\frac{\pi j}{n}\,\cos\frac{2\pi js}{n}=
\begin{cases}\dfrac{n}{2^n}\dbinom{n}{s} & \text{if } 1 \le s \le n-1,\\[1ex] \dfrac{n}{2^{n-1}} & \text{if } s=0,\end{cases}$$

making use of the fast Fourier transform, the result being equivalent to a note he contributed to
Nature, vol. 222, p. 1302.
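
A quick numerical check of the identity of E 2180 may reassure a reader working through the root-of-unity argument. The Python sketch below is an added illustration, not part of the published solution, and the function name is hypothetical. It evaluates the alternating sum in floating point, so agreement with n/2^(n−1) is only up to rounding error.

```python
import math


def alternating_cosine_sum(n):
    """Floating-point value of sum_{j=0}^{n-1} (-1)^j cos^n(pi*j/n)."""
    return sum((-1) ** j * math.cos(math.pi * j / n) ** n for j in range(n))
```

Comparing `alternating_cosine_sum(n)` with `n / 2 ** (n - 1)` for small n, with a tolerance such as 1e-12, is a reasonable sanity test; it is of course no substitute for the proof.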


Equilateral Triangles on a Given Triangle 


E 2181 [1969, 691]. Proposed by Jack Garfunkel, Forest Hills High School, 
New York 


Given any triangle ABC and a given segment BP on side BC, determine 
(by geometric construction) segments CQ, AT on sides CA and AB respectively, 
so that equilateral triangles erected outwardly on these three segments have 
vertices that are the vertices of an equilateral triangle. 


Solution by S. L. Greitzer, Rutgers–The State University. Let X be the third
vertex of equilateral triangle BPX. Construct equilateral triangle XCC’ and 
angle BAZ =60°. Let C’Z, parallel to CA, meet line AZ, thus determining Z, 
and draw line XZ. Then AZ=AT, and XZ is one side of the desired equilateral 
triangle. 

Proof. Rotate triangle XC’Z clockwise 60° about X. Then XC’ will fall on 
XC and C’Z will take the position CY. So triangle ZXY is equilateral, and
CY = CQ.

Also solved by Bernhard Andersen (Denmark), Lucio Artiaga, C. W. Eliason, Jr., Michael
Goldberg, Norman Miller, A. T. Olson, W. W. Parsons, Tj. Plomp (Netherlands), Edna M. Pratt,
Simeon Reich (Israel), E. F. Schmeichel, A. W. Walker, Mark Yu, and the proposer.


K-Sequences with Small Divisors 


E 2182 [1969, 691]. Proposed by N. P. Salz, Bangkok, Thailand 

Let a K-sequence be a block of K consecutive odd integers, each of which is
divisible by at least one of the $n$ odd primes $p_1=3$, $p_2=5$, $\dots$, $p_{n-1}$, $p_n$. Prove
or disprove: If $p_{n-2}$ divides at most one term of a K-sequence, then $K \le p_{n-1}-1$.


I. Solution by Don Coppersmith, Massachusetts Institute of Technology. The 
statement in general is untrue. The 19 consecutive odd integers starting with 
82370091 are divisible by 3, 13, 5, 3, 7, 11, 3, 5, 23, 3, 19, 7, 3, 17, 13, 3, 11, 5, 3 
respectively. Only one is divisible by 17. Here $n=8$, $K=19=p_{n-1}$.

Also the 32 consecutive odd integers beginning with 5546972216583 are di- 
visible by 3, 5, 23, 3, 13, 17, 3, 7, 11, 3, 19, 5, 3, 29, 7, 3, 5, 13, 3, 11, 31, 3, 17, 
37, 3, 23, 5, 3, 7, 19, 3, 5, respectively. Here only one is divisible by 29; $n=11$,
$K=32=p_{n-1}+1$.
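
Both counterexamples can be checked mechanically. The following Python sketch is an added illustration, not part of Coppersmith's solution; the function names are hypothetical. For a block of consecutive odd integers it reports whether every term has a divisor among the first n odd primes, how many terms are divisible by $p_{n-2}$, and the value of $p_{n-1}$ for comparison with K.

```python
def first_odd_primes(n):
    """Return the first n odd primes 3, 5, 7, ... by trial division."""
    primes = []
    candidate = 3
    while len(primes) < n:
        # An odd composite has an odd prime factor already in the list.
        if all(candidate % p != 0 for p in primes):
            primes.append(candidate)
        candidate += 2
    return primes


def check_k_sequence(start, length, n):
    """Examine `length` consecutive odd integers beginning at odd `start`.

    Returns (covered, hits, p_prev) where
      covered: every term is divisible by one of the first n odd primes,
      hits:    the number of terms divisible by p_{n-2},
      p_prev:  the prime p_{n-1}.
    Requires n >= 3.
    """
    primes = first_odd_primes(n)  # primes[0] is p_1 = 3
    block = [start + 2 * i for i in range(length)]
    covered = all(any(x % p == 0 for p in primes) for x in block)
    hits = sum(1 for x in block if x % primes[n - 3] == 0)
    return covered, hits, primes[n - 2]
```

Calling `check_k_sequence(82370091, 19, 8)` and `check_k_sequence(5546972216583, 32, 11)` reproduces the two cases above: in each block every term is covered, exactly one term is divisible by $p_{n-2}$, and K exceeds $p_{n-1}-1$.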


II. Solution by Alfred Brauer, Wake Forest University. In order to prove the
existence of infinitely many primes in the arithmetic progression $kx+l$ with
$(k, l)=1$, Legendre [1] tried to prove the following statement: Let $p_n$ denote the
$n$th prime. There exist at most $p_{n-1}-1$ consecutive odd integers such that each
of them is divisible by at least one of the primes $p_2, p_3, \dots, p_n$. Dirichlet [2]
was the first to recognize that Legendre’s proof was incorrect. He tried in vain to
find out if the statement itself was correct or not. Hence, he did not use it for his
proof of the theorem of the primes in arithmetic progression. In 1858, the
Academy of Paris proposed for the Grand Prix: Prove or disprove the statement
of Legendre. Five papers [3] were sent in, but none of them solved the problem
completely. The prize was given to A. Dupré [4] who supposedly proved that
the proposition is correct for $p_n=19$ and for $p_n=29$, 31, and 37, but wrong for
$43 \le p_n \le 113$. In 1930 H. Zeitz and I [5] finally proved that the proposition is in-
correct for all primes $p_n \ge 43$. Moreover, we proved the following theorem there:


For every $\epsilon>0$ there exists a number $n_0=n_0(\epsilon)$ such that for all $n>n_0$ there are
sets of $(2-\epsilon)p_n$ consecutive odd integers of which no one is relatively prime to
$P=p_2p_3\cdots p_n$.

We take e=3. Then there exists a set S, of [3p,] such consecutive odd in- 
tegers for ~>m. Therefore the statement of the problem is incorrect for all 
those sufficiently large », for which S, contains only one multiple of prs. 

Assume now that $S_n$ contains at least two odd multiples of $p_{n-2}$. Let $kp_{n-2}$
and $(k+2)p_{n-2}$ be the two smallest of them. Denote the smallest element of $S_n$
by $a$ and the greatest element of $S_n$ which is smaller than $(k+4)p_{n-2}$ by $b$.
First we assume that at least [4-2] elements of S, are less than kpn_s.
Then there are at least [$,_.| elements of S, less than (k-+2)p,_s, and only one 
of these, namely Rpn-e, is divisible by pz». We denote this subset of S, by T. 
Assume now that less than [1,2] elements of S, are less than 2p,-». Then more 
than [4p,-2] elements of S, are greater than (R+2)pn,_2 since otherwise the num- 
ber of elements of S, would be less than 


[2Pn—2] + poe + [$on-2] S [$Pn-2] < [852]. 


This gives a contradiction. Therefore there are at least 


Pn-e + [$bn-2] = [$bn-2] 


elements of S, in the closed interval {2p,2+2, 06} of S, and only one of them is 
divisible by pr_2, namely (R-+2)hn_2. We denote this subset of S, by U. Then 
either T or U has at least [$,-2| elements of which only one is divisible by 
Pn—2- 

In order to prove that the statement of the problem is incorrect for all suffici- 
ently large m we only have to show that 


[Epn—2] = Pn-1- 


It follows from the theorem of Tschebycheff for x=p,2 and 6=% that there 
exists at least one prime between p,_2 and [$,_2| and prz_1is the smallest of them. 
Also solved by D. E. Kuhn and by W. D. Maurer.


Editorial Note. It will be seen that Coppersmith’s second set above provides a counterexample 
to Dupré’s statement for $p_n=37$.


Admissible Lines 


E 2183 [1969, 691]. Proposed by Steven Silverman, University of British 
Columbia 

We say that two points in the plane determine an admissible line if the line 
they determine is vertical, horizontal, has slope 1, or has slope —1. What is the 
maximum number of admissible lines $n$ points can determine?
Solution by Sidney Heller and Larry Padwa, Brookhaven National Laboratory.
Consider the $n$ points to be a graph of $n$ vertices. The maximum degree of each
vertex is 4, one for each direction. Hence, by a fundamental theorem of graph
theory, the sum of the degrees, $4n$, equals twice the number of edges in the
graph. If no three vertices are collinear, then $2n$ is the upper limit for the number
of admissible lines. Call a graph which attains this maximum full.

We know also that there are at most $\frac12 n$ edges in each direction which gives
$2n-2$ admissible lines as the upper limit for $n$ odd.

We can demonstrate that the full graph for any even $n$ greater than or equal
to 12 exists by exhibiting the full graph of order 8 and the full graphs of orders
12, 14, 18. For odd $n$ greater than 12, the maximum of $2n-2$ admissible lines
can be given by adding an isolated vertex to the full graph of even order $n-1$.
Further, the upper limit of $2n-2$ is achieved for all odd $n$ greater than 4.

Let $f(n)$ be the maximum number of admissible lines for $n$ points. $f(2)=1$,
$f(3)=3$, and $f(4)=6$, since the complete graph is the best obtainable, but of
course it is not full. It can be proved that the full graphs of order 6 and 10 do
not exist, but that $f(6)=11$ and $f(10)=19$. (The determination, particularly for
$n=10$, is tedious and lengthy.)

To summarize: except for $n=2, 3, 4, 6, 10$, we have $f(n)=4[\frac12 n]$, and the ex-
ceptional values have been mentioned above.


Also solved by D. M. Bloom. Partial solutions by Don Coppersmith, R.B. Eggleton (Australia), 
R. W. Kehr & E. F. Schmeichel, O. P. Lossers (Netherlands), L. F. Meyers, E. C. Milner, C. C.
Oursler, and Simeon Reich (Israel).

Note. The case n= 14 seemed most troublesome. The full figure can be formed by taking two 
octagons with a common vertex removed. 
All solutions of Advanced Problems should be sent to J. Barlaz, Rutgers–The State Univer-
sity, New Brunswick, NJ 08903. Solutions of Advanced Problems in this issue should be
typed (with double spacing) on separate, signed sheets and should be mailed before July 31, 
1970. Contributors (in the United States) who desire acknowledgment of receipt of their solu- 
tions are asked to enclose self-addressed stamped postcards. 


5727. Proposed by P. R. Halmos, University of Hawaii


A matrix

$$M=\begin{pmatrix} A & B \\ C & D \end{pmatrix}$$

whose entries are matrices has four formal determinants: $AD-BC$, $AD-CB$,
$DA-BC$, $DA-CB$. (In this sense a matrix of size $n$, instead of 2, has $(n!)^{n!}$
formal determinants.) Prove or disprove the following two assertions. (1) If $M$
is invertible, then at least one of its formal determinants is invertible. (2) If all
the formal determinants of $M$ are invertible, then so is $M$.
5728. Proposed by F. N. Fritsch, Lawrence Radiation Laboratory, University
of California, Livermore


Let $X=\{x_1, \dots, x_N\}$ be a finite subset of $E^n$, with $N \le 2n$. Show that there
exist two hyperplanes $\mathcal{H}$ and $\mathcal{K}$ such that: (i) $X \subset \mathcal{H} \cup \mathcal{K}$; (ii) $\mathcal{H}$ and $\mathcal{K}$ are
not parallel.


5729. Proposed by L. F. Kemp, Jr., Polytechnic Institute of Brooklyn 

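Let $\mu \times \nu$ and $\mu' \times \nu'$ be two Cartesian product measures defined on
$(X \times Y, S \times T)$, the Cartesian product of two measure spaces $(X, S)$ and $(Y, T)$. Then

1. $\mu \times \nu \ll \mu' \times \nu'$ if and only if $\mu \ll \mu'$ and $\nu \ll \nu'$.

2. $\mu \times \nu \perp \mu' \times \nu'$ if and only if $\mu \perp \mu'$ or $\nu \perp \nu'$.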


5730. Proposed by K. D. Juhlin, University of Illinois

Let E{ be the real line with the upper limit topology (basis consists of all sets 
of the form (a, b]). Is (0, 1], with the topology induced by the topology on EY 
homeomorphic to Ei (i.e., is E{ homeomorphic to one of its basic open sets?) ? 


5731. Proposed by L. N. Childs, State University of New York at Albany 

(a) Let M, N be finite dimensional vector spaces over a field R. Let S be a
field extension of R and let $M_S$, $N_S$ denote the S-spaces generated by M and N
($M_S=M \otimes_R S$, etc.). Prove the following result:

(*) If $x$ in $M \otimes_R N$ has the property that for some S as above, $x \otimes 1=x_S$ in
$M_S \otimes_S N_S$ has the form $x_S=y \otimes_S z$, $y$ in $M_S$, $z$ in $N_S$, then $x=u \otimes v$, $u$ in M, $v$ in N.

(b) Assume now only that M, N are finitely generated free modules over a
commutative ring with unity R, and let S be an R-algebra which is a finitely
generated projective R-module. Find conditions on R so that (*) is still true.


5732. Proposed by James Chew, University of Akron, Ohio 

Prove or disprove: Let $(X, \cdot, \mathcal{T})$ be a system such that $(X, \cdot)$ is a group and
$(X, \mathcal{T})$ is a topological space such that multiplication is continuous.

If card $(W \cap W^{-1}) \ge 2$ for every open set $W$ containing the identity, then
$(X, \cdot, \mathcal{T})$ is a topological group.


5733. Proposed by M.F. Neuts, Cornell University 

Let X be a nonnegative integer-valued random variable, and suppose that Y
is a random variable satisfying $0 \le Y \le X$. We are interested in the property
$(\mathcal{I})$: Y and $Y-X$ are independent.

(A) If Y takes only integer values, and if the conditional distribution of Y,
given the value of X, is uniform on $\{0, 1, \dots, X\}$, show that $(\mathcal{I})$ holds if and
only if Y and $Y-X$ have geometric distributions with the same parameter.

(B) If Y takes real values and has uniform conditional distribution, given X,
on the interval $0 \le Y \le X$, when does $(\mathcal{I})$ hold?
SOLUTIONS OF ADVANCED PROBLEMS


Coverings for Sets of Measure Zero 


5665 [1969, 423]. Proposed by Peter Ungar, New York University

A student claimed that an arbitrary set of measure zero can, for any $\epsilon>0$, be
covered by a family of intervals $I_1, I_2, \dots$, such that the length of $I_n$ is $<\epsilon/2^n$.
Is this true?


I. Solution by William Beyer and David Kahaner, Los Alamos Scientific
Laboratory. The claim is false. If such a covering were possible then

$$\lim_{\epsilon \to 0}\sum_{n=1}^{\infty}\left(\frac{\epsilon}{2^n}\right)^{p}=\begin{cases}0, & p>0,\\ \infty, & p=0.\end{cases}$$

Thus the Hausdorff dimension of this set is zero. On the other hand there exist
sets of measure zero with positive Hausdorff dimension, e.g., the Cantor set has
Hausdorff dimension log 2/log 3. See Hurewicz and Wallman, Dimension Theory,
p. 102, also Hausdorff, Dimension und äusseres Mass, Math. Ann., 79 (1919) 163.


II. Note by the proposer. Given a set S of measure zero, the convergent series
of positive numbers fall into two classes, depending on whether for every $\epsilon>0$, S
has a covering by a set of intervals $J_1, J_2, \dots$ with the length of $J_n<\epsilon a_n$, or not.
The question arises whether for every set S there exists a convergent series for
which coverings of the above kind exist. Given S, such a series can in fact be
constructed as follows. Let $I_{k1}, I_{k2}, \dots$ be a covering of S by intervals of total
length $<(\frac14)^k$, for $k=1, 2, \dots$. Enumerate this double infinity of intervals in
some manner. For the purpose of the rest of the proof, let $t \times J$ denote the in-
terval with the same center as J, and t times as long as J, where t is a positive
number. Let $J_n$ be $2^k$ times the nth interval in the above enumeration, where k
is the index of the family of intervals to which the nth interval belongs. Let $a_n$
be the length of $J_n$. Then $\sum a_n<1$. Also, for any fixed h, the set of intervals
$2^{-h} \times J_n$ $(n=1, 2, \dots)$ will contain the hth family named above as a subset and
hence cover S.

Extensive work on this kind of problem is found in Sierpinski, Hypothèse du
Continu. He cites a paper by E. Borel in Bull. de la Société Math. de France,
47 (1919); also a surprising result of Besicovitch (Acta Mathematica, 62 (1934))
who disproves Borel’s conjecture that if a set can be covered by a sequence of
intervals of lengths $a_1, a_2, \dots$, where $a_1, a_2, \dots$ is a completely arbitrary se-
quence of positive numbers, then the set must be countable.

Also solved by Michel Bousquet, R. J. Buck, M. A. Ettrick, K. M. Garg, D. A. Herrero, Mark 
Kac, O. P. Lossers (Netherlands), Nicholas Passell, J. T. Rosenbaum, A. C. Segal, and the proposer. 


Extension of Kantorovic’s Inequality 


5666 [1969, 423]. Proposed by Alexandru Lupas, Cluj, Romania 
Let $\Phi$ be a linear space of real valued functions defined on a set X, assume
that $1 \in \Phi$. Let F be a positive linear functional on $\Phi$ such that $F(1)=1$. For
$0<m \le f_i \le M$, $i=1, 2$, prove the inequality

$$(1)\qquad F\!\left(\frac{1}{mf_1+Mf_2}\right)F\!\left(\frac{f_1f_2}{mf_1+Mf_2}\right) \le \frac{1}{4mM}$$

with equality if and only if $f_1=M$, $f_2=m$.


Solution by Hiroko Hidaka, Osaka Kyoiku University, Japan. Our proof runs
along the line of Rennie [this MONTHLY, 70 (1963), 982] in the proof of the
Kantorovic inequality. Since $0<m \le f_i \le M$ $(i=1, 2)$, we have $(f_1-M)(f_2-m) \le 0$.
Since F is positive, linear and $F(1)=1$, we have an analogue of Rennie’s in-
equality:

$$(2)\qquad F\!\left(\frac{f_1f_2}{mf_1+Mf_2}\right)+mM\,F\!\left(\frac{1}{mf_1+Mf_2}\right) \le 1.$$

Now from the fact that the product of two positive numbers does not exceed $\frac14$
if their sum is at most 1, we get the required inequality (1). Obviously, (1) is an extension
of the Kantorovic inequality (put $f_1=f_2$).

Equality in (1) does not imply $f_1=M$ and $f_2=m$ as stated. Counterexample:
In the algebra of all 2-dimensional vectors (with the coordinate-wise multiplica-
tion), suppose $f_1=(m, M)$, $f_2=(M, m)$ and $F((a, b))=b$.
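
The counterexample can be checked with exact arithmetic. The Python sketch below is an added illustration, not part of the published solution. It assumes the inequality has the form $F(1/(mf_1+Mf_2))\,F(f_1f_2/(mf_1+Mf_2)) \le 1/(4mM)$, represents functions as pairs with coordinate-wise operations, and uses the hypothetical names `F` and `product_of_functionals`.

```python
from fractions import Fraction


def F(vector):
    """The positive linear functional F((a, b)) = b of the counterexample."""
    return vector[1]


def product_of_functionals(f1, f2, m, M):
    """F(1/(m f1 + M f2)) * F(f1 f2/(m f1 + M f2)), computed coordinate-wise."""
    denominator = [m * x + M * y for x, y in zip(f1, f2)]
    reciprocal = [Fraction(1) / d for d in denominator]
    quotient = [Fraction(x * y) / d for x, y, d in zip(f1, f2, denominator)]
    return F(reciprocal) * F(quotient)
```

With m = 2 and M = 5, `product_of_functionals((2, 5), (5, 2), 2, 5)` equals `Fraction(1, 40)`, which is 1/(4mM), although neither $f_1$ nor $f_2$ is constant.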


Also solved by D. A. Zave, and by the proposer.


On Riemann Integrability


5667 [1969, 423]. Proposed by Ray Glenn, Asheville-Biltmore College
Prove that a function f which has a finite limit at each point of the closed
interval [a, b] is Riemann integrable on [a, b].


Solution by Joseph Horowits and Alan Shuchat, University of Toledo, Ohio.
If f has a finite limit at each point of [a, b], then $f=g+h$, where g is continuous,
$h=0$ at all points where f is continuous and $\lim_{x \to x_0} h(x)=0$ for all $x_0$. By con-
sidering positive and negative parts separately, we may assume $h \ge 0$.

Let $A_n=\{x : h(x) \ge 1/n\}$. If $A_n$ is infinite, then it has an accumulation point
$x_0$ and $\lim_{x \to x_0} h(x) \ge 1/n$, contradicting $\lim_{x \to x_0} h(x)=0$. Thus each $A_n$ is
finite and the discontinuities of h form a set of measure zero. Riemann integrability follows.


Also solved by D. T. Adams, J. W. Andrushkiw, Linda W. Brinn, L. E. Clarke (England), R. 
A. Christiansen, R. L. Cramer, M. R. Cullen, John Dennis, W. G. Dotson, Jr., M. A. Ettrick, R. S. 
Fishman, R. V. Fuller, D. W. Hadwin, D. R. Horner, Richard Johnsonbaugh, Emmett Keeler, 
J. F. Leetch, O. P. Lossers (Netherlands), R. J. Loy, R. K. Mueller, D. E. Myers, P. A. Nickel, 
Simeon Reich (Israel), A. C. Segal, Jean Spitzer, R. A. Struble, C. Vu-Son, and the proposer. 

Notes. Vu-Son observes that the conclusion remains valid if the hypothesis is only that right 
and left hand limits exist at each point. Leetch points out that the problem also appears as a 
“Quickie” Q 413, MATHEMATICS MAGAZINE, 40 (1967), p. 232. 
“PROBLEMS IN THE THEORY OF RINGS” REVISITED
IRVING KAPLANSKY, University of Chicago 


Introduction. On June 6-8, 1956 the Division of Mathematics of the National 
Academy of Sciences-National Research Council sponsored a conference on 
linear algebras, held at Ram’s Head Inn on Shelter Island, Long Island, New 
York. I ventured to give a talk in which I proposed twelve problems in the 
theory of rings. In due course the proceedings of the conference were published, 
and [31] is the manuscript of my talk. In an introductory paragraph I expressed 
the hope that these problems would help to rekindle interest in the theory of 
rings. Whether or not the problems had anything to do with it, it is a pleasant 
fact that there has since been plenty of activity in ring theory. In particular, six 
of the twelve problems have been solved, and there are partial results on three 
others. Incidentally, when Herstein reviewed [31] he was already able to report 
three problems solved and progress on one more. 

In this paper I propose to revisit the twelve problems and survey their 
current status with reasonable completeness. Where new problems suggest 
themselves, I record them, and I continue the numbering from 13 on. For the 
reader’s convenience, and because [31] may not be too readily accessible, the 
original problems are reproduced. 

I have not at all attempted to review all the major progress in ring theory 
since 1956; I stick rather closely to topics connected with the twelve problems. 
Particularly notable among things I do not touch are the work of Goldie on 
noncommutative Noetherian rings, and that of Cohn on free ideal rings and re- 
lated topics. I hope that some abler pen will provide a comprehensive survey 
before too long. 

I owe Herstein a great debt for his willingness to spend many hours sharing 
with me his thorough knowledge of the twelve problems, and indeed of all of 
ring theory. Such merits as this enterprise possesses are virtually entirely due 
to him. I will venture only to claim full credit for all the boners. Thanks are also 
due to Amitsur, Bergman, Cohn, Connell, Passman, Rosenberg, and Villamayor 
for valuable comments on a draft of the paper. 


PROBLEM 1. Is every right primitive ring left primitive? 


The answer is “no,” in the light of Bergman’s example [8]. The example is 
also presented in [23, pp. 122-5], and is sketched on p. 255 of [26]. 


Irving Kaplansky was Saunders Mac Lane’s first PhD student. He did his undergraduate 
work at Toronto and graduate work at Harvard. He was an instructor for several years at Harvard, 
then in defense work for a year at Columbia. He joined the Chicago faculty after the war, served 
as department chairman 1962-67, and currently holds the George Herbert Mead Distinguished 
Service Professorship. He held a Guggenheim Fellowship in 1948-49, and in 1965-66 he spent a 
leave at Queen Mary College, London. 

He was elected to the National Academy of Sciences in 1966, is a trustee of the American 
Mathematical Society, and received honorary degrees from the University of Waterloo and 
Queen’s University. His book, Commutative Rings (Allyn and Bacon), has just appeared. Editor. 



In [27] Jategaonkar presents further examples which have the additional
property of being principal left ideal domains. 

It may be useful to investigate whether suitable additional hypotheses (e.g. 
chain conditions) imply that right and left primitivity are equivalent. With one 
eye on Jategaonkar’s examples, we ask: 

Problem 13. If a ring R is right primitive and right and left Noetherian, is 
R left primitive? 


PROBLEM 2. Does there exist a simple radical ring? 


That the answer is “yes” was announced by Sasiada in [60]. Full details of 
the argument were furnished by Cohn and Sasiada [10]. See also the account on 
pp. 125-131 of [23]. Sasiada’s example is sketched on pp. 254-5 of [26]. It is 
worth noting that Sasiada’s construction uses power series and consequently 
yields a ring with cardinal number at least that of the continuum. Hamsher 
[23, p. 131] and Rjabuhin [56] independently observed that it is possible to 
drop down to a countable example. Indeed, any countable subset of a simple 
radical ring can be enlarged to a countable simple radical ring. 

Posner [54] encountered simple radical rings in his investigation of noncom- 
mutative valuation rings. Osofsky [50] found them pertinent in her work on 
injective envelopes of cyclic modules. In both cases the hypothetical simple 
radical rings are not known to exist. 

The existence of simple radical rings and the existence of nil algebras which 
are not locally nilpotent (see Problem 3 immediately following) presumably 
enhance the chances for an affirmative answer to the next problem. 

Problem 14. Does there exist a simple nil ring? 

Some properties of simple nil rings (if they exist) are noted by McWorter 
in [47]. As a final comment on Problem 2, I note Herstein’s proof of the non- 
existence of simple radical rings satisfying a polynomial identity. This was men- 
tioned in [31]; full details are given by Susan Montgomery in her thesis [49]. 


PROBLEM 3. Is every algebraic algebra locally finite? (Kurosh's problem.) 


Kurosh proposed the problem in [36]. Golod furnished a negative answer 
in [16], in the form of a finitely generated nonnilpotent nil algebra. The example 
was a byproduct of his remarkable joint work with Shafarevich on the class 
field tower problem [17]. Vinberg [66] added some refinements. Later accounts 
of the work appear in [15], [23, Chapter 8], and [24, pp. 116-121]. 

Golod’s example simultaneously dragged down to defeat the (unbounded) 
Burnside problem for groups. It is also reported that various people have con- 
structed in this way an Engel group that is not locally nilpotent. (For back- 
ground on this problem see [19].) 

In view of the Golod example, the affirmative answer [29] to Kurosh’s prob- 
lem for algebras satisfying a polynomial identity seems to stand as a twenty- 
year-old high water mark of work on the plus side of the ledger. The simplifica- 
tions introduced by Levitzki [44] should be noted, and excellent accounts ap- 
pear in [26, Th. 1, p. 242] and [23, pp. 162-168]. 
Special cases of Kurosh’s problem still invite attention. In a draft of this
paper I asked whether a primitive algebraic algebra is necessarily locally finite. 
In a letter dated July 23, 1969, Amitsur described how to “surround” the Golod 
example by a primitive algebraic algebra which is not locally finite. However, the 
algebra is not itself finitely generated, so the first half of Problem 15 remains 
open. 

Problem 15. Is every finitely generated primitive algebraic algebra finite- 
dimensional? Failing this, is every finitely generated algebraic division algebra 
locally finite? (It is equivalent to ask whether every algebraic division algebra 
is locally finite.) 

The result [30, Th. 10.4] that reduced Kurosh’s problem to the primitive 
and nil cases must now be consigned to oblivion as doubly useless. But when a 
fixed algebra is under consideration, a useful result still survives: If A is an 
algebraic algebra such that every primitive image of A is locally finite and the 
radical of every homomorphic image of A is locally finite, then A is locally 
finite ([44, Th. 7.2] or [26, Th. 3 on p. 244]; these accounts correct an inac-
curacy in [30, Th. 10.3]). Still other conditional theorems are [30, Th. 10.5]
and Corollaries 1 and 2 on p. 409 of [44]. 


PROBLEM 4. Can any ring with a polynomial identity be embedded in a matrix 
ring over a commutative ring? 


The answer is “no.” Indeed a ring with a polynomial identity need not 
satisfy any standard identity. Jacobson [26, p. 260] attributes the first example 
to Drazin, and then (as Herstein did in his review of [31]) gives P. M. Cohn’s 
example of an infinite-dimensional exterior algebra over any field of charac- 
teristic 0. This satisfies [[xy]z]=0 but violates the standard identity of any 
degree. 

On the affirmative side, we have Posner’s theorem that any prime ring 
satisfying a polynomial identity can be embedded in a matrix ring over a field 
([53] or [23, Th. 7.3.2] or [24, Th. 5.6]). This is a sharpening of earlier results
of Amitsur [1]. 

In the first draft of this paper, I asked whether Problem 4 might be salvaged
by demanding that the ring satisfy the standard identity in 2n variables for
some n. This too is false, as Amitsur shows in [5]. He constructs an example of
a ring R which is even a homomorphic image of a subring of n by n matrices
over a commutative ring, yet it cannot be imbedded in m by m matrices over a
commutative ring for any m, however large.


PROBLEM 5. Let $A_n$ denote the n by n total matrix algebra over a field. Does
there exist a polynomial which always takes values in the center of $A_n$ without
being identically 0?


The problem is stated a little carelessly. The polynomial 1 might be offered
as an answer! Or, if this is too bizarre, we could add 1 to, say, the standard
identity.

Herstein showed me (he credits the observation to John Thompson) that
over a finite field one can construct a polynomial in one variable which sends
every n by n matrix to a scalar, and not always the same scalar. This is again
unsatisfactory. Let us revamp Problem 5, and give it a new number to avoid
confusion. While we are at it let us insert the restriction $n \ge 3$, for the problem
is motivated by the polynomial $(xy - yx)^2$ which works so nicely for $n = 2$.
(Strictly speaking, $(xy - yx)^2$ should be linearized to meet the conditions of
Problem 16.)
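
The claim about the squared commutator can be checked by machine in small cases. The following is a minimal sketch, not a proof: it tests all pairs of 2 by 2 matrices with entries in {-1, 0, 1}, and then exhibits one pair of 3 by 3 matrices for which the squared commutator is not a scalar. The helper names (`matmul`, `commutator_squared`, `is_scalar`) are illustrative choices and do not come from the paper.

```python
from itertools import product

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator_squared(x, y):
    """Return (xy - yx)^2."""
    xy, yx = matmul(x, y), matmul(y, x)
    n = len(x)
    c = [[xy[i][j] - yx[i][j] for j in range(n)] for i in range(n)]
    return matmul(c, c)

def is_scalar(m):
    """True if m is a scalar multiple of the identity matrix."""
    n = len(m)
    return all(m[i][j] == (m[0][0] if i == j else 0)
               for i in range(n) for j in range(n))

# Every pair of 2 by 2 matrices with entries in {-1, 0, 1}.
mats2 = [[[a, b], [c, d]] for a, b, c, d in product((-1, 0, 1), repeat=4)]
all_central_2x2 = all(is_scalar(commutator_squared(x, y))
                      for x in mats2 for y in mats2)

# Matrix units e12 and e21: the squared commutator is the identity for n = 2,
# so the polynomial is not identically 0 there.
x2, y2 = [[0, 1], [0, 0]], [[0, 0], [1, 0]]

# The same matrix units inside 3 by 3 matrices give diag(1, 1, 0).
x3 = [[0, 1, 0], [0, 0, 0], [0, 0, 0]]
y3 = [[0, 0, 0], [1, 0, 0], [0, 0, 0]]
```

For n = 2 the general reason is that xy - yx has trace 0, so by the Cayley-Hamilton theorem its square is a scalar matrix. The 3 by 3 pair shows why the same polynomial cannot serve for larger n.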

Problem 16. Let $A_n$ denote the n by n total matrix algebra over a field,
$n \ge 3$. Does there exist a homogeneous multilinear polynomial (of positive
degree) which always takes values in the center of $A_n$ without being identically 0?

For the problems on group rings I shall for simplicity stick to group alge- 
bras over a field, although the literature contains numerous results concerning 
group rings over a ring. 

The notation is that G is a group, F a field, and A(G) the group algebra of 
G over F. In this I follow [31], but I acknowledge that it might be better to 
indicate the dependence on F. 


PROBLEM 6. If G has no elements of finite order, does A(G) have no divisors of 0? 


There has been very little progress on Problem 6. It was noted in [31] that 
the answer is affirmative if G can be (linearly) ordered, and therefore also if G 
is abelian (for it is easy to prove that a torsion-free abelian group can be or- 
dered). In [7], Banaschewski showed how to bypass the introduction of an 
ordering, and also extended the investigation to semi-groups. Lagrange and 
Rhemtulla [39] noted that a one-sided ordering of G suffices (we say, for
instance, that G is right-ordered if a<b implies ac<bc). In his investigation of
right-ordered groups, Conrad [13, p. 274] observed that a suitable semi-direct 
product of two infinite cyclic groups can be right-ordered but not ordered. It is 
historically interesting that the same group appeared in Levi’s paper [43], and 
was presumably the first published example of a torsion-free group that cannot 
be ordered. Bovdi [9] answered Problem 6 affirmatively if G satisfies a certain
kind of transfinite solvability. 
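
The torsion-free hypothesis in Problem 6 cannot be dropped: if g has finite order n > 1, then (1 - g)(1 + g + ... + g^(n-1)) = 1 - g^n = 0 in A(G). A minimal sketch of this standard computation for a cyclic group of order 4 with integer coefficients follows. The list representation (entry i is the coefficient of g^i) and the name `multiply` are illustrative assumptions, not notation from the paper.

```python
def multiply(x, y):
    """Product in the group ring of a cyclic group of order len(x).

    x[i] is the coefficient of g**i; exponents are added modulo the order.
    """
    n = len(x)
    z = [0] * n
    for i, a in enumerate(x):
        for j, b in enumerate(y):
            z[(i + j) % n] += a * b
    return z

n = 4
one_minus_g = [1, -1, 0, 0]    # 1 - g
sum_of_powers = [1] * n        # 1 + g + g^2 + g^3
zero_product = multiply(one_minus_g, sum_of_powers)
```

Both factors are non-zero while `zero_product` is the zero element, so any group with a non-trivial element of finite order has zero divisors in its group algebra.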

I learned from George Bergman about a list of 101 problems on rings and
modules put out in connection with a symposium at Kishinev on September 3-6, 
1968. Problem V.2 of this list asks the following: if G is torsion-free, are scalar 
multiples of group elements the only units in A(G)? Most of the remarks made 
concerning Problem 6 also apply to this question. 

We can ask what group-theoretic property of G corresponds to other
ring-theoretic properties of A(G). Quite a number of such results are assembled in
[12]. Probably the most striking is THEOREM 8: A(G) is prime if and only if G
has no finite normal subgroups $\neq 1$. A charming companion theorem is Passman’s
THEOREM III in [51]: A(G) has a non-zero nilpotent ideal if and only if the
characteristic of F is $p \neq 0$ and G has a finite normal subgroup with order divisible by p.
Lambek [40, pp. 151-165] has a discussion of group algebras which includes
these two results.

Perhaps the following question deserves attention:

Problem 17: When is A(G) primitive?


PROBLEM 7. When is it true that A(G) is semi-simple (in Jacobson's sense)?
Is it always true if F has characteristic 0?


First suppose that F has characteristic 0. At the time of [31] the only 
available result was the proof by Hilbert space methods (due to Rickart) of 
semi-simplicity over the fields of real and complex numbers. The key idea for 
new progress was the study by Amitsur in [3] of the behavior of the radical of 
general algebras (not necessarily group algebras) under field extensions. 

First, under a separable algebraic extension the radical stays put. (As pointed 
out by Rosenberg in his review of [3], this result is implicit in Theorems V. 8.1 
and V. 14.1 of [26]. In the revised edition of [26], the result appears as Theorem 
1 on p. 252.) Second, in a transcendental extension a radical can only arise from 
a suitable nil ideal of the original algebra. In [3] these ideas are combined with 
the Hilbert space result and methods from [2] to get the semi-simplicity of 
A(G) in case F has infinite transcendence degree over the rationals. In particular,
there is victory for uncountable F, a result independently proved by Herstein 
(unpublished). 

In [4] Amitsur made a drastic advance and simplification. He first (Lemma 
1) deftly proved that for F the field of rational numbers A(G) has no nonzero 
nil ideals; this is the entering wedge for eliminating the Hilbert space methods. 
It then followed quickly that A(G) is semi-simple for any F not algebraic over 
the rational numbers, and that if semi-simplicity of A(G) holds over the rational 
numbers then it holds for any field of characteristic 0. For later accounts of this 
work see [22, pp. 31-46] and [26, pp. 251-254]. 

It is tantalizing that the subject has not budged since then; the full proof 
of semi-simplicity seems so near and yet may be so far away. 

The literature contains a number of theorems proving semi-simplicity for 
special classes of groups, e.g., [4], [9], [12], [18], [57], [61], [64]. I shall not 
attempt to survey these results. 

We move to the case where F has characteristic p. The situation is highly 
unsatisfactory in that nobody has proposed a plausible conjecture as to what is 
needed to make A(G) semi-simple. In the first draft of this paper I naively asked 
whether the absence of finite normal subgroups with order divisible by p might 
be sufficient for semi-simplicity. (As noted above, this condition is necessary 
and sufficient for the absence of nilpotent ideals.) D. S. Passman and A. Chalabi 
independently showed me that this is false. 

Passman [51] proved that if G has no elements of order p, then A(G) has no 
nil ideals and in Theorems IV and VI he went on to corollaries analogous to 
those obtained by Amitsur for characteristic 0. We can make a bolder conjecture. 

Problem 18: If G has no elements of order p (where p is the characteristic
of F) is A(G) semi-simple?

If Problem 18 turns out to be false, we can try a stronger hypothesis.
Problem 19 is proposed as the second question on page 28 of [23].

Problem 19: If G is torsion-free is A(G) semi-simple?

We summarize the situation in characteristic p, as regards semi-simplicity: 
absence of finite normal subgroups with order divisible by p is necessary, but is 
not sufficient; absence of elements of order p might be sufficient, but is not at all 
necessary (cf. [52]).


PROBLEM 8. It is known that if G is locally finite and has no elements of order p
in the case of characteristic p, then A(G) is regular in the sense of von Neumann.
Is the converse true?


With the weakened conclusion that G is torsion, this was proved by M. 
Auslander [6] by homological methods, by McLaughlin [46] by elementary 
methods, and by Villamayor [63] again by homological methods. (The first two 
of these were already reported in [31].) Then Villamayor proved the full result 
in [65]; he did this by homological methods and on page 948 he also gave a 
very simple elementary proof. 

The fourth problem on page 28 of [23] is pertinent. 

Problem 20 (Herstein): If A(G) is algebraic over F is G locally finite? 

Problem 20 is very easy in characteristic 0, cf. this MONTHLY, problem 4934, 
67(1960), p. 927; solution 68(1961), p. 1015. 

An affirmative answer to the next problem would unify Problems 8 and 20. 

Problem 21 (Herstein): Suppose that every element in A(G) is either right 
invertible or a left zero-divisor. Does it follow that G is locally finite? 


PROBLEM 9. It is known that if the characteristic is p and G is a locally finite
p-group then A(G) is unit element plus a nil algebra. Is the converse true?


It is neater to state this in terms of the augmentation ideal $\Delta$ ($\Delta$ is the set of
all $\sum \lambda_i g_i$, $\lambda_i \in F$, $g_i \in G$, with $\sum \lambda_i = 0$). $\Delta$ is a two sided ideal of codimension
one in A(G). Our problem is: if $\Delta$ is nil, prove that G is a locally finite p-group.
It is easy to see that G is at any rate a p-group. If we strengthen the hypothesis
on $\Delta$ to the assumption that $\Delta$ is locally nilpotent, then it can be proved that G
is locally finite as well. See [45] for these results. But Problem 9 as stated 
remains open. 
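
The known direction of Problem 9 can be seen concretely in the smallest case: for a cyclic group of prime order p over the field of p elements, every element of the augmentation ideal has p-th power 0. The sketch below checks this by brute force for p = 3. The function names and the list representation (entry i is the coefficient of g^i, reduced mod p) are illustrative assumptions. It says nothing about the open converse.

```python
from itertools import product

def multiply(x, y, p):
    """Product in the group algebra of a cyclic group of order p over F_p."""
    z = [0] * p
    for i, a in enumerate(x):
        for j, b in enumerate(y):
            z[(i + j) % p] = (z[(i + j) % p] + a * b) % p
    return z

def power(x, k, p):
    """k-th power of x, starting from the unit element."""
    result = [1] + [0] * (p - 1)
    for _ in range(k):
        result = multiply(result, x, p)
    return result

def augmentation_ideal(p):
    """All elements whose coefficients sum to 0 in F_p."""
    return [list(x) for x in product(range(p), repeat=p) if sum(x) % p == 0]

p = 3
ideal = augmentation_ideal(p)
all_nil = all(power(x, p, p) == [0] * p for x in ideal)
unit_cubed = power([1, 0, 0], p, p)   # the unit element is not nilpotent
```

The reason behind the check is that in characteristic p the p-th power of a sum of commuting elements is the sum of the p-th powers, and each group element has p-th power 1, so the p-th power of an element is its coefficient sum times the unit element.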

Problems 9 and 20 suggest the conjecture that Kurosh’s problem, although 
false in general, is true in suitable circumstances inside a group algebra. We 
might as well go the whole hog. 

Problem 22: Is every algebraic subalgebra of a group algebra locally finite? 

If Problem 22 turns out to be too daring, we can retreat to (one-sided or 
two-sided) ideals, or to nil subalgebras, or to both. 

I shall record the two problems on group algebras which are posed on 
pages 122-123 of [33]. 

Problem 23: In a group algebra are one-sided inverses two-sided? 

Hilbert space methods have proved this to be true in characteristic 0; see 
[48]. But the characteristic p case remains a mystery. 

Problem 24: Let u be an idempotent in a group algebra over a field of
characteristic 0, $u \neq 0, 1$. Is the coefficient of the unit element in u rational?

It is known that the coefficient in question is an algebraic number such that 
it and all its conjugates are real and lie between 0 and 1. 
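
For finite groups rational examples are easy to write down. The following sketch works in the rational group algebra of a cyclic group of order 6 and checks two familiar idempotents: the average of all group elements and (1/2)(1 + g^3). The representation (entry i is the coefficient of g^i) and the variable names are illustrative assumptions. The example only illustrates the statement of Problem 24 and has no bearing on the general question.

```python
from fractions import Fraction

def multiply(x, y):
    """Product in the rational group algebra of a cyclic group of order len(x)."""
    n = len(x)
    z = [Fraction(0)] * n
    for i, a in enumerate(x):
        for j, b in enumerate(y):
            z[(i + j) % n] += a * b
    return z

n = 6
# (1/6)(1 + g + ... + g^5): the average over the whole group.
averaging = [Fraction(1, n)] * n
# (1/2)(1 + g^3): the average over the subgroup of order 2.
half = [Fraction(1, 2), 0, 0, Fraction(1, 2), 0, 0]

averaging_is_idempotent = multiply(averaging, averaging) == averaging
half_is_idempotent = multiply(half, half) == half
unit_coefficients = (averaging[0], half[0])   # coefficients of the unit element
```

In both cases the coefficient of the unit element is a rational number strictly between 0 and 1, in agreement with the bounds quoted above.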

Let me record a theorem which is easily proved by the methods of [48]: in 
the group ring of any group over the integers the only idempotents are 0 and 1. 
For finite groups this is an old result. Generalizing a result of Swan [62, p. 571],
Coleman [11] extended the finite case to group rings over suitable integral 
domains. 

For several interesting results on idempotents in group algebras see 
Rudin [59]. 


PROBLEM 10. Let A be a simple associative ring which is an n by n matrix ring,
$n \ge 3$. Let Z be the center of A, G the multiplicative group of regular elements of A, G’
the commutator subgroup of G. Is $G'/(G' \cap Z)$ simple?


(In [31] the first G erroneously is G’.) 

One is tempted to call the Artinian case “classical,” although the full proof 
by Dieudonné [14] is as recent as 1943. In [31] it was noted that Problem 10 
had been settled affirmatively in only two further cases: (1) A =the ring of all 
linear transformations on an infinite-dimensional vector space over a division 
ring, modulo its unique maximal two-sided ideal. This result is due to Rosenberg 
[58]. (2) A = a factor of type $\mathrm{II}_1$. This requires improving Kadison’s topological
simplicity [28] to algebraic simplicity. A sketch of the proof has now appeared 
in Appendix IV of [32]. 

The problem had not budged an inch until the thesis of Lanski [41]. Lanski 
has answered Problem 10 affirmatively for certain classes of simple rings. His 
results cover the case where A is algebraic with an infinite center, and also in- 
clude Rosenberg’s theorem. Since completing his thesis, Lanski has extended 
his work so as to cover the factor of type $\mathrm{II}_1$ result as well.


PROBLEM 11. Köthe has determined the structure of algebras satisfying: (1)
algebraic over an algebraically closed field, (2) simple with unit, (3) of countable
dimension, (4) locally simple (i.e., any finite subset can be embedded in a
finite-dimensional simple subalgebra with the same unit). Can a structure theory be
developed if (3) or (4) is dropped or weakened?


The credit solely to Köthe was a slip. What was being quoted was pages
256-257 of Kurosh’s paper [37]. Köthe [35] had shown (and Kurosh reproved)
that such an algebra is an infinite tensor product of total matrix algebras. These 
can be further decomposed to be of prime size. Then Kurosh proved the number 
of factors for each p to be invariant (this uniqueness proof does not need 
countability). So the structure referred to in Problem 11 is this: a cardinal num- 
ber for every prime. 

Of course the problem is vaguely stated. But it seems to be fair to say that 
there has been no progress (presumably because no one has thought about it). 

A reasonable weakening of (4) in Problem 11 is to assume the algebra to be
approximable by finite-dimensional semi-simple subalgebras. For an example
of what can happen, see [38]. 


PROBLEM 12. If D is a division ring, can D = [DD]? (Here [DD] is the set of
all sums of additive commutators $xy - yx$.)


Harris [21] constructed such a division ring. It is noted in [31], with full 
details in [21], that as a consequence the theorem of Wedderburn in [67] does 
not extend to Artinian rings. (The hoped for generalization reads as follows: if 
every element in a ring R is a sum of nilpotent elements, is R nil?)

Lazerson [42] constructed a division ring with the stronger property that 
every element is a commutator (and not merely a sum of commutators). In fact 
in his example one of the elements forming the commutator can be fixed. In 
other words, Lazerson’s division ring admits a derivation which is onto. 

It is claimed in [34, p. 4] that Artinian rings are after all not the natural 
generalization of finite-dimensional algebras over a field, and that this role 
ought to be taken by rings which are finitely generated modules over a commu- 
tative ring. In this version Wedderburn’s Theorem does indeed generalize, as 
was proved independently by Gruenberg [20, p. 295] and Herstein and Small 
[25]. Gruenberg’s assumption that the ring is Noetherian can be bypassed by 
the device of dropping down to a subring finitely generated over the integers. 
Herstein and Small relied on Posner’s generalization [53] of Wedderburn’s 
theorem to rings satisfying a polynomial identity. Procesi [55] has generalized 
Posner’s result by treating the case where “nilpotent” is replaced by “algebraic.”


THE CHAUVENET SYMPOSIUM
J. C. ABBOTT, U.S. Naval Academy, Annapolis, Maryland 


On October seventeenth and eighteenth, 1969, the United States Naval
Academy¹ and the Mathematical Association of America² sponsored a
symposium on “The State of Mathematics Today.” The Symposium followed the
dedication ceremonies for the Naval Academy’s new mathematics building, 
Chauvenet Hall. 

The Association was represented at the dedication by its President, Pro- 
fessor Gail Young, who spoke on William Chauvenet and the Chauvenet Prize. 
He pointed out that Professor Chauvenet was not only a leading mathematician 
of his day, but was also one of the founders of the Naval Academy and its first 
head of the Mathematics Department. In recognition of his devotion to the 
highest ideals of good exposition and teaching, the Association established the 
Chauvenet Prize in 1925 for outstanding expository papers in mathematics in 
English. Since then seventeen awards have been made, to mathematicians
whose names have become household words in the mathematical community.
Six of these Chauvenet Prize winners participated in the Symposium. The
MONTHLY will publish the written versions of the talks given at the Symposium.
The first two are included in this issue; three others will appear in the
June-July issue; the sixth by Professor Leon Henkin, which had to be postponed due
to illness in his family, will appear later.

1 The Office of Naval Research supported the Symposium under contract #043-392.
2 The Symposium replaced the regular Fall Meeting of the Maryland-District of Columbia-Virginia Section of the Association.

We have felt that one of the major problems which faces mathematics today 
is the problem of proliferation. Mathematics is expanding so rapidly that it is 
no longer possible for any one individual to follow the great traditions of the 
nineteenth century and encompass any but a small corner in one of the many 
vast fields of mathematics. Great names such as Euler, Cauchy, Hilbert, et al.,
who were able to make significant contributions in all areas of mathematics, are 
much scarcer today. Therefore the objective of this Symposium was to bring 
together a number of mathematicians from widely diverse areas who are re- 
nowned, not only for their research contributions, but also for their ability to 
communicate their achievements to others not in their special fields. In this way 
we hoped to make available to the general mathematical public an understand- 
ing of the role and importance of the various parts of mathematics and thereby 
continue the expository tradition set by Professor Chauvenet that “Expository 
mathematics is good mathematics and good mathematics lends itself to good 
exposition.” 

The first paper in this issue is by Professor Paul Halmos of the University of 
Indiana on “Finite-Dimensional Hilbert Spaces.” Like many of this country’s 
most famous mathematicians, Professor Halmos was a product of central Europe 
having been born in Budapest, Hungary. He came to this country in 1933 and 
received most of his professional training at the University of Illinois. He had 
a varied career beginning at Syracuse and including many years at Chicago and 
Michigan with a recent tour at the University of Hawaii. His well-known texts 
on Measure Theory, Finite Dimensional Vector Spaces, Naive Set Theory, 
Boolean Algebra and Algebraic Logic, show the breadth of his interest and have 
been a constant source of pleasure to many. He has a personal style that few 
mathematicians achieve and he has always been devoted to excellence in exposi- 
tion. He received the Chauvenet Prize in 1947 for his paper on “The Foundations 
of Probability.” In his present paper he returns to some unsolved problems in 
finite dimensional Hilbert spaces using some techniques of general operator 
theory and lattice theory. 

Professor Guido Weiss is the youngest of the Symposium speakers, having 
received his doctorate from Chicago in 1956. He was born in Trieste, Italy in 
1928, and spent some years at DePaul before going to the University of Wash- 
ington at St. Louis in 1961. His main interests have always been in the area of 
harmonic analysis and harmonic functions. He received the Chauvenet Prize in 
1967 for his paper on “Harmonic Analysis.” In the present paper he examines 
some methods of complex variables in harmonic analysis. 

The first paper of the Symposium was to have been by Professor Gordon T.
Whyburn of the University of Virginia. Unfortunately, just before the
Symposium the mathematical community was shocked to hear of Professor Whyburn’s
sudden death. However, following the best traditions of all great teachers, 
Professor Whyburn had so well prepared his talk that it was nearly completed 
at the time of his death. Consequently his colleague, Professor Edwin Floyd, 
also of the University of Virginia, was able to summarize it as a memorial- 
tribute to Professor Whyburn at the Symposium. Professor Floyd was also able 
to complete the original Whyburn manuscript so that it will appear as the first 
article in the June-July issue of this MONTHLY. The article is particularly appro- 
priate since it summarizes Professor Whyburn’s work of the last fifteen years 
on analytic topology. 

Professor Whyburn was a product of R. L. Moore at Texas and followed 
the traditions of this school by developing his own school at the University 
of Virginia. It has produced some of the world’s great topologists, including 
Floyd himself, J. L. Kelley, R. H. Bing, and Victor Klee amongst many others. 
His ability as an expositor was recognized early in 1938 when he won the 
Chauvenet Prize for his paper “On Structure of Continua.” He published con- 
tinuously until his death, advancing the field of analytic topology to its present 
highly developed state. 

The second paper to appear in the June-July issue will be by Professor 
Saunders MacLane of the University of Chicago. Professor MacLane received 
his advanced training at Yale and Göttingen and spent his early teaching years
at Harvard before transferring to Chicago in 1947. His early interests were 
classical algebra and the extension problem for groups. This led naturally into 
his work with Eilenberg in the development of homological algebras, which
eventually grew into category theory, one of the fastest growing branches of 
modern mathematics. He won the Chauvenet Prize in 1941 for his papers on 
“Modular Fields” and “Some Recent Advances in Algebra.” But despite a life- 
time in algebra, Professor MacLane has always been proud of a wide interest in 
all fields of mathematics. Category theory spans many facets. His present paper 
entitled “Hamiltonian Mechanics” testifies to his versatility in bringing new 
methods to bear on classical problems. 

Of all the Chauvenet Prize winners Professor Mark Kac, of Rockefeller 
University, is the only one who has been awarded the prize twice. He won the 
Prize first in 1950 for his paper on “Random Walk and the Theory of Brownian 
Motion” and again in 1968 for his paper “Can One Hear the Shape of a Drum?” 
Professor Kac was also of European origin, having been born in Poland and 
educated at the John Casimir University where he received his doctorate in 
1937. His publications range widely throughout the fields of analysis, probabil- 
ity theory, statistics and number theory. His present paper applies recent 
developments in analysis to probability theory. 


All six of the Chauvenet Memorial Lectures will be available bound separately for library and 
personal use next fall after the appearance of the Henkin lecture. At that time, copies can be ordered 
from Professor J. C. Abbott, U. S. Naval Academy, Mathematics Department, Annapolis, MD 
21402. 


FINITE-DIMENSIONAL HILBERT SPACES 
P. R. HALMOS, Indiana University 


Prologue. I used to like linear algebra because it gave me a motivation for
the study of operators on Hilbert space and because it gave me insight into the 
algebraic skeleton of operator theory, which made that study easier. Now I like 
what I learned about Hilbert space because it keeps shedding light on more and 
more new aspects of linear algebra and because it succeeds in keeping that clas- 
sical subject alive and exciting. The purpose of this report is to illustrate the 
latter point by describing three non-trivial parts of finite-dimensional linear 
algebra, the original impetus for which came from operator theory on infinite- 
dimensional Hilbert space. The subjects are (1) an algebraic characterization of 
pairs of subspaces of a finite-dimensional Hilbert space, (2) a geometric charac- 
terization of linear transformations in terms of rotations and projections (dila- 
tion theory), and (3) a statement of some fragmentary results and challenging 
open problems about lattices of invariant subspaces. 

A finite-dimensional Hilbert space is, by definition, a finite-dimensional 
unitary space (complex inner product space). The only prerequisite for an in- 
telligent reading of this paper is acquaintance with the language, notation, and 
principal facts of finite-dimensional unitary geometry; see, for instance, [6]. As 
for the insistence on complex numbers: the geometric language of Hilbert space 
is motivated by the real case, but the algebraic hurdles are most easily over- 
come if complex numbers are allowed. The customary way out (followed here) 
is to use complex coefficients, and, at the same time, continue to use real lan- 
guage; this does not seem to lead to serious or permanent confusion. 


Two subspaces. What are all the different ways in which a subspace can be 
placed in a finite-dimensional Hilbert space? The question is vague, but it has 
a reasonably definite answer. If $H = C^n$ (where C is the complex number field),
then one way to get an r-dimensional subspace of H ($0 \le r \le n$) is to form the set
of all those vectors whose last $n - r$ coordinates vanish. More to the point is that
to within isomorphism this is the only way; every r-dimensional subspace of $C^n$
can be obtained from this one by a suitable rotation. 

What are all the different ways in which two subspaces, or three, or any
number can be placed in a finite-dimensional Hilbert space? The difficulty of the
answer seems to increase with the number. The preceding paragraph shows that
there is no difficulty about putting one subspace M into a space H; just put it
down, anywhere, let $M^\perp$ fall where it may, and there is nothing left to ask. Before
the position of two subspaces M and N in H can be said to be known, many
questions must be asked and answered. Is M included in N? Is M orthogonal
to N? Does $M^\perp$ have a nontrivial intersection with $N^\perp$? If the relation between
M and N is not describable in the simple terms of inclusion and orthogonality,
does it make sense to ask for the “angle” between them? Such questions were
first raised by Dixmier [3]; the point of view described below is somewhat
different and more recent [11]. As for three subspaces, or more, the mind boggles.
There is, in fact, reason to believe that the problem of three subspaces will be out
of human reach for a long time to come. A comment of Chandler Davis [2]
indicates that if we knew all about three subspaces, then we could learn more
about unitary equivalence than, apparently, we are meant to know.

In the study of pairs of subspaces there are four thoroughly uninteresting 
cases, the ones in which both M and N are either 0 or H. In the most general
case the entire space is the direct sum of five subspaces:


$M \cap N$, $M \cap N^\perp$, $M^\perp \cap N$, $M^\perp \cap N^\perp$,


and the rest. The parts of M and N in the first four are “thoroughly uninterest- 
ing”. In “the rest”, the orthogonal complement of the span of the first four, 
M and N are in generic position, in the sense that all four of the special intersec- 
tions listed above are equal to 0. 

The simplest example of two subspaces in generic position consists of two 
distinct non-orthogonal lines in a plane, and there is no loss of generality in taking 
one of them to be the first coordinate axis. To get a useful generalization,
suppose that T is a non-singular linear transformation on a finite-dimensional
Hilbert space K, write $H = K \oplus K$, let M be the “horizontal axis” consisting of
all vectors of the form (f, 0) in H, and let N be the graph of T, i.e., the set of
all vectors of the form (f, Tf) in H. The assertion that M and N are in generic
position needs a little proof. The first step is to show that $M \cap N = 0$. Indeed,
how can an (f, 0) be equal to a (g, Tg)? Answer: only if Tg=0, whence g=0
(because T is non-singular), and therefore f=0. For the rest of the proof it is
necessary to know $M^\perp$ (trivial: all (0, f)) and $N^\perp$ (easy and standard
computation: all $(-T^{*}f, f)$). From this it is easy to deduce that $M^\perp \cap N^\perp = 0$: since $T^{*}$ is
just as non-singular as T, the proof just given applies again. The equations
$M \cap N^\perp = 0$ and $M^\perp \cap N = 0$ are trivial.
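
The "easy and standard computation" of the orthogonal complement of the graph can be written out in one line. The following is a sketch, writing $(\cdot\,,\cdot)$ for the inner product on K and taking the inner product on $H = K \oplus K$ to be the sum of the componentwise inner products (the usual convention, assumed here):

```latex
\[
\bigl((g, Tg),\,(-T^{*}f, f)\bigr)
  = (g, -T^{*}f) + (Tg, f)
  = -(Tg, f) + (Tg, f)
  = 0 \qquad (f, g \in K).
\]
```

Thus every vector $(-T^{*}f, f)$ is orthogonal to N. These vectors form a subspace of dimension dim K, which is also the dimension of $N^\perp$ (since dim N = dim K and dim H = 2 dim K), so they make up all of $N^\perp$.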

The basic result in the theory of two subspaces is that this way of
constructing pairs of subspaces in generic position is, to within unitary equivalence, the
only way. More precisely: if M and N are subspaces in generic position in a
finite-dimensional Hilbert space H, then there exists a finite-dimensional Hilbert
space K, and there exists a non-singular linear transformation T on K, such that
the pair (M, N) is unitarily equivalent to the pair ($K \oplus 0$, graph T).

What follows is an outline of the proof; with suitable analytic caution the
proof is generalizable to the infinite-dimensional case. Let P be the projection
with range M. Assertion: the restriction of P to N is a non-singular linear
transformation from N onto M. Suppose, indeed, that Pg=0 for some g in N. It
follows that $g \in M^\perp \cap N$, and hence (generic position) that g=0; this proves that
the kernel of P in N is 0. To prove that the image PN is equal to M, suppose
that $f \in M$ and $f \perp PN$. This means that if $g \in N$, then $0 = (f, Pg) = (Pf, g) = (f, g)$,
so that $f \in M \cap N^\perp$. It follows (generic position) that f=0; the proof of the
assertion is complete.

The existence of a non-singular linear transformation from N onto M implies
that M and N have the same dimension. (This could have been proved more
quickly, but the slower approach is needed for the rest of the proof anyway.)
Since M and $M^\perp$ on the one hand and N and $N^\perp$ on the other hand enter the
hypotheses with perfect symmetry, it follows that all four of these subspaces
have the same dimension. Since this applies to M and $M^\perp$ in particular, there
exists an isometric linear mapping from M onto $M^\perp$; the idea from now on is to
identify each element of $M^\perp$ with the element of M that it corresponds to.

Now put K = M. To define T at an element f of K, recall first that f = Pg for
a uniquely determined vector g in N, project g into $M^\perp$, and let Tf be the
element of M that is identified with the element of $M^\perp$ so obtained. (A simple
2-dimensional picture should make that long-winded sentence crystal clear.) 
The verification that K and T do what is expected of them is straightforward. 

Given a line in the plane, distinct from both the horizontal and the vertical 
axes, rotate the plane through the negative of half the angle of inclination. The 
given line and the horizontal axis become, after the rotation, a line and its re- 
flection through the (new) horizontal axis. This half-angle rotation can be 
generalized to yield a different and useful representation for a pair of subspaces 
in generic position; the result is that any such pair is unitarily equivalent to a 
pair of the form (graph $T_0$, graph $(-T_0)$), for a suitable linear transformation
$T_0$. From this representation, in turn, it is easy to recapture Dixmier’s main
theorem on pairs of subspaces in generic position: the result is that a single 
Hermitian transformation, namely the sum of the two projections whose ranges 
are the given subspaces, constitutes a complete set of unitary invariants for 
the pair. 

Unitary dilations. Suppose that H is a subspace of a finite-dimensional 
Hilbert space K, and let P be the projection from K onto H. Each linear trans- 
formation B on K induces in a natural way a linear transformation A on H 
defined for each f in H by 


Af = PBf. 


Under these conditions the transformation A is called the compression of B to 
H, and B is called a dilation of A to K. This geometric definition of compression 
and dilation is to be contrasted with the customary concepts of restriction and 
extension: if it happens that H is invariant under B, then it is not necessary 
to project Bf back into H (it is already there), and, in that case, A is the restric- 
tion of B to H, and B is an extension of A to K. Restriction-extension is a special 
case of compression-dilation, the special case in which the linear transformation 
on the larger space leaves the smaller space invariant. 

Compressions and dilations can be usefully described in terms of matrices. 
If K is decomposed into $H$ and $H^\perp$, and, correspondingly, transformations on 
K are written in terms of matrices (whose entries are transformations on $H$, and 
on $H^\perp$, and between the two), then a necessary and sufficient condition that B be 
a dilation of A is that the matrix of B have the form 

$$\begin{pmatrix} A & X \\ Y & Z \end{pmatrix}.$$

The purpose of dilation theory is to get information about difficult trans- 
formations by finding their easy dilations. The program is spectacularly suc- 
cessful. Unitary transformations (rotations) are among the easiest to deal with, 
and it turns out that, except for an easily adjusted normalization, every trans- 
formation has a unitary dilation. Some normalization is clearly necessary: if 
B is unitary, then $\|Bf\| = \|f\|$ for every vector f, and it follows that $\|Af\| \le \|f\|$ 
for every vector f; in other words, if A has a unitary dilation, then A must not 
increase the norm of any vector. (In the appropriate geometrical technical term, 
A must be a contraction.) That much normalization is sufficient: every contraction 
has a unitary dilation. 

As a heuristic guide to the proof, consider the very special case in which 
the given Hilbert space is 1-dimensional real Euclidean space and the dilation 
space K is the plane. In that case the given contraction is a scalar $\alpha$ (with 
$|\alpha| \le 1$), and, in geometric terms, the assertion is that multiplication by $\alpha$ (on 
the line) can be achieved by a suitable rotation (in the plane), followed by pro- 
jection (back to the line). A picture makes all this crystal clear again. 
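
The picture can also be checked numerically. The following is a minimal sketch, not part of the original argument: the names `rotation_dilation`, `is_rotation`, and `compress` are our own, the scalar is assumed real, and the line is identified with the first coordinate axis of the plane. The rotation is chosen so that its upper-left entry is the given scalar.

```python
import math


def rotation_dilation(alpha):
    """Return a 2x2 rotation matrix whose upper-left entry is alpha.

    Assumes a real scalar contraction, i.e. |alpha| <= 1.
    """
    if abs(alpha) > 1.0:
        raise ValueError("alpha must satisfy |alpha| <= 1")
    beta = math.sqrt(1.0 - alpha * alpha)  # sine of the rotation angle
    return [[alpha, -beta],
            [beta, alpha]]


def is_rotation(m, tol=1e-12):
    """Check that the columns of m are orthonormal and that det m = 1."""
    (a, b), (c, d) = m
    columns_orthonormal = (abs(a * a + c * c - 1.0) < tol
                           and abs(b * b + d * d - 1.0) < tol
                           and abs(a * b + c * d) < tol)
    return columns_orthonormal and abs(a * d - b * c - 1.0) < tol


def compress(m, x):
    """Rotate the point (x, 0) of the line, then project back onto the line."""
    rotated = (m[0][0] * x, m[1][0] * x)
    return rotated[0]


B = rotation_dilation(0.6)
print(is_rotation(B), compress(B, 2.0))
```

Rotating and then projecting multiplies each point of the line by the chosen scalar, which is the 1-dimensional instance of the dilation statement.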

The proof in the general case can be obtained by first transcribing the 
synthetic proof just outlined to analytic form, and then imitating the analytic 
geometry with matrices in the place of numbers. The conceptual problems that 
the program encounters are familiar ones, and so are their solutions; see [5] 
for the details. 

The least unitary looking contraction is 0, but, of course, even it has a uni- 
tary dilation; one such is 

$$\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.$$


This dilation does not have many useful algebraic properties. It is not neces- 
sarily true, for instance, that the square of a dilation is a dilation of the square; 
indeed, the square of the dilation of 0 exhibited above is 


$$\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix},$$


which is not a dilation of the square of 0. Is there a unitary dilation of 0 that is 
fair to squares? The answer is yes: 


$$\begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}$$

is an example. The square of this dilation is 

$$\begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix},$$


which is a dilation of the square of 0. Unfortunately, however, this dilation is 
not perfect either; its cube is 


$$\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix},$$


which is not a dilation of the cube of 0. That is (in self-explanatory language): 
the 3×3 matrix is a 2-dilation of 0, but not a 3-dilation. 
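
These small examples can be verified mechanically. In the sketch below (our own illustration; the helper names `matmul`, `matrix_power`, and `dilation_order` are hypothetical) the space H is taken to be spanned by the first basis vector, so that the compression of a matrix to H is simply its upper-left entry. The matrices are the 2×2 swap and the 3×3 cyclic permutation whose square and cube are as stated in the text.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def matrix_power(m, k):
    """k-th power of a square matrix, k >= 0, by repeated multiplication."""
    n = len(m)
    result = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = matmul(result, m)
    return result


def dilation_order(b, a_scalar, max_power):
    """Largest k <= max_power with (b**j)[0][0] == a_scalar**j for j = 1..k."""
    order = 0
    for j in range(1, max_power + 1):
        if matrix_power(b, j)[0][0] != a_scalar ** j:
            break
        order = j
    return order


SWAP = [[0, 1],
        [1, 0]]
CYCLE = [[0, 0, 1],
         [1, 0, 0],
         [0, 1, 0]]

print(dilation_order(SWAP, 0, 5), dilation_order(CYCLE, 0, 5))
```

The swap is a dilation of 0 but not a 2-dilation, while the cyclic permutation is a 2-dilation but not a 3-dilation, in agreement with the discussion above.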

Question: does every contraction on an n-dimensional Hilbert space have a 
unitary k-dilation for every k? Answer: yes, on a Hilbert space of dimension 
n(k+1); an elementary proof was given by Egerváry [4]. More is true: Nagy 
proved [15] that every contraction A has a unitary dilation B such that $B^k$ is 
a dilation of $A^k$ simultaneously for every positive integer k. (Nagy’s paper came 
between [5] and [4].) The result is true for infinite-dimensional Hilbert spaces 
too, but its impressive generality has a price: even if the given space H is finite- 
dimensional, the dilation space K may have to be infinite-dimensional. 

To show how dilation theory can be used, consider the finite-dimensional 
special case of a beautiful and powerful analytic theorem of von Neumann 
[18]. The assertion is that if A is a contraction and if q is a polynomial such that 
$|q(z)| \le 1$ whenever $|z| = 1$, then q(A) is a contraction. The proof via dilation 
theory goes as follows: If the degree of q is k, find a unitary k-dilation B of A. 
It follows then that q(B) is a dilation of q(A), and hence it is sufficient to prove 
that q(B) is a contraction. (In other words, dilation theory reduces the problem 
to the consideration of unitary transformations only.) But that is trivial: a 
unitary B has a diagonal matrix, whose diagonal entries are complex numbers 
of modulus 1; since the corresponding matrix of q(B) has diagonal entries whose 
moduli are not greater than 1, the desired conclusion becomes obvious. 
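
The steps of this proof can be written out as follows. This is a sketch in our own notation: P denotes the projection from K onto H, the $c_j$ are the coefficients of q, and the $\lambda_i$ are the eigenvalues of B.

```latex
% Assumption: q(z) = c_0 + c_1 z + ... + c_k z^k, and B is a unitary k-dilation
% of A, i.e. P B^j f = A^j f for every f in H and j = 1, ..., k.
\begin{align*}
P\, q(B) f &= \sum_{j=0}^{k} c_j\, P B^j f = \sum_{j=0}^{k} c_j\, A^j f = q(A) f
  \qquad (f \in H), \\
\|q(A) f\| &= \|P\, q(B) f\| \le \|q(B) f\| \le \|q(B)\|\, \|f\|, \\
\|q(B)\| &= \max_i |q(\lambda_i)| \le 1
  \qquad (|\lambda_i| = 1 \text{ for every eigenvalue } \lambda_i \text{ of } B).
\end{align*}
```

The first line uses only the linearity of compression; the last uses the diagonal form of the unitary B with respect to an orthonormal basis of eigenvectors.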


Reflexive lattices. The set of all subspaces of a finite-dimensional Hilbert 
space H is a lattice (with respect to the operations of intersection and span) with 
zero element 0 and unit element H. Certain of its sublattices, the ones called 
reflexive, are of interest in linear algebra. The definition of reflexivity requires 
of a lattice $\mathcal{L}$ that a two-step process performed on $\mathcal{L}$, which always yields a 
lattice of subspaces at least as large as $\mathcal{L}$, should, in fact, yield exactly $\mathcal{L}$, and 
nothing more. The two steps are these: (1) form all linear transformations that 
leave invariant each subspace of $\mathcal{L}$, and then (2) form all subspaces of H that 
are invariant under all those linear transformations. 

Here is an example. Suppose that H is 2-dimensional, and let $\mathcal{L}$ consist of 
0, H, and two distinct lines (i.e., 1-dimensional subspaces of H). To say of a 
linear transformation that it leaves invariant each subspace in $\mathcal{L}$ is to say just 
that it has two prescribed eigenvectors, and hence that its matrix with respect 
to the basis they form is diagonal. Since the only subspaces simultaneously 
invariant under all such diagonal transformations are the ones in $\mathcal{L}$, the lattice 
$\mathcal{L}$ is reflexive indeed. 

Here is a non-example. Suppose again that H is 2-dimensional, and let $\mathcal{L}$ 
consist of 0, H, and three distinct lines. It is very hard for a linear transforma- 
tion on H to have three distinct eigenvectors; the only linear transformations 
that can do it are the scalar multiples of the identity. Such scalar multiples, on 
the other hand, leave invariant every subspace of H. The two-step process 
applied to this $\mathcal{L}$ drastically enlarges $\mathcal{L}$; instead of the 5-element lattice $\mathcal{L}$, the 
enlargement is the infinite lattice of all subspaces of H. 
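
The claim that only scalars leave three distinct lines invariant takes one line of computation. The following is a sketch under a normalization of our own choosing: $e_1$ and $e_2$ span two of the lines and are scaled so that the third line is spanned by $e_1 + e_2$.

```latex
% Assumption: M leaves invariant the lines spanned by e_1, e_2 and e_1 + e_2.
\begin{align*}
M e_1 = a\, e_1,\quad M e_2 = d\, e_2
  \;&\Longrightarrow\; M = \begin{pmatrix} a & 0 \\ 0 & d \end{pmatrix}, \\
M (e_1 + e_2) = \lambda\, (e_1 + e_2)
  \;&\Longrightarrow\; a = \lambda = d
  \;\Longrightarrow\; M = \lambda I .
\end{align*}
```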

The non-example of the preceding paragraph fails to be reflexive the worst 
way anything can; the enlargement it effects is maximal. Another way of saying 
the same thing is that a linear transformation that leaves invariant every sub- 
space of the lattice is necessarily a scalar. A lattice that is non-reflexive in this 
extreme way is called transitive. 

A basic open problem about operator theory on infinite-dimensional Hilbert 
spaces is to characterize all reflexive lattices and all transitive (extremally 
non-reflexive) lattices. A little progress has been made, but not very much. The 
finite-dimensional specialization of what is known amounts to two statements: 
(1) every chain (totally ordered set) of subspaces is reflexive, and (2) every 
Boolean algebra of subspaces is reflexive. The infinite-dimensional case of (1) is 
Ringrose’s generalization [17] of a result of Kadison and Singer [14]. For a 
statement of the appropriate infinite-dimensional formulation of (2) see [10]; 
the proof has not been published yet. In the finite-dimensional case both results 
become almost trivial. 

As for transitive lattices, even less is known. The example above (the one 
with three lines) is in a certain sense degenerate. From the point of view of pro- 
jective geometry, which is quite appropriate here, the space of that example has 
dimension 1, not 2, and the example does not help to answer the question 
whether higher-dimensional examples exist at all. 

An interesting unpublished observation of J. E. McLaughlin shows that 
they do exist; one such, in $C^n$, consists of all those subspaces that are invariant 
under the formation of complex conjugates. More explicitly: call a subspace M 
of $C^n$ symmetric in case M contains, along with each of its vectors, the vector 
whose coordinates are obtained from the given one by complex conjugation. As- 
sertion: the set of all symmetric subspaces is a transitive lattice. The proof 
requires a moment’s thought, but there is nothing profound about it. 

The example of the preceding paragraph yields many examples. Given a 
finite-dimensional Hilbert space, coordinatize it (i.e., establish an isomorphism 
between it and $C^n$); the lattice of symmetric subspaces with respect to that 
coordinatization is a transitive lattice. Question: are all transitive lattices 
obtainable in this way? 

The answer is no for two reasons, both trivial, and the heart of the question 
is still unanswered. First, a topological distinction arises: are the lattices under 
discussion closed or not? (There is only one reasonable topology for the space of 
subspaces; the question makes unambiguous sense.) If a lattice is dense in a 
transitive lattice, then it itself is transitive; the question loses no vigor at all if 
attention is restricted to closed lattices only. As long as restrictions are in order, 
here is one spot where the complex field makes life more complicated, not less; 
the question retains all its interest if attention is restricted to real spaces only. 
Second, for spaces of even dimension 2n there is a construction that yields a 
transitive lattice whose non-trivial elements are all of dimension n. Such a lattice 
imitates an already observed misbehavior; it is isomorphic to a sublattice of the 
lattice of the projective line. 

The result of the indicated specializations is the following question: is every 
closed transitive lattice of subspaces of an odd-dimensional Euclidean space 
(real inner product space) equal to the lattice of all subspaces? The answer does 
not seem to be known. 


Note added in proof. K. J. Harrison (Monash University) has recently dis- 
covered a new transitive lattice of 18 elements that shows that the answer to the 
question as it stands is no. A modification, however, restores the question: just 
add the hypothesis that the atoms of the lattice span the whole space. Harrison’s 
discovery makes the problem of determining all transitive lattices even more 
challenging than before. 


Epilogue. Three topics were discussed above to illustrate the thesis that 
operator theory on Hilbert space yields non-trivial questions and answers about 
finite matrices. Choosing the examples was not an easy task; many more are 
available than can be included in one lecture, or one paper, of reasonable length. 

I could have chosen the theorem about “near” projections (if the projections 
onto two subspaces are near enough in norm, then the subspaces have the same 
dimension) [8, Problem 43]. I could have discussed the “power inequality” 
($w(A^n) \le (w(A))^n$) for the numerical range [8, Problem 176], which started in 
the theory of partial differential equations and ended as a problem about 
matrices, a problem that refused to become trivial even in the 2×2 case. The 
topological properties of sets of reducible and irreducible operators were of re- 
cent research interest [9, 16], and so also were partial isometries [7, 12]; in 
both these cases new and interesting facts about the finite-dimensional case 
emerged. The theory of matrices whose entries are matrices still has some life 
in it; thus, for instance, the “ultra-invariant” subspaces of binormal operators 
(2×2 matrices whose entries are commutative normal operators; see [1]) are 
still being studied (by R. G. Douglas and C. M. Pearcy), and there are small, 
amusing, and until recently unnoticed questions even about determinants. 
(Sample: if all four ways of forming the formal determinant of a 2×2 matrix 
whose entries are matrices yield an invertible matrix, does the matrix itself have 
to be invertible? See [13].) 

The subjects just given honorable mention, as well as the three actually 
discussed in detail, have been receiving serious research attention in the course 
of the last twenty years (and many still are), and they have all led to questions 
that could and should have been asked in the finite-dimensional case long before, 
but as a matter of historical fact they were not. The reason perhaps is that the 
powerful tools of finite-dimensional linear algebra are too good; they sometimes 
conceal the elegant and intricate structure that the difficulties of the infinite- 
dimensional theory bring out. Most of the problems of operator theory can be 
formulated in the finite-dimensional case, and there are two reasons why it is 
good to do so. The old reason is that the finite can suggest what should and 
should not be tried with the infinite; the new reason is the joy of seeing the 
infinite inspire and guide the finite and contribute to a new flowering of an 
old subject. 

COMPLEX METHODS IN HARMONIC ANALYSIS 
GUIDO WEISS, Washington University, St. Louis 


1. Introduction. The purpose of this paper is two-fold. First, we plan to 
show how properties of analytic functions of a complex variable can be used to 
obtain several results of classical harmonic analysis (that is, the theory of 
Fourier series and integrals of one real variable). This will be done in Section 1. 
Second, in Section 2 we shall indicate how some of these applications of the 
theory of functions can be extended to Fourier analysis of functions of several 
variables. 

Some of these applications of the theory of functions seem very startling 
since the results obtained appear to involve only the theory of functions of a real 
variable or the theory of measure. This is true of the following results all of 
which were originally obtained by these complex methods. In fact, “real vari- 
able” proofs of (a) and (c) are still not known. 

(a) Suppose $\mu$ is a finite Borel measure on $(-\pi, \pi)$; then its Fourier-Stieltjes 
coefficients are 

$$c_n = \frac{1}{2\pi} \int_{-\pi}^{\pi} e^{-in\theta}\, d\mu(\theta), \qquad n = 0, \pm 1, \pm 2, \ldots.$$

If $c_n = 0$ for $n < 0$ then $\mu$ is absolutely continuous. This result, obtained by F. and M. Riesz [9], 
has many applications. One of them is the following result of Helson [8]: If 
$s_n(\theta) = \sum_{k=-n}^{n} c_k e^{ik\theta}$, $n = 1, 2, 3, \ldots$, are the partial sums of a trigonometric 
series and they satisfy 

$$\|s_n\|_1 = \int_{-\pi}^{\pi} |s_n(\theta)|\, d\theta \le A < \infty,$$

where $A$ is independent of $n = 1, 2, 3, \ldots$, then the coefficients $c_n$ tend to 0. 
Since $\int_{-\pi}^{\pi} e^{ik\theta}\, d\theta = 0$ when $k \ne 0$, this applies to the important special case $s_n(\theta) \ge 0$ 
for $n = 1, 2, 3, \ldots$. 

(b) If $f \in L^p(-\infty, \infty)$, $1 \le p < \infty$, then the Hilbert transform $\tilde f$ of $f$ is defined 
by the limit 

$$\tilde f(x) = \lim_{\varepsilon \to 0} \frac{1}{\pi} \int_{|t| \ge \varepsilon > 0} f(x - t)\, \frac{dt}{t}. \tag{1.1}$$

This definition, of course, only makes sense if the limit on the right exists. It can 
be shown that this limit exists and is finite a.e. If we also assume that $1 < p$, then 

$$\|\tilde f\|_p \le A_p \|f\|_p,$$

where $A_p$ is independent of $f \in L^p$ (this result is known as the M. Riesz 
inequality). When $p = 1$ it is not true that $\tilde f$ belongs to $L^1$; however, the following 
more general condition is satisfied by $\tilde f$: if $m$ denotes Lebesgue measure then 

$$m(\{x \in (-\infty, \infty) : |\tilde f(x)| > s > 0\}) \le \frac{A}{s}\, \|f\|_1, \tag{1.2}$$

where $A$ is independent of $f \in L^1(-\infty, \infty)$. (Inequality (1.2) is a special case of 
what are known as weak-type inequalities for operators on $L^p$ spaces. For the 
definition of this notion and for many illustrations of its use, we refer the reader 
to Chapter XII of [16].) 

(c) A recently obtained theorem of Calderón [3], having important applica- 
tions in the theory of partial differential equations, extends the last result: Sup- 
pose that $a$ is a Lipschitz function on $(-\infty, \infty)$; then the operator 

$$T_a(f)(x) = \lim_{\varepsilon \to 0} \int_{|x - t| \ge \varepsilon > 0} \frac{a(x) - a(t)}{(x - t)^2}\, f(t)\, dt$$

is well defined almost everywhere when $f \in L^p(-\infty, \infty)$, $1 < p < \infty$, and satisfies 
the inequality $\|T_a f\|_p \le A_p \|f\|_p$, where $A_p$ is independent of $f$. 

(d) The Fourier series analogue of the Hilbert transform is the conjugate 
function operator which can be defined for $f \in L^1(-\pi, \pi)$ by the principal value 
integral 

$$\tilde f(\theta) = \lim_{\varepsilon \to 0} \frac{1}{2\pi} \int_{\pi \ge |\phi| \ge \varepsilon > 0} f(\theta - \phi)\, \frac{\sin \phi}{1 - \cos \phi}\, d\phi.$$

This operator satisfies the properties described in example (b). That is, $\tilde f(\theta)$ exists 
almost everywhere and, if $1 < p < \infty$, there is a constant $A_p$, independent of $f$, 
such that $\|\tilde f\|_p \le A_p \|f\|_p$. 

The classical theory of Fourier series has been described by A. Zygmund, in 
the preface of his book [16], “As the meeting ground of real and complex vari- 
ables.” This should come as no surprise once we make the following observa- 
tions. Suppose 


$$\frac{a_0}{2} + \sum_{k=1}^{\infty} (a_k \cos k\theta + b_k \sin k\theta) \tag{1.3}$$

is a trigonometric series with real coefficients. Then if we add to it (formally) 
the series 

$$i \sum_{k=1}^{\infty} (a_k \sin k\theta - b_k \cos k\theta)$$

we obtain 

$$\frac{a_0}{2} + \sum_{k=1}^{\infty} (a_k - i b_k)\, e^{ik\theta},$$

which is the restriction to the unit circle $|z| = 1$ of the power series 

$$\frac{a_0}{2} + \sum_{k=1}^{\infty} (a_k - i b_k)\, z^k. \tag{1.4}$$

We see, therefore, that each trigonometric series with real coefficients is, for- 
mally, the real part of a power series development about the origin. Hence, it is not 
unreasonable to expect that some results concerning trigonometric and Fourier 
series could be obtained by the following general procedure. If we desire to 
establish some properties of the series (1.3) we construct the power series in the 
manner described above, which, hopefully, defines an analytic function $F$ in the 
interior of the unit circle. We may, if this is the case, then apply results of the 
theory of functions to obtain certain properties of $F$. By tending to the bound- 
ary of the unit disc in a radial direction (that is, by having $z = re^{i\theta}$ approach $e^{i\theta}$ by 
letting $r$ tend to 1) we may then expect to obtain an almost everywhere defined 
function of @ having properties enabling us to obtain the desired result. In case 
we are dealing with functions defined on the entire real line and their Fourier in- 
tegrals the situation is much the same. In this case we might think of the real 
line as embedded in the plane with the upper half plane assuming the role of the 
interior of the unit circle. 
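
The passage from a real trigonometric series to a power series can be checked numerically on partial sums. The following is a minimal sketch of our own: the coefficient lists `a` and `b` are arbitrary sample data, `b[0]` is unused, and the function names are hypothetical. It compares the real and imaginary parts of the power series on the unit circle with the trigonometric sum and with the formally added series.

```python
import cmath
import math


def trig_partial_sum(a, b, theta):
    """a[0]/2 + sum over k >= 1 of (a[k] cos k*theta + b[k] sin k*theta)."""
    total = a[0] / 2.0
    for k in range(1, len(a)):
        total += a[k] * math.cos(k * theta) + b[k] * math.sin(k * theta)
    return total


def added_partial_sum(a, b, theta):
    """sum over k >= 1 of (a[k] sin k*theta - b[k] cos k*theta)."""
    return sum(a[k] * math.sin(k * theta) - b[k] * math.cos(k * theta)
               for k in range(1, len(a)))


def power_series_partial_sum(a, b, z):
    """a[0]/2 + sum over k >= 1 of (a[k] - i b[k]) z**k."""
    total = complex(a[0] / 2.0, 0.0)
    for k in range(1, len(a)):
        total += complex(a[k], -b[k]) * z ** k
    return total


a = [1.0, 0.5, -0.25, 2.0]
b = [0.0, 1.5, 0.75, -1.0]
theta = 0.7
w = power_series_partial_sum(a, b, cmath.exp(1j * theta))
print(abs(w.real - trig_partial_sum(a, b, theta)) < 1e-12,
      abs(w.imag - added_partial_sum(a, b, theta)) < 1e-12)
```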

A classical theorem of Fatou [6] is, perhaps, the most basic tool that can be 
used for carrying out the above described procedure. This theorem asserts that 
any bounded analytic function $F(z) = F(re^{i\theta})$ defined for $|z| < 1$ has radial limits 
almost everywhere. That is, for almost every $\theta \in (-\pi, \pi)$, $\lim_{r \to 1} F(re^{i\theta}) = F(e^{i\theta})$ 
exists. The corresponding result for a bounded analytic function $F$ in the upper 
half plane $\{z = x + iy : -\infty < x < \infty,\ 0 < y\}$ is that $\lim_{y \to 0} F(x + iy) = F(x)$ exists 
for almost every $x$. 

Let us describe in more detail how Fatou’s theorem can be applied to obtain 
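result (b) stated above. Suppose, then, that $f \in L^p(-\infty, \infty)$, $1 \le p < \infty$. We 
want to show that 

$$\int_{|t| \ge \varepsilon > 0} f(x - t)\, \frac{dt}{t}$$

tends to a limit as $\varepsilon$ tends to 0. This problem is reduced to one involving an 
analytic function in the following way: We form the Poisson integral 

$$u(x, y) = \frac{1}{\pi} \int_{-\infty}^{\infty} f(x - t)\, \frac{y}{t^2 + y^2}\, dt$$

and the conjugate Poisson integral 

$$v(x, y) = \frac{1}{\pi} \int_{-\infty}^{\infty} f(x - t)\, \frac{t}{t^2 + y^2}\, dt.$$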

It is then an easy thing to check that $u(x, y) + iv(x, y) = F(x + iy)$, $y > 0$, are the 
values of an analytic function on the upper half plane. An elementary argument 
shows that 

$$\lim_{\varepsilon \to 0} \left[ \int_{-\infty}^{\infty} f(x - t)\, \frac{t}{t^2 + \varepsilon^2}\, dt - \int_{|t| \ge \varepsilon > 0} f(x - t)\, \frac{dt}{t} \right] = 0$$

almost everywhere (see Chapter 3 of [16], where this argument is 
given for the conjugate function operator). Thus it suffices to show that 

$$\tilde f(x) = \lim_{\varepsilon \to 0} \frac{1}{\pi} \int_{-\infty}^{\infty} f(x - t)\, \frac{t}{t^2 + \varepsilon^2}\, dt$$

exists for almost every $x \in (-\infty, \infty)$. 
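
The analyticity of $u + iv$ can be seen from a short computation. The following is a sketch, taking the Poisson kernel to be $y/(t^2 + y^2)$ and the conjugate kernel to be $t/(t^2 + y^2)$, each with the factor $1/\pi$, and writing $z = x + iy$; differentiation under the integral sign is taken for granted.

```latex
% Assumption: f is in L^p, 1 <= p < infinity, and z = x + iy with y > 0.
\begin{align*}
\frac{y + it}{t^2 + y^2} &= \frac{i\,(t - iy)}{(t + iy)(t - iy)} = \frac{i}{t + iy}, \\
u(x, y) + i\,v(x, y) &= \frac{1}{\pi} \int_{-\infty}^{\infty} f(x - t)\, \frac{i}{t + iy}\, dt
  = \frac{i}{\pi} \int_{-\infty}^{\infty} \frac{f(s)}{z - s}\, ds
  \qquad (s = x - t).
\end{align*}
```

For each fixed real $s$ the integrand $f(s)/(z - s)$ is analytic in $z$ for $y > 0$, which is the source of the analyticity of $F$.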

By considering the negative and positive parts of f separately we can reduce 
the problem to the case $f \le 0$. Thus, since $y/(t^2 + y^2) > 0$ if $(t, y)$ belongs to the 
upper half plane, $u \le 0$ and the function $G = \exp\{u + iv\} = e^F$ has absolute value 
$e^u \le 1$. By Fatou’s theorem $G(z)$ has “vertical” limits a.e. on the boundary (that 
is, as $z = x + iy$, $y > 0$, tends to $x$ by letting $y \to 0$). Thus, $v(z)$ must have vertical 
limits modulo $2\pi$ at almost all boundary points $x \in (-\infty, \infty)$. That is, the limit 
points of $v(z)$, as $z$ tends vertically to $x$, have the form $a + 2k\pi$, where $k$ is an in- 
teger. But it is an immediate consequence of the continuity of $v$ in the upper half 
plane that, if the values $v(z)$ have two distinct limit points, $a$ and $b$, as $z$ ap- 
proaches $x$ in this manner, then all points between $a$ and $b$ must also be limit 
points of this set of values. Hence, $v(z)$ can have such a limit modulo $2\pi$ only if 
it has a limit. This proves the first part of result (b). 

In case we are dealing with an integrable function f defined on the interval 
$(-\pi, \pi)$ whose Fourier series is given by the expression (1.3), then the power 
series (1.4) does converge in the interior of the unit circle (since $|a_k|$, $|b_k|$ are 
dominated by $(1/\pi) \int_{-\pi}^{\pi} |f(\theta)|\, d\theta$). The function $F$ having this power series de- 
velopment can also be constructed by forming the Poisson integral of $f$ which, 
in this case, is 

$$u(re^{i\theta}) = \frac{1}{2\pi} \int_{-\pi}^{\pi} f(\phi)\, \frac{1 - r^2}{1 - 2r\cos(\theta - \phi) + r^2}\, d\phi$$

and the conjugate Poisson integral 

$$v(re^{i\theta}) = \frac{1}{\pi} \int_{-\pi}^{\pi} f(\phi)\, \frac{r \sin(\theta - \phi)}{1 - 2r\cos(\theta - \phi) + r^2}\, d\phi.$$

We then have $F = u + iv$. The M. Riesz inequality can be obtained by any one of 
several complex methods. For example see Sections 2 and 3 of Chapter 7 of [16]. 
The weak type inequality (1.2) can also be obtained by making use of the prop- 
erties of analytic functions (see Chapter 5 of [11]). 
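
The link between the Poisson kernels for the disc and power series can be checked numerically. The sketch below is our own illustration, with hypothetical function names and with both kernels normalized to go with the factor $1/2\pi$ (an assumption of this example). It compares the closed forms with the real and imaginary parts of $(1 + z)/(1 - z) = 1 + 2\sum_{k \ge 1} z^k$ at $z = re^{it}$, and with truncated series.

```python
import cmath
import math


def poisson_kernel(r, t):
    """(1 - r^2) / (1 - 2 r cos t + r^2), for 0 <= r < 1."""
    return (1.0 - r * r) / (1.0 - 2.0 * r * math.cos(t) + r * r)


def conjugate_poisson_kernel(r, t):
    """2 r sin t / (1 - 2 r cos t + r^2), for 0 <= r < 1."""
    return 2.0 * r * math.sin(t) / (1.0 - 2.0 * r * math.cos(t) + r * r)


def kernel_series(r, t, terms=200):
    """Truncations of 1 + 2 sum r^k cos kt and of 2 sum r^k sin kt."""
    p = 1.0 + 2.0 * sum(r ** k * math.cos(k * t) for k in range(1, terms + 1))
    q = 2.0 * sum(r ** k * math.sin(k * t) for k in range(1, terms + 1))
    return p, q


r, t = 0.5, 1.0
z = r * cmath.exp(1j * t)
w = (1.0 + z) / (1.0 - z)
p_series, q_series = kernel_series(r, t)
print(abs(w.real - poisson_kernel(r, t)) < 1e-12,
      abs(w.imag - conjugate_poisson_kernel(r, t)) < 1e-12,
      abs(p_series - poisson_kernel(r, t)) < 1e-12,
      abs(q_series - conjugate_poisson_kernel(r, t)) < 1e-12)
```

Integrating $f$ against the two kernels therefore produces the real and imaginary parts of a single analytic function, up to the constant term.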

It is not our intention to prove all the results stated here. Rather we would 
like to indicate how the theory of functions can be used to obtain them. The 
above argument, using Fatou’s Theorem, is typical of the situation. The follow- 
ing more general form of Fatou’s Theorem is also often used. We may express 
the fact that an analytic function $F(z)$ is bounded in the interior of the unit circle 
by asserting that 

$$\mu_\infty(r) = \sup_{-\pi < \theta \le \pi} |F(re^{i\theta})| \le A < \infty,$$

for $0 \le r < 1$. That is, we assert that the $L^\infty$ norms of the functions obtained by 
fixing $r$, $0 \le r < 1$, and whose values are $F(re^{i\theta})$, are bounded uniformly in $r$. We 
may, instead of these norms, consider other $L^p$ norms. For example, suppose $F$ 
satisfies 

$$\mu_p(r) = \left( \int_{-\pi}^{\pi} |F(re^{i\theta})|^p\, d\theta \right)^{1/p} \le A < \infty \tag{1.5}$$

for some $p > 0$ and $0 \le r < 1$. It can then be shown that 

$$\lim_{r \to 1} F(re^{i\theta}) = F(e^{i\theta}) \ \text{exists a.e. and} \ \lim_{r \to 1} \left( \int_{-\pi}^{\pi} |F(re^{i\theta}) - F(e^{i\theta})|^p\, d\theta \right)^{1/p} = 0. \tag{1.6}$$

Result (a) is a consequence of this more general form of Fatou’s Theorem. More 
precisely, it follows from the case $p = 1$. 

The spaces of analytic functions satisfying condition (1.5) were first intro- 
duced by G. H. Hardy [7] and are now known as $H^p$ spaces. Corresponding 
function spaces can be defined for an analytic function $F$ in the upper half plane 
by saying that such a function belongs to the class $H^p$ corresponding to this do- 
main provided 

$$\mu_p(y) = \left( \int_{-\infty}^{\infty} |F(x + iy)|^p\, dx \right)^{1/p} \le A < \infty,$$

for some $p > 0$ and all $y > 0$. The basic result in the theory of these spaces is that 
for $F \in H^p$ 

$$\lim_{y \to 0} F(x + iy) = F(x) \ \text{a.e. and} \ \lim_{y \to 0} \int_{-\infty}^{\infty} |F(x + iy) - F(x)|^p\, dx = 0. \tag{1.7}$$


2. Extensions to several variables. In view of the remarks made in the last 
section, perhaps the most efficient way of describing how these complex methods 
arise in the case of harmonic analysis on Euclidean $n$-dimensional space $R^n$ is to 
present several possible definitions of $H^p$ spaces of functions of several variables. 
The most natural extension of the classical $H^p$ spaces of analytic functions de- 
fined in the interior of the unit circle is the space of all functions $F$ that are 
holomorphic in the polydisc 

$$D^n = \{z = (z_1, z_2, \ldots, z_n) \in C^n : |z_1| < 1,\ |z_2| < 1,\ \ldots,\ |z_n| < 1\}$$

and satisfy 

$$\int_{-\pi}^{\pi} \cdots \int_{-\pi}^{\pi} |F(r_1 e^{i\theta_1}, \ldots, r_n e^{i\theta_n})|^p\, d\theta_1 \cdots d\theta_n \le A < \infty$$

for $0 \le r_1, r_2, \ldots, r_n < 1$. The theory of these spaces was first developed by A. 
Zygmund [15]. He showed that (1.6) has the following extension: 

$$\lim_{r_1, r_2, \ldots, r_n \to 1} F(r_1 e^{i\theta_1}, r_2 e^{i\theta_2}, \ldots, r_n e^{i\theta_n}) = F(e^{i\theta_1}, e^{i\theta_2}, \ldots, e^{i\theta_n})$$

exists for almost every $(\theta_1, \theta_2, \ldots, \theta_n) \in [-\pi, \pi] \times \cdots \times [-\pi, \pi]$ and 

$$\lim_{r_1, r_2, \ldots, r_n \to 1} \int_{-\pi}^{\pi} \cdots \int_{-\pi}^{\pi} |F(r_1 e^{i\theta_1}, \ldots, r_n e^{i\theta_n}) - F(e^{i\theta_1}, \ldots, e^{i\theta_n})|^p\, d\theta_1 \cdots d\theta_n = 0.$$


We note that the entire boundary of the polydisc is not involved. Rather, only 
the product of ” perimeters of unit circles is being used as the domain of defini- 
tion of the boundary values. 

The analogous extension of the spaces defined at the very end of Section 1 
is the class of all holomorphic functions $F$ defined in the product of half planes 

$$\{z = (x_1 + iy_1, \ldots, x_n + iy_n) : y_1 > 0, \ldots, y_n > 0\}$$

satisfying 

$$\int_{R^n} |F(x_1 + iy_1, \ldots, x_n + iy_n)|^p\, dx_1 \cdots dx_n \le A < \infty$$

for all $y_1, y_2, \ldots, y_n > 0$. Again, it can be shown that for such a function $F$ we 
have (see [16]) 

$$\lim_{y_1, y_2, \ldots, y_n \to 0} F(x_1 + iy_1, x_2 + iy_2, \ldots, x_n + iy_n) = F(x_1, x_2, \ldots, x_n)$$

exists for a.e. $x = (x_1, \ldots, x_n) \in R^n$ and 

$$\lim_{y_1, y_2, \ldots, y_n \to 0} \int_{R^n} |F(x_1 + iy_1, \ldots, x_n + iy_n) - F(x_1, \ldots, x_n)|^p\, dx_1 \cdots dx_n = 0.$$


The more complicated structure of $R^n$, when $n \ge 2$, however, allows us to 
consider much more general function spaces that directly extend the classical 
$H^p$ spaces. For example, one can obtain such a function space associated with 
every tube domain in the complex $n$-dimensional space $C^n$. These domains are 
defined in the following way: Let $B$ be an open convex subset of $R^n$; then the tube 
$T_B$ having base $B$ is the domain 

$$T_B = \{x + iy : x \in R^n,\ y \in B\}.$$

Then a function $F$ is said to belong to the space $H^p(T_B)$, $p > 0$, provided it is 
holomorphic in $T_B$ and satisfies 

$$\int_{R^n} |F(x + iy)|^p\, dx \le A < \infty \tag{2.1}$$

for all $y \in B$. When the dimension $n$ is 1 and $B = (0, \infty)$, $H^p(T_B)$ is the space 
defined at the end of the last section. Since we are assuming that $B$ is convex, 
the only other possible types of tube domains are those with base a finite inter- 
val or the entire real line. (If $B$ is merely open and connected, but not neces- 
sarily convex, and $F$ is a holomorphic function in $T_B$ satisfying (2.1) then $F$ has 
a unique holomorphic extension to $T_{B^c}$, where $B^c$ is the convex hull of $B$, which 
satisfies (2.1) for all $y \in B^c$ (see [11]).) In the latter case it follows from condition 
(2.1) that $F \in H^p(T_B)$ implies $F = 0$. In the former case we obtain an interesting 
function space having many of the features of the better known classical space 
associated with the upper half-plane. 

One would expect, in view of result (1.7), that, in general, for $F \in H^p(T_B)$ 
and $y_0$ a boundary point of $B$, $F(x + iy)$ tends to a limit function $F(x + iy_0)$ as 
$y \in B$ tends to $y_0$ (either almost everywhere or in the mean). Unfortunately, 
when $n > 1$ this is not true even when $p = 2$ and we consider only limits in the 
mean. (These are the simplest $H^p$ spaces one can study since many of their 
properties can be obtained by making use of the behavior of the Fourier trans- 
form as an operator in $L^2(R^n)$. In particular, many of these properties are conse- 
quences of the Plancherel theorem. Bochner ([1] and [2]) was the first to intro- 
duce the spaces $H^2(T_B)$. The more general $H^p(T_B)$ spaces, $0 < p$, were first 
introduced in [10].) In order to assure the existence of such limit functions we 
must restrict the approach of $y$ to the boundary point $y_0$ to be “nontangential” 
(see [10] for a precise definition of this notion). Despite such difficulties the $H^2$ 
theory is sufficiently rich to enable one to obtain several applications. For exam- 
ple, it can be used to derive a characterization of the Fourier transforms of $L^2$ 
functions vanishing outside certain compact and convex subsets of $R^n$ (this is 
an extension of the Paley-Wiener theorem; see the third chapter of [10] for 
details). 

By restricting the base of the tube domain we can obtain other direct exten- 
sions of results in the classical one-dimensional theory. For example, let $\Gamma$ be an 
open cone with vertex at the origin; that is, $\Gamma$ is an open set in $R^n$ having the 
property that whenever $x, y \in \Gamma$ and $a, b > 0$ then $ax + by \in \Gamma$. We say that $\Gamma$ is 
regular if it does not contain any entire straight lines. Associated with a tube 
domain $T_\Gamma$, with $\Gamma$ a regular open cone, one can find a Poisson kernel (see [10]) 
which makes it possible to obtain an $H^p$ space theory for all $p > 0$. One of the 
applications of this theory is the following extension of result (a): 


If $\mu$ is a finite Borel measure on $R^n$ whose Fourier-Stieltjes transform has sup- 
port in a regular open cone then $\mu$ is absolutely continuous. 

The proof of this result is very similar to the classical one: The hypotheses 
imply that the Poisson integral of the measure gives us a function belonging to 
$H^1(T_{\Gamma^*})$, where $\Gamma^*$ is a cone associated with $\Gamma$ (called the “dual” to $\Gamma$). Once 
this is done, the rest of the proof is essentially the same as that of the 1-dimen- 
sional result. (This application was brought to the author’s attention in a con- 
versation with Professor R. R. Coifman.) 

The theory of $H^p$ spaces becomes much richer if we consider certain special 
cones. For example, if $\Gamma$ is the circular cone 
$\{y = (y_1, y_2, \ldots, y_n) \in R^n : y_1^2 - y_2^2 - \cdots - y_n^2 > 0 \text{ and } y_1 > 0\}$, 
the Poisson kernel has the following particularly simple form for $x + iy \in T_\Gamma$: 

$$P_y(x) = \frac{c\,(y, y)^{n/2}}{\left( [(x, x) - (y, y)]^2 + 4(x, y)^2 \right)^{n/2}},$$

where $(u, v) = u_1 v_1 - u_2 v_2 - \cdots - u_n v_n$ and $c$ is an appropriate constant. Similar 


formulae can be obtained for the Poisson kernel associated with other cones 
that are “classical domains.” These expressions make it possible to obtain 
certain estimates for Poisson integrals useful in extending one-dimensional 
results. For details see Chapter 3 of [11]. 

One can extend the $H^p$ theory associated with the unit disc to $n$ dimensions 
by a method analogous to the one just described. This involves the use of 
Reinhardt regions instead of tube domains. The problems arising from this 
development are similar to those encountered in the $H^p$ space theory associated 
with tubes. 

There exist other ways of considering n-dimensional extensions of complex 
methods. We shall finish this article by describing certain $H^p$ spaces for which 
the basic novelty in the generalization is not in the domain used, but in what is 
meant by an analytic function of $n$ variables. If $F = u + iv$ is analytic in some 
region of the plane $R^2$, we know that the Cauchy-Riemann equations $v_x + u_y = 0$ 
and $v_y - u_x = 0$ must be satisfied in this region. Conversely, if two differentiable 
functions $u$ and $v$ satisfy these equations in a simply connected region $U \subset R^2$, 
then $F = u + iv$ is analytic in $U$. The second equation asserts that $(v, u)$ is the 
gradient of a function $h$, while the first assures us that $h$ is harmonic. Conse- 
quently, $F = u + iv$ is analytic if and only if $(v, u)$ is the gradient of a harmonic 
function $h$ in every simply connected subregion of the domain of $F$. 
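
A concrete instance may help. For $F(z) = z^2$ one has $u = x^2 - y^2$ and $v = 2xy$, and $(v, u)$ is the gradient of $h = x^2 y - y^3/3$. The following sketch checks this with finite differences; it is our own illustration, the choice of $F$ and the step sizes and tolerance are assumptions, and the function names are hypothetical.

```python
def u(x, y):
    """Real part of z**2."""
    return x * x - y * y


def v(x, y):
    """Imaginary part of z**2."""
    return 2.0 * x * y


def h(x, y):
    """Candidate potential with gradient (v, u)."""
    return x * x * y - y ** 3 / 3.0


def d_dx(g, x, y, step=1e-4):
    """Central difference approximation to the x-derivative of g."""
    return (g(x + step, y) - g(x - step, y)) / (2.0 * step)


def d_dy(g, x, y, step=1e-4):
    """Central difference approximation to the y-derivative of g."""
    return (g(x, y + step) - g(x, y - step)) / (2.0 * step)


def laplacian(g, x, y, step=1e-3):
    """Five-point approximation to the Laplacian of g."""
    return (g(x + step, y) + g(x - step, y) + g(x, y + step)
            + g(x, y - step) - 4.0 * g(x, y)) / (step * step)


def check_point(x, y, tol=1e-6):
    """True if the Cauchy-Riemann and gradient relations hold at (x, y)."""
    residuals = (
        d_dx(v, x, y) + d_dy(u, x, y),   # first equation: v_x + u_y = 0
        d_dy(v, x, y) - d_dx(u, x, y),   # second equation: v_y - u_x = 0
        d_dx(h, x, y) - v(x, y),         # h_x = v
        d_dy(h, x, y) - u(x, y),         # h_y = u
        laplacian(h, x, y),              # h is harmonic
    )
    return all(abs(res) < tol for res in residuals)


print(check_point(0.7, -1.3))
```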

In view of these observations it is reasonable to consider the gradient of a 
harmonic function defined in a domain $U \subset R^n$ to be a type of “generalized 
analytic function.” If $F = (u_1, u_2, \ldots, u_n)$ is such a gradient then, clearly, the 
components $u_1, u_2, \ldots, u_n$ satisfy the system of partial differential equations 

$$\sum_{j=1}^{n} \frac{\partial u_j}{\partial x_j} = 0, \qquad \frac{\partial u_j}{\partial x_k} = \frac{\partial u_k}{\partial x_j}, \tag{2.2}$$

for $j, k = 1, 2, \ldots, n$. We call a solution $F = (u_1, u_2, \ldots, u_n)$ of (2.2) in a con- 
nected (not necessarily simply connected) region $U \subset R^n$ an M. Riesz system of 
conjugate harmonic functions. (A simple calculation shows that any differentiable 
solution of (2.2) consists of functions that are harmonic.) Such a vector valued 
function F is an example of the generalization of the notion of analytic function 
we mentioned in the previous paragraph. 
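The system (2.2) can be checked numerically on a concrete gradient. The following is only an illustrative sketch, not part of the original exposition: the harmonic polynomial $h(x_1, x_2, x_3) = x_1^2 + x_2^2 - 2x_3^2 + x_1 x_2 x_3$, the helper names, and the finite-difference step are all assumptions chosen here. It forms $F = \nabla h$ and estimates the divergence and the largest symmetry defect $|\partial u_j/\partial x_k - \partial u_k/\partial x_j|$ by central differences.

```python
def grad_h(x):
    """Gradient of the (assumed) harmonic polynomial
    h = x1^2 + x2^2 - 2*x3^2 + x1*x2*x3."""
    x1, x2, x3 = x
    return (2 * x1 + x2 * x3, 2 * x2 + x1 * x3, -4 * x3 + x1 * x2)


def partial(f, j, k, x, eps=1e-4):
    """Central-difference estimate of d u_j / d x_k at the point x."""
    xp, xm = list(x), list(x)
    xp[k] += eps
    xm[k] -= eps
    return (f(xp)[j] - f(xm)[j]) / (2 * eps)


def riesz_residuals(f, x, n=3):
    """Return (divergence, largest |d u_j/d x_k - d u_k/d x_j|) at x."""
    div = sum(partial(f, j, j, x) for j in range(n))
    curl = max(abs(partial(f, j, k, x) - partial(f, k, j, x))
               for j in range(n) for k in range(n))
    return div, curl


point = (0.3, -1.2, 0.7)
print(riesz_residuals(grad_h, point))                       # both entries near 0
print(riesz_residuals(lambda x: (x[0], 0.0, 0.0), point))   # divergence near 1
```

The second call uses the field $(x_1, 0, 0)$, which is a gradient but not of a harmonic function, so only the first equation of (2.2) fails for it.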

We can define the $H^p$ space ($p > 0$) associated with the upper half space
$R^{n+1}_+ = \{(x, y) = (x_1, x_2, \ldots, x_n, y) \in R^{n+1} : x \in R^n,\ y > 0\}$ to consist of all those
M. Riesz systems of conjugate harmonic functions $F = (u_1, u_2, \ldots, u_{n+1})$ defined
in $R^{n+1}_+$ which satisfy

(2.3) $$\left(\int_{R^n} |F(x, y)|^p\,dx\right)^{1/p} \le A < \infty$$

for all $y > 0$, where $|F| = (u_1^2 + u_2^2 + \cdots + u_{n+1}^2)^{1/2}$. We see that these spaces also
furnish us with a simple direct generalization of those introduced at the end of
the last section. Unfortunately, we do not obtain a complete extension of (1.7).
The best result that has been established on the existence of boundary values for
$F \in H^p(R^{n+1}_+)$ (see [12]) is that for $p \ge (n-1)/n$

(2.4) $$\lim_{y \to 0} F(x, y) = F(x)$$

exists for a.e. $x \in R^n$ and

(2.5) $$\lim_{y \to 0} \left(\int_{R^n} |F(x, y) - F(x)|^p\,dx\right)^{1/p} = 0$$

when $p > (n-1)/n$.

The significant feature of this result is that we do obtain boundary values
for indices $p \le 1$. (If $\infty > p > 1$ and $u$ is a harmonic function on $R^{n+1}_+$ satisfying
$(\int_{R^n} |u(x, y)|^p\,dx)^{1/p} \le A < \infty$ for all $y > 0$ then the boundary values $\lim_{y \to 0} u(x, y) = u(x)$
exist for almost every $x \in R^n$ and $\lim_{y \to 0} (\int_{R^n} |u(x, y) - u(x)|^p\,dx)^{1/p} = 0$.
When $p = 1$, the last equality fails in general but the almost everywhere defined
“vertical” boundary values do exist. When $p < 1$ even these boundary values
may fail to exist (see [11]). By considering a system of conjugate harmonic functions
the results we have just announced tell us that we obtain boundary values
for $p$ below 1, the index which is so significant in determining the boundary
behavior of a single harmonic function.) For example, this fact allows us to establish
yet another extension of the theorem of F. and M. Riesz (see [12] for a
formulation of this extension). A study of the proof of (2.4) and (2.5) shows that
the basic reason for being limited to values $p \ge (n-1)/n$ is that a system
$F = (u_1, u_2, \ldots, u_{n+1})$ satisfying equations (2.2) has the property that $|F|^p$ is
subharmonic for $p \ge (n-1)/n$. In our case this is equivalent to saying that
$\Delta |F|^p \ge 0$ whenever $F(x, y) \ne 0$, where $\Delta$ is the Laplace operator
$(\partial^2/\partial x_1^2) + \cdots + (\partial^2/\partial x_n^2) + (\partial^2/\partial y^2)$. It turns out that the range of indices $p$ for which
$|F|^p$ is subharmonic is larger for other systems of conjugate harmonic functions.
For each such system we have corresponding $H^p$ classes, defined by (2.3), and
boundary value results (2.4) and (2.5). We shall end this exposition by presenting
some of these systems.

We introduced the M. Riesz system as a gradient $\nabla h = F = (u_1, u_2, \ldots, u_n)$
of a harmonic function $h$. By considering the gradient $\nabla u_j = (u_{j1}, u_{j2}, \ldots, u_{jn})$
of each of the components $u_j$, $j = 1, 2, \ldots, n$, we obtain an $n^2$-tuple
$F^{(2)} = (u_{11}, u_{12}, \ldots, u_{nn})$ which we shall call the second gradient of $h$. We can
perform this operation again on each of the components of $F^{(2)}$ and obtain the
third gradient of $h$. Continuing in this way we obtain the $k$th gradient $F^{(k)}$ of $h$.
Calderón and Zygmund [4] have shown that $|F^{(k)}|^p$ is subharmonic for
$p \ge (n-2)/(k+n-2)$. Thus, by considering $H^p$ spaces of systems of harmonic
functions on $R^{n+1}_+$ that are $k$th gradients we obtain the boundary value results
(2.4) and (2.5) for indices $p \ge (n-1)/(k+n-1)$ (observe that the dimension of
the domain is $n+1$). That is, we obtain an $H^p$ space theory for indices $p$ arbitrarily
close to 0.

These higher gradients satisfy certain partial differential equations which
can be included in a class of equations closely connected with the irreducible
representations of the rotation group SO(n) and its covering group (see [13]).
Even more general systems of partial differential equations can be written in
the form

(2.6) $$\sum_{j=1}^{n} A_j \frac{\partial F}{\partial x_j} = 0,$$

where $A_j$ is an $m \times k$ constant matrix, $F = (u_1, u_2, \ldots, u_k)$ is a $k$-tuple, and
$\partial F/\partial x_j = (\partial u_1/\partial x_j, \partial u_2/\partial x_j, \ldots, \partial u_k/\partial x_j)$. If each solution of (2.6) must be
harmonic, these partial differential equations are known as generalized Cauchy-Riemann
systems. It can be shown (see [5]) that for each such system there
exists an index $p_0 < 1$ such that, if $F$ is a solution then $|F|^p$ is subharmonic for
$p \ge p_0$. It would be interesting to know which of these generalized Cauchy-Riemann
systems are the ones known to be associated with the irreducible
representations mentioned above.

CRITICAL POINTS AND CURVATURE FOR EMBEDDED
POLYHEDRAL SURFACES 


T. F. BANCHOFF, Brown University 


The Gauss-Bonnet theorem for a surface $M^2$ in Euclidean 3-space $E^3$ and
the Critical Point Theorem for height functions on an embedded surface are
two of the earliest and most important theorems of “geometry in the large.”
Both theorems relate geometric properties of the embedded surface (the total
curvature of the surface or the sum of a set of geometrically defined indices of
singularity) to a topological property of the surface, the Euler-Poincaré characteristic
$\chi(M^2)$. Both theorems are very geometric in character despite the
fact that the standard definitions of total curvature and index of singularity
appear to involve the use of differential calculus and the hypothesis that the
surfaces are smooth. In fact both theorems have analogues for polyhedral surfaces
embedded in $E^3$, and the proofs in the polyhedral case are entirely elementary.

Some of the most interesting results in global geometry have exploited the
connection between total curvature and critical point theory, as in the work of
Kuiper [4]. In this paper we shall follow this same procedure to prove the
critical point theorem and use it to prove the Gauss-Bonnet theorem. In the
polyhedral case, a new feature is an interpretation of the Theorema Egregium
of Gauss which relates the extrinsic curvature to the intrinsic curvature on a
surface.

Fig. 1. Height function on a torus $T^2$: $i(p_2, \xi) = -1$, $i(p_3, \xi) = -1$, $i(p_4, \xi) = 1$, $\chi(T^2) = 0$.

Thomas Banchoff wrote his Berkeley thesis under Chern in 1964. He was a Peirce Instructor
at Harvard, a Fulbright Research Associate at the University of Amsterdam, and presently is an
Assistant Professor at Brown. His research is in global differential geometry. Editor.

All of the theorems in this paper have appeared in a generalized (and tech- 
nical) form in the author’s paper [2]. 


1. The Critical Point Theorem. Consider a closed smooth surface $M^2$ embedded
in $E^3$ and consider a linear function $\xi$ on $E^3$ given by projecting all of
$E^3$ to the line determined by a unit vector $\xi$. A point $p$ of $M^2$ is said to be a
critical point for $\xi$ if the tangent plane to $M^2$ at $p$ is perpendicular to $\xi$; all other
points of $M^2$ are called ordinary points for $\xi$. In the “standard” example of a
height function on a torus of revolution held vertically there are just four
critical points, a maximum, a minimum, and two (nondegenerate) saddle points.

The Critical Point Theorem for height functions states that if $\xi$ has a finite
number of critical points on $M^2$ and all are of the three types described above,
then (number of local maxima) + (number of local minima) − (number of saddle
points) = $\chi(M^2)$, where $\chi(M^2)$ is the Euler-Poincaré characteristic of $M^2$.

We express this theorem more succinctly by indexing each critical point by
$i(p, \xi) = 1$ if $p$ is a local maximum or minimum and $i(p, \xi) = -1$ if $p$ is a (nondegenerate)
saddle point. The theorem then states

$$\sum_{p \text{ critical for } \xi} i(p, \xi) = \chi(M^2).$$

In classical critical point theory (= Morse Theory) the index is given by
considering the sign of the determinant of the matrix of second derivatives, as in
[5], but since we are interested in developing the polyhedral analogue of the
theorem, we proceed to give a more geometric presentation of this indexing
procedure.

If a point $q$ is ordinary for the height function $\xi$, then the tangent plane at
$q$ is not horizontal. Thus the plane through $q$ perpendicular to $\xi$ divides a “small disc neighborhood”
$U$ of $q$ on $M^2$ into exactly two pieces and it meets a “small circle” about $q$ in
precisely two points. This distinguishes an ordinary point from a local maximum
or minimum (where a “small circle” about the critical point will not meet the
tangent plane at all) and from a nondegenerate saddle point $p$ (where the plane
at $p$ perpendicular to $\xi$ meets a “small circle” about $p$ on $M^2$ in four distinct
points).

Fig. 2. Number of points in which the plane meets a “small circle”: 2, 0, 0, 4.

We may then give an arithmetic definition of the index as follows:

$i(p, \xi) = 1 - \tfrac{1}{2}$ (number of points in which the plane through $p$ perpendicular
to $\xi$ meets a “small circle” about $p$ on $M^2$).

This definition agrees with the previous indexing procedure and has the additional
property that $i(q, \xi) = 0$ if $q$ is not a critical point. In the smooth case,
however, the definition is somewhat unsatisfactory due to the difficulty of
defining precisely the notion of a “small circle.” In the polyhedral case, on the
other hand, this is exactly the sort of definition which we want.

Consider a polyhedral surface $M^2$ in $E^3$ which is expressed as a union of $V$
vertices, $E$ edges, and $T$ triangular faces. The Euler characteristic of $M^2$ is
defined to be

$$\chi(M^2) = V - E + T.$$

A height function $\xi$ on $E^3$ is said to be general for the polyhedral surface $M^2$
if $\xi(v) \ne \xi(w)$ whenever $v$ and $w$ are distinct vertices of $M^2$. If $\xi$ is general for $M^2$,
then the point $q$ is said to be ordinary for $\xi$ if the plane through $q$ perpendicular
to $\xi$ cuts the disc neighborhood Star($q$) into two pieces, where Star($q$) is the
union of all vertices, edges, and faces which include $q$. (When we say $M^2$ is a
polyhedral surface, we mean that for each point $q$, Star($q$) is the image of an
open disc in the plane under a one-to-one continuous map.) With this definition,
any point $q$ in the interior of a face or an edge has to be ordinary since no face
or edge can be perpendicular to the vector $\xi$ if $\xi$ is general for $M^2$.

Fig. 3

Figure (polyhedron $P$): $i(u_1, \xi) = 1$ and $i(u_2, \xi) = 1$ (local maxima), $i(v, \xi) = -1$ (nondegenerate saddle), $i(w, \xi) = 1$ (local minimum), $\chi(P) = 2$.

For vertices, however, there are critical points corresponding to all the types
presented for smooth functions, as, for example, in the indicated polyhedron.
We may then use the indexing procedure developed for smooth surfaces, where
instead of a “small circle” we use the embedded polygon which is the boundary
of the star of the vertex $v$. The number of times the plane through $v$ perpendicular
to $\xi$ meets this polygon is then equal to the number of triangles $\Delta$ in Star($v$)
such that one of the other two vertices of $\Delta$ lies above the plane and the other lies below.
In such a case we say that $v$ is the middle vertex of $\Delta$ for $\xi$, and we may write
the index as follows:

$i(v, \xi) = 1 - \tfrac{1}{2}$ (number of $\Delta$ with $v$ middle for $\xi$).

Fig. 5. $v$ is the middle vertex of $\Delta$ for $\xi$.

Again this definition corresponds to the definitions given in the smooth case
and it gives index 0 for an ordinary point.
The critical point theorem then states:

THEOREM 1. If $\xi$ is general for $M^2$, then

$$\sum_{v \in M} i(v, \xi) = \chi(M^2).$$




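We require a lemma on polyhedral surfaces:

LEMMA. For a polyhedral surface, $3T = 2E$.

Proof of Lemma. Since an edge in a polyhedral surface has precisely two
triangles in its star,

$3T$ = number of pairs ($\Delta$, edge of $\Delta$) = $2E$.

Proof of Theorem. If $\xi$ is general for $M^2$,

$$\begin{aligned}
\sum_{v \in M} i(v, \xi) &= \sum_{v \in M} \left(1 - \tfrac{1}{2}(\text{number of } \Delta \text{ with } v \text{ middle for } \xi)\right) \\
&= V - \tfrac{1}{2} \sum_{v \in M} (\text{number of } \Delta \text{ with } v \text{ middle for } \xi) \\
&= V - \tfrac{1}{2} T \quad \text{(since each } \Delta \text{ has exactly one middle vertex for } \xi\text{)} \\
&= V - \tfrac{1}{2}(2E - 2T) \quad \text{(since } T = 2E - 2T \text{ by the lemma)} \\
&= V - E + T.
\end{aligned}$$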
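The counting argument in this proof is easy to replay on small examples. The sketch below is an illustration only: the function names, the direction `XI`, and the two test surfaces (a regular octahedron and a 5 × 4 triangulated torus with slightly offset angles so that the heights are distinct) are assumptions made here, not constructions from the paper. For each triangle it finds the middle vertex for $\xi$, forms $i(v, \xi) = 1 - \tfrac{1}{2}$(number of $\Delta$ with $v$ middle), and prints the index sum next to $V - E + T$.

```python
import math
from itertools import combinations


def vertex_indices(points, triangles, xi):
    """Return the list of i(v, xi) = 1 - (1/2) * #(triangles with v middle)."""
    heights = [sum(c * d for c, d in zip(p, xi)) for p in points]
    if len(set(heights)) != len(heights):
        raise ValueError("xi is not general: two vertices share a height")
    middle = [0] * len(points)
    for tri in triangles:
        # The middle vertex is the one of intermediate height in the triangle.
        mid = sorted(tri, key=lambda v: heights[v])[1]
        middle[mid] += 1
    return [1 - count / 2 for count in middle]


def euler_characteristic(points, triangles):
    """V - E + T, with the edge set read off from the triangles."""
    edges = {frozenset(e) for tri in triangles for e in combinations(tri, 2)}
    return len(points) - len(edges) + len(triangles)


def octahedron():
    points = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]
    triangles = [(x, y, z) for x in (0, 1) for y in (2, 3) for z in (4, 5)]
    return points, triangles


def torus(n=5, m=4, big_r=2.0, small_r=1.0):
    """An n-by-m grid on a torus of revolution, each quadrilateral cut in two."""
    def idx(a, b):
        return (a % n) * m + (b % m)

    points = []
    for a in range(n):
        for b in range(m):
            theta = 2 * math.pi * (a + 0.1) / n
            phi = 2 * math.pi * (b + 0.2) / m
            rho = big_r + small_r * math.cos(phi)
            points.append((rho * math.cos(theta), rho * math.sin(theta),
                           small_r * math.sin(phi)))
    triangles = []
    for a in range(n):
        for b in range(m):
            triangles.append((idx(a, b), idx(a + 1, b), idx(a + 1, b + 1)))
            triangles.append((idx(a, b), idx(a + 1, b + 1), idx(a, b + 1)))
    return points, triangles


XI = (0.31, 0.52, 0.80)  # an arbitrary direction; only generality matters
for name, (pts, tris) in (("octahedron", octahedron()), ("torus", torus())):
    print(name, sum(vertex_indices(pts, tris, XI)), euler_characteristic(pts, tris))
```

Because every triangle contributes exactly one middle vertex, the index sum depends only on $V$ and $T$; changing `XI` to any other general direction moves the individual indices around but leaves the total fixed.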


REMARK. For a smooth surface $M^2$ embedded in $E^3$ it is a classical result that
for almost every unit vector $\xi$ on $S^2$ (i.e., except for a set of measure zero on $S^2$),
the height function $\xi$ has only finitely many critical points, and furthermore,
for almost all $\xi$, this height function has as critical points only local maxima
and minima, and nondegenerate saddle points. In the polyhedral case it is immediate
that almost all $\xi$ are general for $M^2$ (since the nongeneral $\xi$ lie in the
finite union of great circles $\{\xi \in S^2 \mid \xi(v) = \xi(w)\}$, one for each pair of distinct
vertices $v$, $w$). The stronger result, however, is not correct in the polyhedral case.
Consider an isolated critical point of a smooth surface which is degenerate, namely
the “monkey saddle” (so called in Hilbert and Cohn-Vossen [3], p. 191, since
a monkey riding a bicycle would need three depressions in his saddle, one for
each leg and one for his tail). Although on the smooth monkey saddle, height
functions $\eta$ near $\xi$ on $S^2$ have only nondegenerate critical points, on the
polyhedral monkey saddle of Fig. 6 any $\eta$ near $\xi$
has exactly six triangles with $v$ middle, so for each such $\eta$, $i(v, \eta) = -2$.

Fig. 6. $i(v, \xi) = -2$.

However, in the proof of Theorem 1, we never required that the indices be
only 0, 1 or −1. The proof goes through without change for any polyhedral
surface $M$ and $\xi$ general for $M$, regardless of the complexity of the stars of the
vertices of $M$.
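A vertex of index −2 can be written down explicitly. The sketch below is a hypothetical model, not the figure from the paper: the vertex $v$ sits at the origin and its star is a fan of six triangles over a hexagonal link whose heights alternate between +1 and −1. The helper name and the particular perturbed direction `eta` are assumptions made for illustration.

```python
import math


def index_at_center(link, xi):
    """i(v, xi) for v at the origin whose star is the fan of triangles
    (v, link[k], link[k+1]); v is middle when the two link heights straddle 0."""
    heights = [sum(c * d for c, d in zip(p, xi)) for p in link]
    n = len(heights)
    middle = sum(1 for k in range(n) if heights[k] * heights[(k + 1) % n] < 0)
    return 1 - middle / 2


# Hexagonal link with heights alternating +1, -1: a polyhedral monkey saddle.
monkey_link = [(math.cos(k * math.pi / 3), math.sin(k * math.pi / 3), (-1) ** k)
               for k in range(6)]

xi = (0.0, 0.0, 1.0)
eta = (0.10, -0.05, 1.0)  # a nearby direction; its length does not affect the index
print(index_at_center(monkey_link, xi), index_at_center(monkey_link, eta))
```

Both indices come out as −2: the horizontal components of `eta` shift each link height by less than 1, so the alternating sign pattern, and with it the count of six middle triangles, is unchanged.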


2. Total Curvature and the Gauss-Bonnet Theorem. The total curvature or
Gaussian curvature of a neighborhood $U$ on a smooth surface $M^2$ in $E^3$ has
several definitions which appear in texts in differential geometry. The definition
which most easily leads to an extrinsic curvature theory for polyhedra is
the one originally given by Gauss. We sketch his procedure in the smooth case
and then develop the analogous theory for arbitrary embedded polyhedral
surfaces in $E^3$.

Consider a “small” neighborhood $U_1$ on a convex surface $M^2$ in $E^3$. The
Gauss map $g : U_1 \to S^2$ is defined by setting $g(p)$ = outward unit normal vector
to $M^2$ at $p$. If the mapping $g$ restricted to $U_1$ is one-to-one, and $g$ is orientation-preserving
(outward normals at corresponding points correspond) then $U_1$ is
said to be strictly convex. The total curvature $K(U_1)$ of $U_1$ is then defined to be
the area of the spherical image $g(U_1)$ on $S^2$.

If $U_2$ is a region of a nonconvex surface $M^2$ on which $g$ is one-to-one and
orientation-reversing, then $K(U_2)$ is defined to be the negative of the area
of $g(U_2)$.

Fig. 7

We may use the index $i(p, \xi)$ to describe the total curvature of $U_i$ in these
cases. Let $d\omega$ denote the area integrand for $S^2$, so

$$\int_{S^2} d\omega = 4\pi.$$

Then the area of $g(U_1)$ is the integral over $S^2$ of the characteristic function:
$\Gamma_{g(U_1)}(\xi) = 1$ if $\xi$ is in $g(U_1)$ and 0 otherwise. But if $\xi$ is in $g(U_1)$, then the
height function $\xi$ has a critical point at a point $p$ on $U_1$. In fact, $i(p, \xi) \ne 0$ for
a point $p$ of $U_1$ if and only if $\xi$ is in $g(U_1)$ or $-\xi$ is in $g(U_1)$. We may assume
that $U_1$ is small enough that $g(U_1)$ contains no pair of antipodal points. The
total curvature may then be described as follows:

$$K(U_1) = \int_{S^2} \Gamma_{g(U_1)}(\xi)\,d\omega = \frac{1}{2} \int_{S^2} \sum_{p \in U_1} i(p, \xi)\,d\omega.$$

But the right-hand expression also serves to define $K(U_2)$, since this expression
gives the negative of the area of $g(U_2)$.

The following two paragraphs explain the procedure of defining the total 
curvature in the smooth case, and this serves to motivate the definition for the 
polyhedral analogue. In the polyhedral case, however, the technical difficulties 
concerning the convergence of the integrand do not occur. 

In order to define the total curvature of a neighborhood $U$ on which $g$ is not
one-to-one, we begin by expressing $U$ as a countable disjoint union of sets $U_i$
on which $g$ is one-to-one together with a set $V = U - \bigcup_{i=1}^{\infty} U_i$ such that $g(V)$ has
measure zero on $S^2$. Then we set $K(U) = \sum_{i=1}^{\infty} K(U_i)$ if this sum converges, and
we obtain a totally additive set function on $M^2$. This definition then coincides
with

$$K(U) = \frac{1}{2} \int_{S^2} \sum_{p \in U} i(p, \xi)\,d\omega,$$

where the integrand is well defined almost everywhere since almost every $\xi$
has only finitely many critical points.

Remark. In the case that $M^2$ is sufficiently smooth, the set function $K(U)$
is absolutely continuous with respect to the area measure $A(U)$ on $M^2$, and we
may define the point function $K(p)$ as the limit of $K(U)/A(U)$ for any collection
of neighborhoods $U_i$ of $p$ with the limit of the diameters of $U_i$ going to zero.
This function is called the Gaussian curvature at $p$, and by integrating this
function with respect to the area measure, we obtain

$$K(U) = \int_U K\,dA.$$

The classical Gauss-Bonnet theorem for embedded closed surfaces states that

$$K(M^2) = \int_{M^2} K\,dA = 2\pi\chi(M^2).$$

In the case of a polyhedral manifold $M^2$ in $E^3$, we may use the same definition
as that developed for smooth surfaces. For any open set $U$ of $M^2$, we set

$$K(U) = \frac{1}{2} \int_{S^2} \sum_{v \in U} i(v, \xi)\,d\omega.$$

When $M^2$ is a convex polyhedron, if $U$ is a neighborhood containing only one
vertex $v$, then $K(U)$ gives the measure of the exterior angle at $v$, i.e., the set of
normals to support planes to $M^2$ at $v$, and this approach has been used in the
classical theory of convex polyhedra, for example in [1].
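For a single convex vertex the integral defining $K(U)$ can be estimated by sampling directions on $S^2$. The sketch below is an illustration under stated assumptions: it uses the top vertex of a regular octahedron, a plain Monte Carlo average with a fixed seed, and helper names invented here. Since $\int_{S^2} d\omega = 4\pi$, the definition gives $K(U) = 2\pi$ times the mean of $i(v, \xi)$ over uniformly distributed $\xi$. By symmetry the six exterior angles of the octahedron share the sphere equally, so the value to compare with is $4\pi/6 = 2\pi/3$.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def index_at_vertex(v, fan, xi):
    """i(v, xi), where fan lists the pairs (a, b) such that (v, a, b) is a
    triangle of Star(v); v is middle when a and b lie on opposite sides."""
    hv = dot(v, xi)
    middle = sum(1 for a, b in fan
                 if (dot(a, xi) - hv) * (dot(b, xi) - hv) < 0)
    return 1 - middle / 2


def random_unit_vector(rng):
    """Uniform direction on S^2 obtained by normalizing a Gaussian vector."""
    while True:
        x = [rng.gauss(0.0, 1.0) for _ in range(3)]
        norm = math.sqrt(dot(x, x))
        if norm > 1e-12:
            return tuple(c / norm for c in x)


def estimate_curvature(v, fan, samples=20000, seed=0):
    """Monte Carlo estimate of (1/2) * integral over S^2 of i(v, xi)."""
    rng = random.Random(seed)
    mean = sum(index_at_vertex(v, fan, random_unit_vector(rng))
               for _ in range(samples)) / samples
    return 2 * math.pi * mean


TOP = (0, 0, 1)
RING = [(1, 0, 0), (0, 1, 0), (-1, 0, 0), (0, -1, 0)]
FAN = [(RING[k], RING[(k + 1) % 4]) for k in range(4)]
print(estimate_curvature(TOP, FAN), 2 * math.pi / 3)
```

The estimate is only statistical; with 20000 samples it should agree with $2\pi/3$ to within a few hundredths, and the same routine applies unchanged to saddle-type fans, where negative indices occur.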


Fig. 8


The expression above also yields a definition of curvature for nonconvex
vertices of saddle type as well as for the monkey saddles.

The Gauss-Bonnet theorem for embedded surfaces, smooth or polyhedral,
follows immediately:

THEOREM 2. $K(M^2) = 2\pi\chi(M^2)$.

Proof.

$$K(M^2) = \frac{1}{2} \int_{S^2} \sum_{v \in M} i(v, \xi)\,d\omega = \frac{1}{2} \int_{S^2} \chi(M^2)\,d\omega = \frac{1}{2} \chi(M^2) \int_{S^2} d\omega = 2\pi\chi(M^2).$$


3. The Theorema Egregium. Total curvature was defined for a set $U$ on a
smooth surface $M$ in a way which involved the extrinsic properties of the surface,
that is, the way $M$ is situated in $E^3$. Gauss, however, proved that in fact
$K(U)$ depends only on intrinsic properties of $U$, i.e., on properties that are
determined by measurements made along the surface not taking into account
the way the surface is situated in space, and Gauss called this result his
Theorema Egregium.

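We shall prove the analogous theorem for embedded polyhedral surfaces.
Observe first of all that for an open set $U$ of $M$,

$$K(U) = \frac{1}{2} \int_{S^2} \sum_{v \in U} i(v, \xi)\,d\omega = \sum_{v \in U} \frac{1}{2} \int_{S^2} i(v, \xi)\,d\omega,$$

so we may set

$$K(U) = \sum_{v \in U} K(v), \quad \text{where } K(v) = \frac{1}{2} \int_{S^2} i(v, \xi)\,d\omega.$$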
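THEOREM 3. $K(v)$ is intrinsic, in fact, $K(v) = 2\pi$ − (sum of interior angles at $v$
of triangles containing $v$).

Proof. Let $m(v, \Delta, \xi)$ be the function on $E^3$ defined by $m(v, \Delta, \xi) = 1$ if $v$
is the middle vertex of $\Delta$ for $\xi$ and $m(v, \Delta, \xi) = 0$ otherwise. Then
$i(v, \xi) = 1 - \tfrac{1}{2} \sum_{\Delta \in M} m(v, \Delta, \xi)$ and

$$K(v) = \frac{1}{2} \int_{S^2} i(v, \xi)\,d\omega = \frac{1}{2} \int_{S^2} \Bigl(1 - \frac{1}{2} \sum_{\Delta \in M} m(v, \Delta, \xi)\Bigr)\,d\omega$$

$$= \frac{1}{2} \int_{S^2} d\omega - \frac{1}{4} \sum_{\Delta \in M} \int_{S^2} m(v, \Delta, \xi)\,d\omega = 2\pi - \sum_{\Delta \in M} \frac{1}{4} \int_{S^2} m(v, \Delta, \xi)\,d\omega.$$

The proof is then complete once we establish the following lemma:

LEMMA. $\int_{S^2} m(v, \Delta, \xi)\,d\omega = 4$ (interior angle of $\Delta$ at $v$).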


Proof. First of all, observe that for vectors $\eta$ at $v$ in the plane $E^2$ containing
$\Delta$, $m(v, \Delta, \eta) = 1$ if and only if $\eta$ lies in the region between the lines perpendicular
to the edges $u - v$ and $w - v$, and the angle which determines this region is
equal to the interior angle of $\Delta$ at $v$. Any vector $\xi$ of $S^2$ may be written uniquely
as $\xi = \eta + \zeta$ where $\eta$ lies in the plane $E^2$ containing $\Delta$ and $\zeta$ is perpendicular
to $E^2$. Then $m(v, \Delta, \xi) = 1$ if and only if $\xi \cdot (u - v) > 0 > \xi \cdot (w - v)$ or
$\xi \cdot (u - v) < 0 < \xi \cdot (w - v)$, so $m(v, \Delta, \xi) = 1$ if and only if $m(v, \Delta, \eta) = 1$. Thus the set of
$\xi$ on $S^2$ centered at $v$ for which $m(v, \Delta, \xi) = 1$ forms a double lune with axis
perpendicular to $E^2$ and with each angle equal to the interior angle of $\Delta$ at $v$.
But the area of a lune is twice the angle of the lune, so

$$\int_{S^2} m(v, \Delta, \xi)\,d\omega = 4\,(\text{interior angle of } \Delta \text{ at } v).$$

Fig. 10


This completes the proof of the Theorema Egregium for polyhedra, no matter
how complicated the vertex stars are. Compare Hilbert and Cohn-Vossen
([3], p. 195) for a similar argument for vertices corresponding to nondegenerate
saddle points.
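Theorem 3 turns $K(v)$ into a quantity computable from edge lengths alone, and Theorem 2 then predicts the total. The following sketch is an illustration with assumed helper names and an assumed test surface (a regular octahedron); it computes $K(v) = 2\pi$ − (sum of interior angles at $v$) for every vertex and compares the sum with $2\pi\chi(M^2)$.

```python
import math
from itertools import combinations


def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def interior_angle(p, q, r):
    """Angle at p in the triangle with vertices p, q, r."""
    a, b = sub(q, p), sub(r, p)
    cosine = dot(a, b) / math.sqrt(dot(a, a) * dot(b, b))
    return math.acos(max(-1.0, min(1.0, cosine)))


def vertex_curvatures(points, triangles):
    """K(v) = 2*pi - (sum of interior angles at v), for every vertex v."""
    curv = [2 * math.pi] * len(points)
    for tri in triangles:
        for k in range(3):
            v, u, w = tri[k], tri[(k + 1) % 3], tri[(k + 2) % 3]
            curv[v] -= interior_angle(points[v], points[u], points[w])
    return curv


def euler_characteristic(points, triangles):
    edges = {frozenset(e) for tri in triangles for e in combinations(tri, 2)}
    return len(points) - len(edges) + len(triangles)


# Regular octahedron: six vertices, eight equilateral triangles.
POINTS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]
TRIANGLES = [(x, y, z) for x in (0, 1) for y in (2, 3) for z in (4, 5)]

curvatures = vertex_curvatures(POINTS, TRIANGLES)
print(curvatures)
print(sum(curvatures), 2 * math.pi * euler_characteristic(POINTS, TRIANGLES))
```

Each octahedron vertex meets four equilateral triangles, so every $K(v)$ equals $2\pi - 4\pi/3 = 2\pi/3$, and the six values add up to $4\pi = 2\pi\chi$ with $\chi = 2$.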

REMARK. The theorems of section 2 may be considered as a generalization
of the approach of G. Pólya for embedded convex polyhedral discs [6].

FUNDAMENTA MATHEMATICAE: AN EXAMINATION
OF ITS FOUNDING AND SIGNIFICANCE 


SISTER MARY GRACE KUZAWA, Holy Family College, Philadelphia 


Fifty years ago in Poland, a small but determined group of mathematicians 
launched a daring project—a specialized mathematical journal, FUNDAMENTA 
MATHEMATICAE. Intuitively aware that the mathematical works of the late 
nineteenth, and the early years of the twentieth centuries, were but a forecast 
of tremendous and triumphant achievements to come, many scholars imme- 
diately recognized the value of the mathematical research incorporated in the 
pages of a journal. 

One of the first scholars to grasp and to acknowledge the value of the 
journal issuing from Poland was the greatly respected editor of the AMERICAN 
MATHEMATICAL MONTHLY, Raymond Clare Archibald. As early as September 
1921 he wrote: “Of the ten mathematical periodicals started since January 1919, 
none are of such notable importance for mathematical research as FUNDAMENTA 
MATHEMATICAE of which two volumes have been published: the first (224 pages) 
in 1920, the second (287 pages) in 1921, before May 1” [1]. 


Growing awareness of the Journal. Consequently, during the years from 
1920 to 1939, the publication FUNDAMENTA MATHEMATICAE was widely read 
and generally acclaimed to be a major achievement by the mathematical ex- 
perts. When at the end of 1935 the twenty-fifth volume of the journal came off 
the presses, both Polish and foreign scholars greeted the journal with considerable
enthusiasm. The Polish newspaper, WIADOMOŚCI LITERACKIE (Literary
News) hailed the appearance of this volume as a remarkable event, “heralding
a holiday for Polish mathematics” [2]. THE BULLETIN OF THE AMERICAN
MATHEMATICAL SOCIETY observed that “the twenty-fifth volume of FUNDAMENTA
MATHEMATICAE represents a notable event in the mathematical life of
the whole world” [3].

Throughout the ensuing years the AMERICAN MATHEMATICAL MONTHLY
kept its readers informed about the research emanating from Poland either 
through comments or reviews of works published. It is fitting and proper, 
therefore, that today, fifty years and sixty-three volumes later, the MONTHLY 
presents for a more thoughtful consideration, a brief history of the journal. It is 
sincerely hoped that the following pages will reveal to the younger generation 
of mathematicians that FUNDAMENTA MATHEMATICAE was and is a unique 
periodical which attracted international recognition and cooperation, and that 
it had considerable influence on the development of the modern theory of func- 
tions and point sets. 


Founder’s concept of scope and content. In 1920, Zygmunt Janiszewski, 
Stefan Mazurkiewicz, and Wacław Sierpiński, professors of mathematics at the
University of Warsaw, founded the publication. The idea of initiating a spe- 
cialized periodical with articles written in French, English, German, and Italian 
and explicitly devoted to one or two branches of mathematics was due chiefly to 
the gifted mathematician, Zygmunt Janiszewski. 

Nowhere is the originality and insight of Janiszewski more apparent than 
in the article of 1918, On the Needs of Mathematics in Poland, wherein this idea 
was first expressed. For with bold strokes of the pen, Janiszewski not only 
sketched a detailed plan for a school of mathematics but, what is more impor- 
tant, ensured its existence and success by proposing this daring and novel ven- 
ture, a specialized publication. Deeply convinced of the journal’s feasibility, he 
assured his countrymen in the article that a publication of this kind would be 
indispensable to mathematicians specializing in the branches covered by the 
journal and that not only would the work gain readers but also respected 
co-workers from abroad. Janiszewski underscored the fact that “the very 
existence and distribution of such a journal would assure the world of our cul- 
tural life” [4]. 

This account makes it obvious that Janiszewski envisioned a publication in- 
ternational in scope, one in which not only would articles be written in varied 
languages but to which foreign mathematicians would be encouraged to submit 
their original studies. 


Appearance of first volume. The first volume of FUNDAMENTA MATHE- 
MATICAE, however, deviated slightly from the proposed plan in that it carried 
articles written only by Polish mathematicians. This departure from the original 
objectives called attention to the fact that there was no lack of creative Polish 
mathematicians who were able not only to initiate such a specialized publica- 
tion but also to ensure its continuance. Janiszewski planned this strategy. “My 
aim,” he wrote, “is to introduce all the Polish mathematicians who specialize in 
the theory of sets in the very first volume of this publication” [5]. 

Janiszewski’s untimely death on January 3, 1920, deprived him of enjoying
the realization of the publication. But Stefan Mazurkiewicz and Wacław Sierpiński,
who collaborated with Janiszewski in the project and who were imbued
with the same ideas and ideals as Janiszewski, completed, edited, and published
the first volume of FUNDAMENTA MATHEMATICAE that year.

The first volume opened with a portrait frontispiece and a brief sketch of 
Janiszewski. Twenty-four articles dealing with the theory of sets and related 
questions by ten authors comprised the bulk of the publication. An added inno- 
vation expressly challenging the mathematicians, a set of ten original problems 
contributed by specialists in the field, concluded the 224-page journal. 

Immediately upon publication, the journal succeeded in attracting the in- 
terest of mathematicians throughout the world. Some noted that “of the ten 
mathematical periodicals started since January, 1919, none are of such notable 
importance for mathematical research as FUNDAMENTA MATHEMATICAE” [6]. 
There were others, however, who greeted this venture with some scepticism; 
among them was a leading mathematician of the period, Henri Lebesgue. In a 
letter to Professor Sierpiński, Lebesgue praised the papers in the first volume but
expressed his strong doubt as to the possibility of obtaining enough material to 
maintain such a highly specialized journal as a going concern [7]. 


Lebesgue and the Journal. Despite his doubts, Lebesgue nevertheless con- 
tributed his article Sur les Correspondances entre les points de deux espaces to the 
second volume of FUNDAMENTA MATHEMATICAE, published in April 1921. After 
the appearance of this volume, Lebesgue also wrote a twelve page article, A 
Propos d’Une Nouvelle Revue Mathématique: FUNDAMENTA MATHEMATICAE, for 
the Bulletin des Sciences Mathématiques. 

Interesting to observe are some of the comments contained in Lebesgue’s 
article. Here is how Lebesgue describes the emergence of the Polish School on 
the International scene: 

“In 1919, among the events which became known, is that there happens to exist in Warsaw a
small group of mathematicians daring enough to venture to produce a novel mathematical periodical
and sufficiently in love with their science to find time to work in a manner so successful that the
two volumes of FUNDAMENTA now published are almost entirely filled with their own treatises” [8].


(Besides Lebesgue’s article, the second volume carried an article by Hans Hahn 
of the University of Bonn, and one by Nicolas Lusin of the University of Mos- 
cow.) 

Continuing the discussion of FUNDAMENTA MATHEMATICAE, Lebesgue quoted 
the editorial note, “that the journal is devoted to the theory of sets and the 
questions related to it; the immediate applications of the theory of sets, analysis 
situs, mathematical logic, the foundations of mathematics.” 


Journal’s goal in concentration. For the sake of clarity and completeness of
information it is necessary to digress here. Up to the year 1928, the editorial
staff of FUNDAMENTA MATHEMATICAE consisted not only of Stefan Mazurkiewicz
and Wacław Sierpiński but also of Jan Łukasiewicz, an internationally
recognized authority on mathematical logic, and Stanisław Leśniewski, who were
to be responsible for the articles on mathematical logic and the foundations of
mathematics. It was the inclusion of these last two mentioned fields into the
early plan of the publication that induced the founders to name the journal
FUNDAMENTA MATHEMATICAE. Also, a particular manner of the presentation of
materials in the journal was agreed upon: the first volume would be devoted
exclusively to the theory of sets and the second would present articles treating
mathematical logic and the foundations of mathematics. This alternating pattern
was to be a characteristic feature of the FUNDAMENTA MATHEMATICAE.
However, after much deliberation, the editors of the publication decided that 
there was yet an insufficient number of Polish mathematicians specializing in 
the second area. The original plan was abandoned. In the final layout, the theory 
of sets and questions relating to it—topology and the theory of the real func- 
tions—became the subjects of concentration for the journal. 

More to the purpose of the study, however, is Lebesgue’s shrewd observa- 
tion that “the theory of sets has been placed outside of the field of mathematics 
long enough by the high priests of the theory of analytic functions” [9]. He further
commented that “this ostracism is now abating due to the fact that the theory 
of sets which branched from the theory of analytic functions is proving itself 
useful to its older sister and is convincing people of good faith of its value and 
riches” [10]. 

From the preceding citations it can be deduced that in 1922, the date of 
Lebesgue’s article, the theory of sets was still in its early stages. It is also clear 
that an early venture into the theory of sets by a publication such as FUNDA- 
MENTA MATHEMATICAE would most likely have the opportunity to play some 
part in the development of the new mathematical discipline. 


FUNDAMENTA MATHEMATICAE effect international reverberations. Perhaps 
it may still be too early to assess precisely the role played by the individual 
mathematicians in the enormous evolution occurring today in the mathematical 
sciences and being caused by the new methods associated with set theory and its 
related fields. But surely it is not too soon to assert that there is sufficient evi- 
dence that the important contributions to the above fields in the FUNDAMENTA 
MATHEMATICAE played an important role in the great upsurge of set theory and 
its related fields that began in the early twenties and is continuing today. 

Some means of recognition of this fact is provided by the references to the 
FUNDAMENTA MATHEMATICAE found in many of the scholarly treatises on set 
theory and its allied fields. For example, in Abstract Set Theory (1953), Abraham 
Fraenkel, professor at the University of Jerusalem, listed in his bibliography 
over one hundred references from the FUNDAMENTA MATHEMATICAE. Professor 
Solomon Lefschetz of Princeton University referred to thirty-five articles ap- 
pearing in various volumes of FUNDAMENTA MATHEMATICAE in his book Topics 
in Topology. Twenty-four references are alluded to in Set Topology (1960) by 
R. Vaidyanathaswamy, professor at the University of Madras, India, while the 
outstanding Dutch mathematician, Adriann C. Zaanen of the University of 
Amsterdam, includes eleven references from FUNDAMENTA MATHEMATICAE in 
his Theory of Integration. 

Another detail of evidence of the role of FUNDAMENTA MATHEMATICAE in the
progress of set theory and its related fields is the fact that not only did mathe- 
maticians of other countries draw upon the findings in the journal but they also 
contributed many original studies to the publication. Thus, the twenty-fifth 
volume of FUNDAMENTA MATHEMATICAE, which was twice as large as the other 
volumes (597 pages), had among its contributors: Alexandroff of Moscow, Borel 
of Paris, Zermelo of Freiburg, Hardy and Littlewood of Cambridge, Hausdorff 
of Bonn, Hurewicz of Amsterdam, Fraenkel of Jerusalem, Vietoris of Innsbruck,
Čech of Brno, Hille of Yale, von Neumann of Princeton, Moore of Austin, and
others. 


FUNDAMENTA MATHEMATICAE and Set Theory. More convincing and perhaps 
even greater evidence of the part played by FUNDAMENTA MATHEMATICAE in 
the upsurge of set theory and its related fields are the following facts. 

Between 1920 and 1935, one hundred and seventy authors, two-thirds
of whom were scholars of countries other than Poland, contributed 732
scholarly papers. Four years later, with the publication of the thirty-second 
volume (1939), the numbers increased to 972 works by two hundred and sixteen 
authors. This constituted an increase of 240 works and forty-six authors over 
the four-year period. Surely, this is an excellent indication of the growing in- 
terest in set theory and its related fields. 

Once the interest of the creative mathematicians was aroused, from all parts 
of the world a seemingly endless, constant flow of excellent and scholarly articles 
on set theory and its related fields was directed to the offices of FUNDAMENTA 
MATHEMATICAE. As a result, during the summer of 1939 the thirty-third volume 
was under preparation. Much of the volume was already set up in type and the 
remaining articles were being processed. And then, on September 1, 1939, 
Germany invaded Poland. 


War severs continuity, progress. World War II—history’s most terrible 
conflict—began. The savage, six-year struggle was greatly detrimental to the 
mathematical progress and creativity in Poland. 

The thirty-third volume of FUNDAMENTA MATHEMATICAE never came off the 
presses. The German director of the printery, Walter Hande de Dresde, had the 
plates destroyed and the galleys and remaining manuscripts burned. On Septem- 
ber 1, 1942, fires which broke out all over Warsaw completely destroyed the 
offices of FUNDAMENTA MATHEMATICAE and the mathematical holdings at the 
University of Warsaw. Nothing remained of the archives and the first editions 
of FUNDAMENTA MATHEMATICAE. The methodical block-by-block destruction of 
the city of Warsaw from August to December of 1944 by the Brandkommando 
resulted in the loss not only of the private libraries of the Polish mathematicians 
living in Warsaw but also of their manuscripts containing the original results 
produced during the period 1939-1944. 

Most damaging to the field of mathematics, however, were the vile and in- 
human acts perpetrated by the Nazis. The systematic and brutal slaughter of the 
inhabitants deprived Poland of some of her finest mathematicians. In 1940,
Aleksander Rajchman perished in the concentration camp at Dachau; Stefan
Kempisty died in a prison camp. Victims of a surprise dragnet, four outstanding
mathematicians, Antoni Łomnicki, Stanisław Ruziewicz, Włodzimierz Stożek,
and Julius P. Schauder, were among the more than thirty professors from the
Lwów area shot by the ruthless Gestapo in July, 1941. November, 1942, marked
the murder by the Nazi conquerors of Stanisław Saks, the well-known author
of the mathematical classic, Théorie de l’Intégrale. The gas chamber at Treblinka
(January 1943) claimed the life of Józef Zalcwasser [11].

Of the 100 prominent creative mathematicians, 62 were murdered or died 
from sickness, starvation, or disease in the enemy concentration camps [12]. 
There were also about 32 younger and promising mathematicians (who had not 
yet attained world renown) who perished during the war. Among the latter, 
Józef Marcinkiewicz, who died in a prison camp in 1940 at the age of thirty,
was endowed with extraordinary mathematical talent. Examining some of the
mathematical findings left by Marcinkiewicz, some mathematicians, among them
Antoni Zygmund, claim that if Marcinkiewicz had lived and continued to work he
would surely have become one of the leading mathematicians of today [13].

Some mathematicians, fortunately, did survive the concentration camps. 
These relate that the study of mathematics continued in Poland in spite of hard- 
ships and unfavorable circumstances. One of those more active in teaching 
mathematics in the underground was Professor Kazimierz Zarankiewicz, who 
in 1944 was sentenced to hard labor in the heart of Germany. Prison walls could 
not and did not halt the imprisoned Professors from creating and exchanging 
ideas. Professor Stanisław Urbańczyk of the University of Cracow in his book,
Uniwersytet za Kolczastym Drutem (A University Behind Barbed Wires), credits 
the survival of many of the intelligentsia to the intellectual life and cultural 
atmosphere which the professors had created for themselves. (The enemy was 
completely unaware of the lectures and discussions which kept up the morale of 
the war victims.) [14]. 

The coming of peace in 1945 found much of Poland reduced to rubble and 
ashes. The uncertainties of the postwar era and the fear of communism seemed 
to toll a death knell for the mathematical field in Poland. But, surprisingly, no 
sooner was peace declared, than FUNDAMENTA MATHEMATICAE appeared, con- 
taining articles just as creative and promising for further research as those found 
in the preceding volumes. 

If credit is to be given the individuals who contributed most to this reactivation,
the competence, courage, and devotion of the great Polish mathematicians
Wacław Sierpiński, Kazimierz Kuratowski, and Karol Borsuk must be recognized.
As editors of FUNDAMENTA MATHEMATICAE, they dedicated the thirty-third
volume to the memory of their colleagues,
contributors to the journal, who perished during the war. On the same page of
the journal the editorial staff announced the loss of one of their great
mathematicians, Stefan Banach, who died on August 31, 1945. Banach was greatly
affected by the atrocities committed by the Nazis. When peace came, it found 
him a broken-hearted man for whom life no longer held a promise. Also of special 
note to the reader in the thirty-third volume are the 115 pages which contain 
work of the Polish mathematicians which evolved secretly during the war years. 

The mathematical world did not expect FUNDAMENTA MATHEMATICAE in 
1945. The sentiments of the mathematicians regarding the journal’s continuance 
are best expressed by Professor Paul Aleksandrov of Russia. Upon receiving a 
copy of the publication, Aleksandrov wrote to the Polish mathematicians that 
he was greatly aggrieved by the unfortunate disasters that plagued the progress 
of mathematics in Poland. In fact, he wondered if the journal would survive the 
tragedy. Subsequently, he observed that this early and excellent publication of 
the periodical “delights me very much for it is a symbol of triumph of the eternal 
ideals of man’s culture” [15]. Unquestionably the thirty-third volume proves a 
monument to the undaunted spirit that characterized and characterizes the 
Polish mathematicians.


Conclusion. Although the continuity of publication, the international scope 
of its author-contributors, and the quantity of articles representing a spectrum 
in mathematical research are most impressive, perhaps the greatest claim to its 
significance is the excellence of the scholarly works which gained worldwide 
recognition for FUNDAMENTA MATHEMATICAE. Some of the more outstanding 
treatises helped significantly in the development of the mathematical literature 
of the world. A few from among these are: Lebesgue’s Sur les correspondances
entre les points de deux espaces (Volume II), which laid the foundations for measure
theory; Banach’s Sur les opérations dans les ensembles abstraits et leur application
aux équations intégrales (Volume III), which initiated a new mathematical
discipline, functional analysis; Urysohn’s Mémoire sur les multiplicités Cantoriennes
(Volumes VII and VIII), which presented all the fundamental results in measure
theory; and Lusin’s Sur les ensembles analytiques (Volume X), which embodied
all the principal results of the theory of analytic functions [16].


Journal remained an unchanging clearinghouse. At this point it should also 
be noted that collaboration in mathematical research was always encouraged by 
the editors of FUNDAMENTA MATHEMATICAE. As a result, a great number of 
studies in the journal were and are the work of two or even three scholars, not 
necessarily from the same mathematical centers. For example, in the twenty-fifth 
volume, the article Note on the differentiability of multiple integrals was authored
by B. Jessen of Copenhagen, J. Marcinkiewicz and A. Zygmund of Poland. 

In concluding the discussion of FUNDAMENTA MATHEMATICAE, a final but 
important observation must be made. The excellent planning and the extra- 
ordinary cooperation of the members of the Polish School of Mathematicians 
produced a journal which in general retained its original format in the 63 vol- 
umes published. 

Two changes that occurred were necessitated by growth. Commencing with
1929, studies on functional analysis appeared in the newly founded periodical,
STUDIA MATHEMATICA, which became an outstanding publication in that field.

The section Problèmes was assumed by the Scottish Book, a notebook totally
devoted to problems. The book, placed in a Lwów café, was at the disposal of
every mathematician who demanded it. Sharing Hilbert’s view that the formula- 
tion of a problem is halfway to its solution, the Poles were not only fond of 
problems but were able to devise very challenging ones. In 1947, the Paris pub- 
lication INTERMEDIAIRE DES RECHERCHES MATHEMATIQUES reprinted all the 
problems appearing in the thirty-two volumes published in FUNDAMENTA 
MATHEMATICAE. It added: “This publication, specializing in the theory of sets, 
has contributed very strongly to the progress of modern mathematics” [17]. 

In concluding, there is need to add that although so many great Polish
mathematicians perished during the war, mathematics is flourishing. At
present in Poland there are 500 creative mathematicians who contribute on the 
average about 400 creative mathematical works per year to the nine mathema- 
tical journals [18]. There is no doubt that the Polish mathematicians contri- 
buted significantly and continue to contribute strongly to the progress of 
modern mathematics. 

THE JEEP ONCE MORE OR JEEPER BY THE DOZEN
DAVID GALE, University of California at Berkeley 


1. Introduction. In 1947, N. J. Fine solved the by now famous problem of 
the jeep [1]. We recall that the problem concerns a jeep which is able to carry 
enough fuel to travel a distance d, but is required to cross a desert whose dis- 
tance is greater than d (for example 2d). It is to do this by carrying fuel from its 
home base and establishing fuel depots at various points along its route so that 
it can refuel as it moves further out. It is then required to cross the desert on the 
minimum possible amount of fuel. 

Our purpose here is first to give a very short derivation of the solution of the 
jeep problem which makes use of a theorem (also famous) of Banach [2]. 
Second, we consider the situation in which it is required to send a jeep across 
the desert every day, say, for a week. Of course, having found the best procedure 
for a single jeep, one could simply repeat this seven times. We shall show, how- 
ever, that there is a more economical procedure for the case of several jeeps, and 
in general that the more jeeps one sends across, the lower fuel consumption per 
jeep. This phenomenon is an example of what economists refer to as “increasing 
returns to scale,” a subject of some economic interest. (It also accounts for the 
subtitle of this article, for which I apologize herewith.) 


2. The problem. To formalize the problem, let us assume that the jeep 
starts from the origin and moves along the positive x-axis. We choose for the 
unit of fuel the maximum amount which the jeep can carry, and refer to this 
unit as a load. The unit of distance will then be chosen as the distance the jeep 
can travel on one load. 

Figure 1 below gives a schematic representation of a typical jeep’s journey. 


[FIG. 1: schematic path of a typical jeep’s journey]


The wiggly path represents the jeep’s travel. Of course, actually the path lies 
entirely on the x-axis. It has been stretched vertically simply to make it visible. 
Note that because of our choice of units, the length of this path is precisely equal 
to the amount of fuel consumed. In the figure, the jeep reaches a point at
distance d from the origin. The original jeep problem asks for the minimum fuel
consumption (hence path length) which will allow the jeep to reach this point.
It turns out to be somewhat more convenient to turn the problem around (see
e.g. [3]) and obtain a formula for the function d(f) giving the farthest point
which the jeep can reach on f loads of fuel. Our first aim is to prove the formula

\[ d(f) = 1 + \frac{1}{3} + \frac{1}{5} + \cdots + \frac{1}{2f-1}. \tag{1} \]

The key idea in our solution is to make use of a formula of Banach for the
path length of a curve in one dimensional space. (Path length in one-space is
usually referred to as total variation; we prefer the geometric terminology as
being more suggestive in the present context.) To utilize Banach’s Formula we
define for each point x on the interval [0, d], the valance n(x) as the number of
times during its journey that the jeep is at the point x. Figure 2 above gives the
graph of the valance n(x) corresponding to the jeep’s journey plotted in Figure 1.
Strictly speaking, if one allowed very general paths, n(x) might be infinite for 
some points. This would not affect Banach’s Formula, which states 


\[ \text{total path length} = \int_0^d n(x)\,dx. \]


Of course, Banach was not concerned with jeeps. He considered a continuous real
valued mapping $x = \varphi(t)$, and n(x) was simply the number of points which
mapped onto the point x, or the cardinality of $\varphi^{-1}(x)$. We can also think of the
problem this way if we take the wiggly line of Figure 1 to be the graph of the 
jeep’s position plotted, say, as a function of the time. 

For “reasonable” paths, Banach’s Formula is obvious. A reasonable path 
for a jeep is one with a finite number of points at which it reverses direction (it 
would be a remarkable jeep indeed that could execute an unreasonable path). 
To prove the formula, partition [0, d] into sets $X_1, X_2, \ldots$, where

\[ X_k = \{\, x \mid n(x) = k \,\}. \]

Because of reasonableness, there are only finitely many nonempty $X_k$, and each
of them is a union of disjoint intervals. (Some of them may consist of single
points. In the case of the jeep’s tour, this will be the case whenever k is even. Do
you see why?) Over each interval of $X_k$ lie exactly k intervals of the jeep’s path.
Therefore, 


\[ \text{total path length} = \sum_{k=1}^{\infty} k \cdot (\text{length of } X_k). \]

But the term on the right is precisely the definition of $\int_0^d n(x)\,dx$. (Notice that this
is the definition of the integral in the sense of Lebesgue rather than Riemann! 
Of course for continuous functions, and in particular for reasonable functions, 
the two concepts are equivalent.) We remark that Banach’s formula holds for 
unreasonable as well as reasonable paths (though the general proof is fairly in- 
volved), and hence our solution of the jeep problem will hold for unreasonable 
as well as reasonable jeeps. 
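
For a "reasonable" path the formula can be checked mechanically. The following is only a minimal sketch, assuming the tour is given as the list of points at which the jeep starts, reverses, and stops; the function names and the sample tour are illustrative and are not taken from the article.

```python
from fractions import Fraction

def path_length(turning_points):
    """Total distance travelled along a piecewise-linear path on the x-axis."""
    legs = zip(turning_points, turning_points[1:])
    return sum(abs(b - a) for a, b in legs)

def valence_integral(turning_points):
    """Integral of n(x) over the range of the path.

    Between two consecutive breakpoints n(x) is constant and equals the
    number of legs of the journey that pass over that interval.
    """
    legs = list(zip(turning_points, turning_points[1:]))
    cuts = sorted(set(turning_points))
    total = 0
    for lo, hi in zip(cuts, cuts[1:]):
        n = sum(1 for a, b in legs if min(a, b) <= lo and hi <= max(a, b))
        total += n * (hi - lo)
    return total

# Illustrative tour: out to 1/3, back to the origin, out to 1/3 again, on to 4/3.
tour = [Fraction(0), Fraction(1, 3), Fraction(0), Fraction(1, 3), Fraction(4, 3)]
print(path_length(tour), valence_integral(tour))
```

Both quantities agree for this tour, as Banach's Formula requires; exact rational arithmetic is used so that the comparison is not blurred by rounding.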


3. The solution for one jeep. We return now to the problem of computing 
d(f), and assume for the moment that f is an integer. We wish to determine how
far the jeep can get on f loads of fuel. For any jeep’s tour, we define the sequence
of points $x_0, x_1, \ldots, x_f$ on the interval [0, d] where $x_f = 0$, $x_0 = d$, and in general
$x_k$ is the point such that the total path length (hence fuel consumption) to the
right of $x_k$ is exactly k units. Clearly the points $x_k$ form a strictly decreasing
sequence and there will be exactly one unit of path length between $x_{k+1}$ and $x_k$. The
basic observation we need is the following:


LEMMA 1. If $x < x_k$, then
\[ n(x) \ge 2k + 1. \tag{2} \]

Proof. Since x is to the left of $x_k$, the jeep must consume more than k loads of
fuel to the right of x. Since the jeep can only carry one load at a time, it must
therefore cross the point x at least k+1 times from the left. But between any
two crossings from the left there must be a crossing from the right, so there
must be at least k crossings from the right. Then the jeep must arrive at the
point x at least 2k+1 times, which is what the lemma asserts.

We now combine this result with Banach’s Formula and get


\[ 1 = (\text{path length between } x_{k+1} \text{ and } x_k) = \int_{x_{k+1}}^{x_k} n(x)\,dx \ge (2k+1)(x_k - x_{k+1}), \]

so $x_k - x_{k+1} \le 1/(2k+1)$. Summing from 0 to $f-1$ implies

\[ d = x_0 - x_f \le 1 + \frac{1}{3} + \frac{1}{5} + \cdots + \frac{1}{2f-1}, \tag{3} \]

which gives an upper bound for d(f). It remains only to show that this bound can
be achieved, and this is easily done by induction. The formula is clearly correct
when f = 1. Suppose now we are allowed f+1 loads. Then let the jeep take f+1
loads to the point 1/(2f+1). This will involve f+1 outward trips and f return
trips, hence 2f+1 trips of length 1/(2f+1); therefore a total of one load will be
consumed leaving f loads deposited at the point 1/(2f+1). By the induction
hypothesis, the formula of (1) holds from this point on, completing the proof.
For the case where f is not an integer, the same type of argument shows that 


\[ d(f) = 1 + \frac{1}{3} + \cdots + \frac{1}{2[f]-1} + \frac{\{f\}}{2[f]+1}, \tag{4} \]

where [f] and {f} are the integral and fractional part of f respectively. In other
words, one simply interpolates linearly between integral values of f. The graph 
of d(f) is plotted in Figure 3. 


[FIG. 3: graph of d(f) against f]


Since the odd harmonic series diverges, it follows that a desert of any size 
can be crossed. 
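
Formula (4) and its inverse are easy to tabulate. The sketch below is an illustration under stated assumptions rather than part of the original argument: the names `d` and `loads_needed` are hypothetical, and exact fractions are used so that the linear interpolation can be inverted without rounding.

```python
from fractions import Fraction
import math

def d(f):
    """Farthest point reachable on f loads, following formula (4)."""
    f = Fraction(f)
    whole = math.floor(f)          # [f], the integral part
    frac = f - whole               # {f}, the fractional part
    total = sum(Fraction(1, 2 * k + 1) for k in range(whole))
    return total + frac / (2 * whole + 1)

def loads_needed(distance):
    """Least f with d(f) >= distance, found by inverting the interpolation."""
    distance = Fraction(distance)
    whole, reached = 0, Fraction(0)
    # Add whole loads while the next full term still falls short.
    while reached + Fraction(1, 2 * whole + 1) < distance:
        reached += Fraction(1, 2 * whole + 1)
        whole += 1
    return whole + (distance - reached) * (2 * whole + 1)

print(d(2))                      # one-way range on two loads
print(float(loads_needed(2)))    # loads for the desert of length 2d in the introduction
```

Because the odd harmonic series diverges, the loop in `loads_needed` terminates for every finite distance, although the number of loads grows very quickly.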

I should remark that after finding the solution just given, I became aware of 
the rather similar one given in [4]. The use here of the Banach Formula seems 
to tighten the argument somewhat and make the reasoning more transparent. 
Also we do not need to assume a priori that there will be only a finite number of 
depots. It is at least conceivable that the optimal solution would involve in- 
finitely many deposits, or even a sort of continuous smear of fuel spread out 
along the route. One could no doubt formulate a very general problem in terms 
of measure theory. The argument above shows, however—thanks again to 
Banach’s Formula—that this more general behavior could not give any improve- 
ment in fuel consumption. 

Before leaving the single jeep, we consider the case in which the jeep is re- 
quired to cross the desert and then return. For this case the arguments are the 
same as before except that (2) becomes
\[ n(x) \ge 2k + 2. \tag{5} \]


Letting $\bar d(f)$ be the longest possible round trip (i.e., twice the distance to the
farthest point), we get the even simpler formula

\[ \bar d(f) = 1 + \frac{1}{2} + \frac{1}{3} + \cdots + \frac{1}{f}, \tag{6} \]

where as before equality can be achieved. We note the familiar fact that a round
trip can be substantially cheaper than two one way trips. In fact comparing (6)
with (1), we have

\[ d(f) - \tfrac{1}{2}\,\bar d(f) = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \cdots + \frac{1}{2f-1} - \frac{1}{2f}, \]

but this is bounded by $\sum_{n=1}^{\infty} (-1)^{n+1}(1/n) = \log 2$. Thus for long distance, the
increase in fuel cost for a round trip as against a one way trip becomes negligible.
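
The comparison of the one way and round trip formulas can be confirmed numerically for integral f. This is a sketch only; the helper names `one_way` and `round_trip` are assumptions introduced for illustration, with `round_trip` standing for the total out-and-back distance of formula (6).

```python
from fractions import Fraction
import math

def one_way(f):
    """One-way range on f loads: 1 + 1/3 + ... + 1/(2f-1)."""
    return sum(Fraction(1, 2 * k + 1) for k in range(f))

def round_trip(f):
    """Longest round trip (out and back) on f loads: 1 + 1/2 + ... + 1/f."""
    return sum(Fraction(1, k) for k in range(1, f + 1))

for f in (1, 2, 10, 100):
    gap = one_way(f) - round_trip(f) / 2
    alternating = sum(Fraction((-1) ** (n + 1), n) for n in range(1, 2 * f + 1))
    # The gap is a partial sum of the alternating harmonic series.
    assert gap == alternating
    assert gap < math.log(2)
    print(f, float(gap))
```

The printed gaps increase with f but stay below log 2, which is the sense in which the extra cost of returning becomes negligible over long distances.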


4. Several jeeps. We turn now to the problem of m jeeps, and we shall compute
the function $d_m(f)$ giving the most distant point which all jeeps can reach if
they have f loads of fuel to share between them. In other words, the sum of the
path lengths of all of the jeeps must not exceed f.

We proceed by defining the points $x_k$ exactly as in the one jeep case, as the
point to the right of which the total combined path length of all jeeps is k. The
analogue of Lemma 1, however, is slightly more complicated.


LEMMA 2. For any x in [0, d], $n(x) \ge m$. If $x < x_{m+r}$ ($r \ge 0$), then
\[ n(x) \ge m + 2r + 2. \tag{7} \]


Proof. The first assertion simply corresponds to the fact that all m jeeps are
required to reach the point d. Concerning inequality (7), we see that more than
m+r loads must be transported beyond the point x, so this point must be crossed
at least m+r+1 times from the left. But since there are only m jeeps, in order to
achieve m+r+1 crossings from the left there must be at least r+1 crossings from
the right, giving m+2r+2 crossings in all.

We proceed to calculate $d_m(f)$. Notice that for $f \le m$, the problem is trivial and
$d_m(f) = f/m$ (why?) so we assume that f = m+s, where s > 0. Again restricting
ourselves to integral values of s, we claim


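\[ d_m(f) = \Bigl(\frac{1}{m} + \cdots + \frac{1}{m}\Bigr) + \frac{1}{m+2} + \frac{1}{m+4} + \cdots + \frac{1}{m+2s}
          = 1 + \frac{1}{m+2} + \frac{1}{m+4} + \cdots + \frac{1}{m+2s}, \tag{8} \]

for the path length from $x_{k+1}$ to $x_k$ is one, so

\[ 1 = \int_{x_{k+1}}^{x_k} n(x)\,dx \ge m\,(x_k - x_{k+1}), \quad \text{for } k < m, \]

and

\[ 1 = \int_{x_{m+r+1}}^{x_{m+r}} n(x)\,dx \ge (m+2r+2)(x_{m+r} - x_{m+r+1}). \]

Therefore $x_k - x_{k+1} \le 1/m$ for k < m, and

\[ x_{m+r} - x_{m+r+1} \le \frac{1}{m+2r+2} \quad \text{for } r \ge 0. \]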

Summing for $0 \le k \le m-1$ and $0 \le r \le s-1$ gives (8) as an inequality. Again an
inductive proof shows that equality can be achieved. Assuming the formula correct
for m+s loads of fuel, we see that m+s+1 loads can be moved to the point
1/(m+2s+2) by having one jeep make s+1 round trips and then having all m of
them make the one way trip to this point. This will use up one load, so that
m+s loads are deposited, and the induction can be continued. Note that there
are various ways in which this optimal journey can be performed. One way is to 
have one of the jeeps do all the work of setting up the various depots, while the 
others simply move outward without turning around, refueling as they go. It is, 
however, not possible to prescribe the same path for all jeeps, for if all jeeps 
followed the same route, then the function n(x) would have to be divisible by m 
at all points, which it clearly is not in an optimal trip. We shall return to this 
point later. 
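
The fuel accounting in the inductive construction can be traced stage by stage. The following sketch assumes integral s and follows the schedule described above (one jeep shuttles, then all m jeeps advance); the function names `d_multi` and `simulate` are illustrative only.

```python
from fractions import Fraction

def d_multi(m, s):
    """Formula (8): range of m jeeps sharing m + s loads (s a nonnegative integer)."""
    return 1 + sum(Fraction(1, m + 2 * r + 2) for r in range(s))

def simulate(m, s):
    """Replay the depot schedule and return the distance finally reached."""
    loads = Fraction(m + s)
    position = Fraction(0)
    for j in range(s - 1, -1, -1):
        step = Fraction(1, m + 2 * j + 2)
        # One jeep makes j + 1 round trips, then all m jeeps cross once.
        crossings = 2 * (j + 1) + m
        loads -= crossings * step      # each crossing burns `step` loads
        position += step
        assert loads == m + j          # exactly one load is used per stage
    # m loads remain: every jeep fills up and drives one full unit.
    return position + loads / m

for m in (1, 2, 5):
    for s in (0, 1, 4):
        assert simulate(m, s) == d_multi(m, s)
print(d_multi(2, 4))
```

With m = 1 the function `d_multi(1, f - 1)` reduces to the single jeep formula (1), which gives a quick consistency check between the two cases.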

To prove the result about increasing returns, we wish to compare the distance 
which one jeep goes on f loads with that which m jeeps go on mf loads. For this 
purpose we rewrite (8) for the case s=m(f—1) as follows: 


\[ d_m(mf) = 1 + \Bigl(\frac{1}{m+2} + \frac{1}{m+4} + \cdots + \frac{1}{3m}\Bigr) + \cdots + \Bigl(\frac{1}{(2f-3)m+2} + \cdots + \frac{1}{(2f-1)m}\Bigr), \tag{9} \]

where each term in parentheses contains m summands and there are f terms in
all. We may also rewrite (3) as

\[ d(f) = 1 + \Bigl(\frac{1}{3m} + \cdots + \frac{1}{3m}\Bigr) + \Bigl(\frac{1}{5m} + \cdots + \frac{1}{5m}\Bigr) + \cdots + \Bigl(\frac{1}{(2f-1)m} + \cdots + \frac{1}{(2f-1)m}\Bigr), \tag{10} \]

where there are also m summands in each term in parentheses. A term by term
comparison between (9) and (10) shows the advantage of using many jeeps.
For the special case m = 2, we get

\[ d_2(2f) - d(f) = \Bigl(\frac{1}{4} - \frac{1}{6}\Bigr) + \Bigl(\frac{1}{8} - \frac{1}{10}\Bigr) + \cdots + \Bigl(\frac{1}{4f-4} - \frac{1}{4f-2}\Bigr)
                 = \frac{1}{2}\Bigl(\frac{1}{2} - \frac{1}{3} + \frac{1}{4} - \frac{1}{5} + \cdots + \frac{1}{2f-2} - \frac{1}{2f-1}\Bigr). \]

Thus the longer the trip the greater the saving. On the other hand since the
terms in parentheses above are part of a convergent series, it follows that the
total amount which is saved remains bounded as the trip gets longer.

For the round trip problem, the formula is again simpler. Let $\bar d_m(f)$ be the
round trip distance which m jeeps can achieve on f loads. Then

\[ \bar d_m(mf) = 1 + \frac{1}{m+1} + \frac{1}{m+2} + \cdots + \frac{1}{mf}. \tag{11} \]

So again, the more jeeps there are, the further out they can get on the same
amount of fuel; or returning to the original problem, the less fuel per jeep is
needed to reach a preassigned point. But there is a limit to the advantage one
obtains in increasing the number of jeeps; for as m approaches infinity, a
standard calculation (comparison with $\int dx/x$) shows

\[ \lim_{m \to \infty} \bar d_m(mf) = 1 + \log f, \tag{12} \]

so no matter how many jeeps there are, one cannot go further than 1 + log f on f
loads of fuel per jeep. In terms of the original problem, we get in the limit

\[ f = e^{d-1}, \tag{13} \]

so the amount of fuel needed increases exponentially with the length of the
desert, as one might have guessed.
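
The increasing returns in (9) and the limit in (12) can both be observed numerically. This is a minimal sketch under the assumption that f is a positive integer; the function names are hypothetical, and the round trip sum is evaluated in floating point because only its limiting value is of interest.

```python
from fractions import Fraction
import math

def d_one(f):
    """Single jeep, formula (1): 1 + 1/3 + ... + 1/(2f-1)."""
    return sum(Fraction(1, 2 * k + 1) for k in range(f))

def d_many(m, f):
    """m jeeps on mf loads, formula (9): 1 + sum of 1/(m+2j), j = 1..m(f-1)."""
    return 1 + sum(Fraction(1, m + 2 * j) for j in range(1, m * (f - 1) + 1))

def round_trip_many(m, f):
    """m jeeps on mf loads, round trip formula (11), in floating point."""
    return 1.0 + sum(1.0 / k for k in range(m + 1, m * f + 1))

f = 4
print(float(d_one(f)), [float(d_many(m, f)) for m in (2, 3, 10)])

# Saving for two jeeps: half of an alternating partial sum.
saving = d_many(2, f) - d_one(f)
assert saving == sum(Fraction((-1) ** i, i) for i in range(2, 2 * f)) / 2

# Round trips approach 1 + log f as the number of jeeps grows.
print(round_trip_many(1000, f), 1 + math.log(f))
```

Inverting the limit, a fleet needs roughly exp(d - 1) loads per jeep for a round trip of total length d, which is the exponential growth stated in (13).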

The multi-jeep round trip problem can be interpreted in another way. In- 
stead of thinking of m jeeps, each making a round trip, one may consider a 
single jeep which must make round trips, say on m successive days. The solution 
here is the same as for the m jeep problem with the consequent saving of fuel. In 
fact, on the first day the jeep can set up all the depots it will need for the follow- 
ing days, as one easily sees. 


5. Some Final Remarks and Questions. The last paragraph above raises an 
interesting question. Suppose instead of being concerned about m round trips 
across the desert, one has decided to go into the desert crossing business and 
plans to make a round trip daily into the indefinite future. What sort of routine 
should one then use? As we have just seen, it would be uneconomical to use the 
single round trip routine each day. Similarly, repeated use of the m-day routine 
would be inferior to using the (m-++r)-day routine, so that no periodic program of 
this sort could be optimal. On the other hand, if one is to consider programs 
which are not periodic, it is no longer clear what one should mean by an optimal
program. In any case, it appears that the problem of finding the best “steady 
state” routine has no exact solution at all, so that in practice one would have to 
settle for a routine that was “almost optimal.” 

There are many other jeep problems one can think of. Helmer [5] has con- 
sidered some fairly complicated situations in which the number of depots one is 
allowed to establish is limited. To get a feeling for this sort of problem, the 
reader might look at the problem of crossing a desert of length 2 when only 3 in- 
termediate depots are permitted. 

An apparently simple question is the round trip problem in which fuel is 
available at both ends of the desert, but I must confess with embarrassment that 
I have not been able to find the solution. It is not hard to see that one can do at 
least as well in this case as in the case of two jeeps making one-way trips, but it 
may be possible to do better. The difficulty here as with many optimization prob- 
lems is that there does not appear to be any simple way to determine whether or 
not a given solution is optimal. The upper bound given by Banach’s Formula 
does not seem to be available for this case. I put this problem forward as a 
challenge to jeepologists. 

I conclude with some historical remarks. Shortly after the publication of 
Fine’s solution, Phipps [3] derived the same result by arguing that the single 
jeep problem is equivalent to a problem involving a convoy of jeeps which travel 
together, some being used to refuel others. The solution to the convoy problem 
is very simple, but the argument that this problem is equivalent to the original 
problem does not seem to be quite complete. Using this equivalence, however, 
one can also easily derive the result given here on increasing returns. Finally, 
there seems to be a feeling among many people that the jeep problem can be 
solved by the functional equation method of dynamic programming. In fact the 
problem occurs as an exercise in the book of Bellman [6], but the solution is not 
given there and I know of no way of solving the problem by this method. 


6. Addendum. It has been pointed out to me that if one accepts the convoy 
equivalence of Phipps, then dynamic programming can be used (see e.g. [7]). 
However, for the convoy problem, the solution is almost obvious anyway. 
Imagine that f jeeps set out. Since all but the last must return home, we suppose 
that f—1 of them consume fuel at the rate 2 and the last one, the “red, white, 
and blue” jeep that will make the final crossing, consumes at the rate 1. Thus, 
initially, the convoy is consuming at the rate 2f—1, which it does until one load 
has been consumed, after which the first jeep can be abandoned. The remainder 
then go on consuming at the rate 2f —3, etc. 
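
The convoy argument translates directly into a short loop. The sketch below assumes an integral number f of jeeps, each starting with one load, and simply accumulates the distance covered while each successive load is burned; the name `convoy_distance` is illustrative.

```python
from fractions import Fraction

def convoy_distance(f):
    """Distance reached by a convoy of f jeeps, each starting with one load."""
    distance = Fraction(0)
    for active in range(f, 0, -1):
        # active - 1 support jeeps burn fuel at rate 2 (they must return),
        # the jeep making the crossing burns at rate 1.
        rate = 2 * (active - 1) + 1
        # One load lasts for 1/rate units of distance; then a jeep is dropped.
        distance += Fraction(1, rate)
    return distance

print(convoy_distance(3))
```

The total agrees with formula (1) for the single jeep, which is the content of the equivalence attributed to Phipps.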

I should also remark that for Helmer’s variation in which the number of 
depots is specified, dynamic programming does seem to be the appropriate tool 
to use. 

A NOTE ON SECOND CATEGORY TOPOLOGICAL GROUPS
E. J. HOWARD, San Diego State College


In this note we prove that if G is a topological group and H is the closure of
the identity, then G is of second category if and only if G/H is of second
category. Consequently, since G/H is a Hausdorff group, the study of second category
topological groups can in most cases be restricted to Hausdorff topological
groups.

Let X be a topological space. A subset A of X is nowhere dense in X if the
interior of $\bar A$ (the closure of A) is empty. A subset A of X is said to be of first
category in X if it is the union of a countable family of nowhere dense sets. If a 
subset A of X is not of first category, then it is said to be of second category in X. 
Throughout this paper the topology of G/H will be the quotient topology. 

We begin by giving some well-known facts concerning topological groups. 


LEMMA 1. Let G be a topological group and H a normal subgroup of G and
$\varphi : G \to G/H$ the natural homomorphism, then:

a. The mapping $\varphi : G \to G/H$ is a continuous and open mapping.

b. If H is compact, then $\varphi : G \to G/H$ is a closed mapping.

c. If e is the identity of G and $H = \overline{\{e\}}$, then H is a compact closed normal
subgroup of G and G/H is Hausdorff.

Proof. The following page references are all in [1]. For proofs of (a) and (b)
see pages 36 and 37. For (c), that H is a closed normal subgroup is on page 33
and that G/H is Hausdorff is on page 45. By Proposition 3, Section 20 of [3],
$H = \overline{\{e\}} = \bigcap U_\alpha$, where $\{U_\alpha\}$
is the collection of all neighborhoods of e; thus for any open cover of H there is
an element U in the cover containing e, hence $\bigcap U_\alpha \subset U$. Consequently H
is compact.


LEMMA 2. Let G be a topological group and $H = \overline{\{e\}}$ and $\varphi : G \to G/H$ the
natural homomorphism. If C is a closed subset in G and $C^\circ = \emptyset$ ($C^\circ$ denotes the
interior of C), then $[\varphi(C)]^\circ = \emptyset$.


Proof. Suppose C is closed in G, $C^\circ = \emptyset$ and $[\varphi(C)]^\circ \ne \emptyset$, then there exists
a nonempty open set V in G/H such that $V \subset \varphi(C)$ and an open neighborhood
U of the identity in G such that for some x in G, $\varphi(xU) \subset V$. Hence $xUH \subset CH$.
The set $xUH = W$ is an open subset of G. We shall show that $CH \subset C$. Let
$y \in CH$, then y = ch, where $c \in C$ and $h \in H$. Now by (5.4) of [1] and the fact that
H is a subgroup of G we have $\overline{\{ch\}} = (ch)H = cH = \overline{\{c\}}$. Since $c \in C$ and C is
closed, $\overline{\{ch\}} \subset C$ and hence $ch \in C$. Thus the nonempty open set W is contained
in C contrary to the fact that $C^\circ = \emptyset$.

Our main theorem is a generalization of a theorem in [2]. 




THEOREM. Let G be a topological group and $H = \overline{\{e\}}$; then G is of second
category if and only if G/H is of second category.


Proof. We first show that if G is of second category and G/H=WU£,, then 
for some n, E2? ~ @. Consequently, G/H is also of second category. To see this 
we observe that G=U¢—(E,) =U@—!(En), and because ¢ is continuous each 
@-1(E,) is closed. Since G is of second category, for some 2, @~!(E,) contains a 
nonempty open set, and because ¢is open, E, also contains a nonempty open 
set, i.e., E2~@. (Note: this proof holds for any normal subgroup of G.) 

Now for the converse assume $G$ is of first category; then $G = \bigcup F_n$, where
$\{F_n\}$ is a countable collection of nowhere dense sets, which we may take to be
closed since the closure of a nowhere dense set is nowhere dense. By Lemma 1, for
all $n$, $\varphi(F_n)$ is closed and by Lemma 2 has empty interior. Thus $G/H$ is a countable
union of nowhere dense sets. Hence $G$ of first category implies $G/H$ is of first
category. Consequently $G/H$ of second category implies $G$ is of second category.

ON THE SETS OF DIRECTIONS DETERMINED BY n POINTS
P. R. Scott, University of Adelaide


1. Introduction. Some years ago P. Erdős [1] raised the problem of finding
the least number of different distances determined by $n$ points in the plane.
We consider here the somewhat analogous problem of finding the least number
of different directions (gradients) assumed by lines joining $n$ noncollinear points
in the plane. Some generalizations of the problem are also given.

2. The function $k_n$. Let $P_n$ be a set of $n$ noncollinear points in the plane
together with the set of all straight connecting lines which join these points. Let
$k(P_n)$ be the number of different directions assumed by the connecting lines of
$P_n$. If now $\mathcal{P}$ denotes the family of all such planar sets $P_n$, let

$k_n = \min_{P_n \in \mathcal{P}} k(P_n).$
It is clear that kg=3 (for any triangle), kj =4, ks =5. In the following theorem 
we give bounds for $k_n$.


THEOREM 1. The minimum number $k_n$ of directions determined by a set of $\mathcal{P}$
satisfies the inequalities


$\left\{\tfrac{1}{2} + \tfrac{1}{2}\sqrt{8n-7}\right\} \le k_n \le 2[n/2].$


Proof. The upper bound is easily obtained. If $n$ is even, we take the points
as vertices of a regular $n$-gon. In this case, the sides and diagonals of the regular
$n$-gon lie along sets of parallel lines. There are

(a) $n/2$ distinct directions determined by sets of parallels containing pairs
of opposite sides of the $n$-gon, and

(b) $n/2$ distinct directions, each determined by a set of parallels perpendicular
to a diagonal joining an opposite pair of vertices of the $n$-gon.

Further, if for each set of parallels in (b) we draw representative parallel
lines through the given opposite vertices, it is readily seen that no line of (b)
is parallel to a line of (a). Hence $n$ directions are determined. For odd $n$, we
take the regular $(n-1)$-gon and simply add the center point; this addition
leaves the number of determined directions unchanged.
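
The construction above can be checked numerically. The following is only a sketch, not part of the original note: the function names are hypothetical, coordinates are floating point, and two lines are treated as parallel when the cross product of their unit direction vectors is below an assumed tolerance.

```python
import math

def regular_polygon(n):
    """Vertices of a regular n-gon on the unit circle."""
    return [(math.cos(2 * math.pi * i / n), math.sin(2 * math.pi * i / n))
            for i in range(n)]

def count_directions(points, tol=1e-9):
    """Count distinct directions among all lines joining pairs of points."""
    reps = []  # one unit direction vector per class of parallel lines
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            dx = points[j][0] - points[i][0]
            dy = points[j][1] - points[i][1]
            norm = math.hypot(dx, dy)
            dx, dy = dx / norm, dy / norm
            # two unit vectors are parallel when their cross product vanishes
            if not any(abs(dx * ry - dy * rx) < tol for rx, ry in reps):
                reps.append((dx, dy))
    return len(reps)

def upper_bound_configuration(n):
    """Regular n-gon for even n; regular (n-1)-gon plus its center for odd n."""
    if n % 2 == 0:
        return regular_polygon(n)
    return regular_polygon(n - 1) + [(0.0, 0.0)]

for n in range(4, 13):
    # columns: n, directions found, the bound 2[n/2]
    print(n, count_directions(upper_bound_configuration(n)), 2 * (n // 2))
```

For each $n$ in the loop the second and third columns should agree, which is the upper bound $2[n/2]$ of Theorem 1.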

We now obtain the lower bound. Suppose that $P_n^*$ is a set of $\mathcal{P}$ for which the
value $k_n$ is attained. Let $Q \in P_n^*$ be an extreme (corner) point of the convex hull
of $P_n^*$, and let $l_1, \dots, l_r$ be the connecting lines joining $Q$ to the remaining
points of $P_n^*$. If $l_i$ contains $u_i$ points of the set $P_n^*$ $(1 \le i \le r)$, we can easily show
that the points on each pair of lines $l_j$, $l_k$ determine at least $u_j + u_k - 1$ directions.
For let $R_j$, $R_k$ be the points closest to $Q$ on $l_j$, $l_k$ respectively. Joining $R_j$ to all
points on $l_k$ and $R_k$ to all points on $l_j$ gives a set of $u_j + u_k - 1$ pairwise
intersecting lines (the line $R_jR_k$ occurring twice). Hence at least $u_j + u_k - 1$ directions
are determined, and so


$u_j + u_k \le k_n + 1 \qquad (1 \le j < k \le r).$


Summing over all values of $j$, $k$ we obtain


$(r-1)\sum_{i=1}^{r} u_i \le \frac{r(r-1)}{2}(k_n + 1).$

If $\sum u_i$ is replaced by $n + r - 1$ (since $Q$ has been counted $r$ times) we then get

$k_n \ge \frac{2(n+r-1)}{r} - 1.$

The expression on the right takes its minimal value when $r$ is as large as possible.
Clearly for $P_n^*$, $r \le k_n$; hence


$k_n \ge \tfrac{1}{2}\{1 + \sqrt{8n-7}\},$


and the proof is complete.
Experiment would seem to indicate that $k_n = 2[n/2]$, but I have been unable
to prove this.


3. $k_n$ for convex sets. We recall that a set of $n$ points is said to be convex
if none of the given points lies in the interior of the convex hull of the points.
A complete result for convex sets is given by


THEOREM 2. The minimum number of directions determined by a convex set
of $\mathcal{P}$ is $k_n = n$.


Proof. As before, let $P_n^*$ be a (convex) set of $\mathcal{P}$ for which the value $k_n$ is
attained, and let $Q$ be an extreme point of $P_n^*$. Using our previous notation, if
$l_1$ and $l_r$ are sides of the convex hull meeting at $Q$, then there are $n - u_1 - u_r + 3$
connecting lines through $Q$. Also, there are at least $u_1 + u_r - 3$ further directions
determined by the remaining points on $l_1$ and $l_r$. Hence $k_n \ge n$.

For even $n$, this result coupled with the upper bound of Theorem 1 shows
that $k_n = n$. For odd values of $n$, the same conclusion follows by considering the
regular $n$-gon, a figure which determines exactly $n$ directions.


4. Extensions and generalizations. The problem has an obvious projective
interpretation. Let $S_n$ be a set of $n$ noncollinear points in the projective plane
together with all lines joining these points. We ask ‘What is the smallest number,
$k(S_n)$, of intercepts of the lines of $S_n$ by a general line in the plane?’ It is
clear that $k(S_n)$ is not less than the $k_n$ of Theorem 1.

We may also pose the original problem for nonplanar sets $P_n$. We then obtain

THEOREM 1’. The minimum number $k_n$ of directions determined by a nonplanar
set of $n$ points in 3-space satisfies the inequalities


$\sqrt{6n-6} \le k_n \le 2n-2.$


THEOREM 2’. The minimum number $k_n$ of directions determined by a strictly
convex nonplanar set of $n$ points in 3-space satisfies the inequalities


$n + 2 \le k_n \le 2n - 2.$


Proof. The lower bounds are obtained using methods exactly similar to those
used in the planar case. The lower bound in Theorem 2’ is best possible for
small $n$ (for example the vertices of a tetrahedron), but is probably a poor
bound for large $n$.

A figure for which the upper bound is attained is the pyramid having a
regular $(n-1)$-gon as its base. The bound can be slightly improved to $2n-3$
for even $n$, by placing 2 suitable pyramids on opposite sides of a common regular
$(n-2)$-gon base.


1970] RESEARCH PROBLEMS 505 


Reference 
1. P. Erdős, On sets of distances of n points, this MONTHLY, 53 (1946) 248-250.


RESEARCH PROBLEMS 


EDITED BY VICTOR KLEE 


In this Department the Monthly presents easily stated research problems dealing with notions
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied
by relevant references (if any are known to the author) and by a brief description of known
partial results. Material should be sent to Victor Klee, Department of Mathematics, University
of Washington, Seattle, WA 98105.


A CONJECTURED REPRESENTATION OF GENOCCHI NUMBERS
J. M. GANDHI, Western Illinois University

Genocchi numbers are defined by the formula

(1) $(G + 1)^N + G^N = 1, \quad N > 1, \quad \text{with } G_1 = 1,$

where after expansion $G^i$ is to be replaced by $G_i$ for each $1 \le i \le N$. We remark that
the Genocchi numbers can also be generated by the formula [1, pp. 250-263]:

(2) $\frac{2t}{e^t + 1} = \sum_{N=1}^{\infty} G_N \frac{t^N}{N!}.$

The first few Genocchi numbers are $G_1 = +1$, $G_2 = -1$, $G_4 = +1$, $G_6 = -3$,
$G_8 = +17$, $G_{10} = -155$, $G_{12} = +2073$, $G_{14} = -38227$, $G_{16} = +929569$, etc. with
$G_{2N+1} = 0$. We conjecture that


(3) $G_{2N} = (-1)^N \sum 1^2 \sum 2^2 \sum 3^2 \cdots \sum (N-1)^2,$

where the $\sum$ notation used in (3) has the following meaning:

$\sum k^2 = k^2 - (k-1)^2,$

$\sum k^2 \sum (k+1)^2 = k^2 \sum (k+1)^2 - (k-1)^2 \sum k^2 = k^2\{(k+1)^2 - k^2\} - (k-1)^2\{k^2 - (k-1)^2\},$

and in general we have the recurrence

(4) $\sum k^2 \sum (k+1)^2 \sum (k+2)^2 \cdots \sum (k+N)^2 = k^2 \sum (k+1)^2 \sum (k+2)^2 \cdots \sum (k+N)^2 - (k-1)^2 \sum k^2 \sum (k+1)^2 \cdots \sum (k+N-1)^2.$


We note that the $(N+1)$ $\sum$'s on the left hand side of (4) are reduced to $N$ $\sum$'s and
the process can be continued till there are no $\sum$'s left. The $\sum$ notation used
can be easily understood by actually calculating the first few Genocchi numbers:


$G_4 = \sum 1^2 = 1.$

$G_6 = -\sum 1^2 \sum 2^2 = -1^2 \sum 2^2 + 0^2 \sum 1^2 = -1^2(2^2 - 1^2) = -3.$

$G_8 = \sum 1^2 \sum 2^2 \sum 3^2 = 1^2 \sum 2^2 \sum 3^2 = 1^2(2^2 \sum 3^2 - 1^2 \sum 2^2) = 1^2[2^2(3^2 - 2^2) - 1^2(2^2 - 1^2)] = +17.$

$G_{10} = -\sum 1^2 \sum 2^2 \sum 3^2 \sum 4^2 = -1^2[2^2(3^2 \sum 4^2 - 2^2 \sum 3^2) - 1^2(2^2 \sum 3^2 - 1^2 \sum 2^2)];$

using (5) and simplifying we get $G_{10} = -155$.
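
The hand computations above can be repeated mechanically. The sketch below is an illustration only, with hypothetical function names: `genocchi` solves the symbolic relation (1) for $G_N$ (keeping the constant term of the expansion as 1), and `sigma_chain(k, m)` evaluates $\sum k^2 \sum (k+1)^2 \cdots \sum (k+m)^2$ by recurrence (4).

```python
from functools import lru_cache
from math import comb

def genocchi(limit):
    """Return {N: G_N} for 1 <= N <= limit, from (G+1)^N + G^N = 1 with G_1 = 1."""
    g = {1: 1}
    for n in range(2, limit + 1):
        # Expanding gives 2*G_n + sum_{i=1}^{n-1} C(n,i)*G_i + 1 = 1.
        s = sum(comb(n, i) * g[i] for i in range(1, n))
        g[n] = -s // 2  # s is even, so the division is exact
    return g

@lru_cache(maxsize=None)
def sigma_chain(k, m):
    """Value of the chain starting at k^2 with m+1 summation signs, via (4)."""
    if m == 0:
        return k * k - (k - 1) ** 2
    return k * k * sigma_chain(k + 1, m - 1) - (k - 1) ** 2 * sigma_chain(k, m - 1)

def conjectured_genocchi(n):
    """Right-hand side of (3) for G_{2n}, n >= 2."""
    return (-1) ** n * sigma_chain(1, n - 2)

g = genocchi(16)
for n in range(2, 9):
    print(2 * n, g[2 * n], conjectured_genocchi(n))
```

The two printed columns can be compared against the values of $G_4, \dots, G_{16}$ listed earlier.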
The formula (3) has been verified to be true for all values of $G$'s up to $G_{14}$.
For similar $\sum$ notation as used in this paper, though slightly different,
reference may be made to [2].

INTEGRATION OF TOTAL DIFFERENTIAL EQUATIONS
B. FISHER, The University, Leicester, England

Consider the total differential equation

(1) $P(x, y, z)\,dx + Q(x, y, z)\,dy + R(x, y, z)\,dz = 0,$

where we shall suppose first of all that

$Q(x, y, z) = P(y, z, x)$ and $R(x, y, z) = P(z, x, y).$

By treating $x$ in (1) as a constant and integrating, we get

(2) $U(x, y, z) = \text{const.} = f(x).$

Assuming (1) is integrable, there exists an $f$ such that (2) is a solution of (1). The
problem is to find $f$.

To do this, we have, by the symmetry of (1),

(3) $U(y, z, x) = f(y)$

also is a solution of (1). Solving (3) for $z$, we have $z = V(x, y, f(y))$, and substituting
this value of $z$ in (2), we get $U(x, y, V) = f(x)$. By giving $y$ some suitable fixed
value $a$, then $f(a)$ becomes some arbitrary constant $c$, and we get

$f(x) = U(x, a, V(x, a, c)),$

giving us the function $f$.

One must of course check that (1) is integrable before using this method, since
the method of obtaining an $f$ works even if (1) is not integrable.

Example: $yz(y+z)\,dx + zx(z+x)\,dy + xy(x+y)\,dz = 0$. We have

$\begin{vmatrix} yz(y+z) & zx(z+x) & xy(x+y) \\ \partial/\partial x & \partial/\partial y & \partial/\partial z \\ yz(y+z) & zx(z+x) & xy(x+y) \end{vmatrix} = 0,$

so the equation is integrable.

Treating $x$ as a constant, we have $z(z+x)\,dy + y(x+y)\,dz = 0$, and integration
gives

(4) $\frac{(x+y)(x+z)}{yz} = f(x).$

By symmetry, we have $[(y+z)(y+x)]/zx = f(y)$. Putting $y = 1$ and solving for $z$,
we get

$z = \frac{x+1}{(c_1 - 1)x - 1} = \frac{x+1}{cx - 1},$

where $f(1) = c_1$ is an arbitrary constant and $c_1 - 1 = c$.
Substituting in (4) with $y = 1$, we get

$f(x) = (x+1)\left\{\frac{x(cx-1)}{x+1} + 1\right\} = cx^2 + 1.$

Thus the solution is $[(x+y)(x+z)]/yz = cx^2 + 1$, which can be rewritten to
give the symmetric form $x + y + z = cxyz$.
A similar method can be used for integrating (1), where we have


$Q(x, y, z) = P(y, x, z)$ and $R(x, y, z) = R(y, x, z).$


By treating $x$ in (1) as a constant and integrating, we get (2) again: $U(x, y, z) = f(x)$,
and by the symmetry we this time get


(5) $U(y, x, z) = f(y).$


Solving (5) for $z$, substituting this value of $z$ in (2), and giving $y$ some suitable
fixed value $a$, again gives us $f$.

One must again check that (1) is integrable before using this method.


Example: $yz(y-z)\,dx + xz(x-z)\,dy + xy(x+y)\,dz = 0$. We have


$\begin{vmatrix} yz(y-z) & xz(x-z) & xy(x+y) \\ \partial/\partial x & \partial/\partial y & \partial/\partial z \\ yz(y-z) & xz(x-z) & xy(x+y) \end{vmatrix} = 0,$


so the equation is integrable.
Treating $x$ as a constant, we have $z(x-z)\,dy + y(x+y)\,dz = 0$, and integration
gives


(6) $\frac{(x+y)(x-z)}{yz} = f(x).$


By symmetry we have $[(x+y)(y-z)]/xz = f(y)$. Putting $y = 1$ and solving for $z$,
we get


$z = \frac{x+1}{(c_1+1)x + 1} = \frac{x+1}{cx+1},$


where $f(1) = c_1$ is an arbitrary constant and $c_1 + 1 = c$.
Substituting $y = 1$ in (6), we get


$f(x) = (x+1)\left\{\frac{x(cx+1)}{x+1} - 1\right\} = cx^2 - 1.$

Thus the solution is

$\frac{(x+y)(x-z)}{yz} = cx^2 - 1,$


which can be rewritten in the more symmetric form $x + y - z = cxyz$.
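
Both worked examples can be sanity-checked numerically: if $F(x, y, z) = c$ is a solution of (1), then the gradient of $F$ must be parallel to $(P, Q, R)$. The sketch below is an illustration under stated assumptions, not part of the note: the function names, the sample points, and the finite-difference step are all chosen here for the check.

```python
def gradient(F, x, y, z, h=1e-5):
    """Central-difference approximation to the gradient of F at (x, y, z)."""
    return (
        (F(x + h, y, z) - F(x - h, y, z)) / (2 * h),
        (F(x, y + h, z) - F(x, y - h, z)) / (2 * h),
        (F(x, y, z + h) - F(x, y, z - h)) / (2 * h),
    )

def parallel_defect(F, coeffs, point):
    """|grad F x (P, Q, R)| divided by both norms; close to 0 when parallel."""
    g = gradient(F, *point)
    v = coeffs(*point)
    cross = (g[1] * v[2] - g[2] * v[1],
             g[2] * v[0] - g[0] * v[2],
             g[0] * v[1] - g[1] * v[0])
    norm = lambda w: sum(t * t for t in w) ** 0.5
    return norm(cross) / (norm(g) * norm(v))

# First example: x + y + z = c x y z, written as F1 = c.
F1 = lambda x, y, z: (x + y + z) / (x * y * z)
PQR1 = lambda x, y, z: (y * z * (y + z), z * x * (z + x), x * y * (x + y))

# Second example: x + y - z = c x y z, written as F2 = c.
F2 = lambda x, y, z: (x + y - z) / (x * y * z)
PQR2 = lambda x, y, z: (y * z * (y - z), x * z * (x - z), x * y * (x + y))

points = [(1.3, 0.7, 2.1), (0.5, 1.9, 0.8), (2.2, 1.1, 0.4)]
for p in points:
    print(p, parallel_defect(F1, PQR1, p), parallel_defect(F2, PQR2, p))
```

The printed defects should be at the level of finite-difference error, consistent with the two symmetric forms obtained above.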


A FINITE-VALUED FINITELY ADDITIVE UNBOUNDED MEASURE
D. P. GIESY, University of Southern California


0. INTRODUCTION. In trying to characterize the conjugate space (space of
continuous linear functionals) of $\mathfrak{L}_\infty(X, \mathcal{A}, \mu)$, the space of all $\mu$-essentially
bounded complex-valued $\mathcal{A}$-measurable functions on $X$ (see for example, “Real
and Abstract Analysis” by Edwin Hewitt and Karl Stromberg, Springer-Verlag,
New York, 1965, especially sections 20.14 and 20.27 ff), one quickly discovers
that each continuous linear functional $L$ on $\mathfrak{L}_\infty$ has the form


$L(f) = \int_X f\,d\tau$


where $\tau$ is a complex-valued function on $\mathcal{A}$ satisfying:

(i) $\sup\{|\tau(A)| : A \in \mathcal{A}\} < \infty$ ($\tau$ is bounded on $\mathcal{A}$),

(ii) $\tau(A \cup B) = \tau(A) + \tau(B)$ if $A, B \in \mathcal{A}$ and $A \cap B = \emptyset$ ($\tau$ is finitely additive on
$\mathcal{A}$), and

(iii) $\tau(A) = 0$ if $A \in \mathcal{A}$ and $A$ is locally $\mu$-null.

In my real analysis seminar, a student raised the question of whether (i) can
be replaced by


(i’) $|\tau(A)| < \infty$ for each $A \in \mathcal{A}$ ($\tau$ is finite-valued on $\mathcal{A}$),


that is, whether (i’), (ii), and (iii) imply (i), as is the case for countably additive
$\tau$, where the even weaker “$|\tau(X)| < \infty$” can replace (i’). It is interesting that
the answer to this question can be obtained (in the negative) by purely algebraic
means.

I wish to thank Mr. Stephen Weinstein who brought this problem to my at- 
tention and Mr. Hal Forsey who suggested a simplification in my original proof. 


1. LEMMA. Let $\mathcal{B}$ and $\mathcal{C}$ be algebras of subsets of $X$, $\mathcal{B} \subset \mathcal{C}$, and let $\rho$ be a finitely
additive finite-valued measure on $\mathcal{B}$. Given $C \in \mathcal{C}$, $C \notin \mathcal{B}$, and a (finite) scalar $\gamma$,
there exists a finitely additive finite-valued measure $\tau$ on $\mathcal{C}$ such that $\tau|\mathcal{B} = \rho$ and


$\tau(C) = \gamma.$


Proof. If $\mathfrak{L}$ is the linear space of all functions on $X$ which are finite linear
combinations of characteristic functions of sets in $\mathcal{B}$, and $\mathfrak{M}$ is the linear space
formed in the same way from $\mathcal{C}$, then $\mathfrak{L} \subset \mathfrak{M}$, and the characteristic function of
$C$, $\chi_C$, is in $\mathfrak{M}$ but not $\mathfrak{L}$. The fact that $\chi_C \notin \mathfrak{L}$ follows from the fact that $\mathcal{B}$ is an
algebra so that in the first place, each element of $\mathfrak{L}$ can be represented using
pairwise disjoint sets, and in the second place $C \notin \mathcal{B}$ so $C$ is not a finite union of
sets in $\mathcal{B}$. If $f$ is defined on $\mathfrak{L}$ by


$f\left(\sum_{i=1}^{n} \beta_i \chi_{B_i}\right) = \sum_{i=1}^{n} \beta_i \rho(B_i)$

then $f$ is a well-defined linear functional on $\mathfrak{L}$. The proof of this last statement is
the same as the usual proof that defining $\int_X \left(\sum_{i=1}^{n} \alpha_i \chi_{A_i}\right) d\mu$ to be $\sum_{i=1}^{n} \alpha_i \mu(A_i)$
yields a good definition of an integral on the set of simple functions supported
by sets of finite measure and that this integral is a linear operator.

Then by extending a basis for $\mathfrak{L}$ to a basis for $\mathfrak{M}$ including $\chi_C$ in the extended
basis, we construct a linear functional $g$ on $\mathfrak{M}$ which is an extension of $f$ such that
$g(\chi_C) = \gamma$. If we define $\tau$ by $\tau(S) = g(\chi_S)$ for each $S \in \mathcal{C}$, then it is easily verified
that $\tau$ is the desired measure.


2. THEOREM. Let $(X, \mathcal{A}, \mu)$ be a measure space such that there are infinitely
many $\mu$-essentially distinct sets of finite nonzero $\mu$ measure. Then there exists a
function $\tau$ on $\mathcal{A}$ satisfying (i’), (ii), and (iii), but not (i).


Proof. Under the hypotheses of this theorem, it is easy to construct algebras
$\mathcal{B}_0, \mathcal{B}_1, \mathcal{B}_2, \dots$, and sets $B_1, B_2, \dots$, such that

(a) $\mathcal{B}_0$ contains all locally $\mu$-null sets in $\mathcal{A}$,

(b) $\mathcal{B}_0 \subset \mathcal{B}_1 \subset \mathcal{B}_2 \subset \cdots \subset \mathcal{A}$,

(c) for each $n \ge 1$, $B_n \in \mathcal{B}_n$ but $B_n \notin \mathcal{B}_{n-1}$,

(d) for each $n \ge 1$, $0 < \mu(B_n) < \infty$.

Define $\tau_0$ on $\mathcal{B}_0$ to be the identically 0 function. Clearly $\tau_0$ is a finitely additive
finite-valued measure. Inductively, choose $\tau_n$ to be a Lemma 1 extension of $\tau_{n-1}$
from $\mathcal{B}_{n-1}$ to $\mathcal{B}_n$ such that $\tau_n(B_n) = n$. Finally, let $\tau$ be a Lemma 1 extension of
$\bigcup_{n=1}^{\infty} \tau_n$ from $\bigcup_{n=1}^{\infty} \mathcal{B}_n$ (clearly an algebra) to $\mathcal{A}$. Then $\tau$ is the desired measure.


3. REMARKS. The restriction in Theorem 2 that $X$ contain infinitely many
$\mu$-essentially distinct subsets of finite nonzero $\mu$ measure is easily seen to be
necessary as well as sufficient to the conclusion of that theorem.

With notation as in the statement and proof of Theorem 2, if $f$ is defined on
the $\mathcal{A}$-simple functions on $X$ by $f(s) = \int_X s\,d\tau$, then $f$ is well-defined and linear (as
in the proof of Lemma 1) but $f$ is not continuous, since $n^{-1}\chi_{B_n} \to 0$ in $\mathfrak{L}_\infty$ but
$f(n^{-1}\chi_{B_n}) = 1$ for all $n$. Since the $\mathcal{A}$-simple functions are dense in $\mathfrak{L}_\infty$, this gives
an example of an unbounded linear functional on a dense subspace of a Banach
space.


SOME RESULTS CONCERNING THE OCCURRENCE OF SPECIFIED PRIME
FACTORS IN $\binom{m}{r}$


G. J. SIMMONS, University of New Mexico


It is well known that for any positive integers $r$ and $n \ge r$ the falling
$r$-factorial of $n$


$(n)_r = n(n-1) \cdots (n-r+1)$


is always divisible by $r!$. Thus $p^h \mid (n)_r$ for each prime $p \le r$, where $p^h$ with $h \ge 0$
is the highest power of $p$ dividing $r!$, written $p^h \,\|\, r!$. This note shows conversely
that there are infinitely many $n$ such that $p^t \mid r!$ for each prime $p \le r$, where
$p^t \,\|\, (n)_r$. The even stronger result is true that if $N$ and $r$ are arbitrary integers,
there are infinitely many $m$ such that


(1) $\left(\binom{m}{r}, N\right) = 1.$


It is known that the binomial coefficient $\binom{m}{r}$ is not divisible by a fixed prime $p$
if and only if when $r$ and $m - r$ are represented in $p$-ary notation, forming their
sum does not generate a carry in any digit position. This result was first given
by E. Kummer [1] and later extended to multinomial coefficients by L. E.
Dickson [2] in his dissertation at the University of Chicago and by J. W. L.
Glaisher [3, 4]. If $r$ and $m - r$ satisfied the Kummer conditions for each prime
factor of $N$, then the conclusion given by (1) would be proved. A direct proof of
the existence of $m$ with these properties by this method is possible, but obscure.


LEMMA. Given any integers $n \ge r > 0$ and an arbitrary prime $p$, let $p^h$ be
determined by $p^h \,\|\, (n)_r$. Then $p^h \,\|\, (n + kp^{h+1})_r$ for each nonnegative integer $k$.


Proof.

$(n + kp^{h+1})_r = \prod_{j=0}^{r-1} \{(n - j) + kp^{h+1}\},$


hence


$(n + kp^{h+1})_r \equiv \prod_{j=0}^{r-1} (n - j) = (n)_r \pmod{p^{h+1}}.$


THEOREM. Given any positive integers $r$ and $N$, there exist infinitely many
integers $m \ge r$ such that

$\left(\binom{m}{r}, N\right) = 1.$


Proof. Let $m = r + kN(r!)^2$ where $k$ is an arbitrary nonnegative integer, and
let $p$ be any prime factor of $N$. We prove that


$p \nmid \binom{m}{r}.$


First, if $p \nmid r!$, i.e., $p \nmid (r)_r$, then by the lemma with $n = r$ and $h = 0$, we see that
$p \nmid (m)_r$ and hence

$p \nmid \binom{m}{r}.$


Second, if $p \mid r!$, let $h$ be determined by $p^h \,\|\, r!$, so that $p^{h+1} \mid (r!)^2$ since $h \ge 1$. Then
by the lemma with $n = r$ we see that $p^h \,\|\, (m)_r$, and hence


$p \nmid \binom{m}{r}.$
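
The choice $m = r + kN(r!)^2$ in the proof is easy to try out. The following sketch uses only the Python standard library; the function name and the ranges tested are arbitrary choices made here for illustration.

```python
from math import comb, factorial, gcd

def coprime_binomials(r, N, count=4):
    """Return [(m, C(m, r))] for m = r + k*N*(r!)^2, k = 0, 1, ..., count-1."""
    step = N * factorial(r) ** 2
    return [(r + k * step, comb(r + k * step, r)) for k in range(count)]

# Every binomial coefficient produced this way should be prime to N.
for r, N in [(2, 6), (3, 10), (4, 30)]:
    for m, c in coprime_binomials(r, N):
        print(r, N, m, gcd(c, N))
```

Each printed greatest common divisor should equal 1, in agreement with the theorem.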


This work was supported by the United States Atomic Energy Commission.

EDITED BY J. G. HARVEY AND M. W. POWNALL


Material for this Department should be sent to either of the editors: J. G. Harvey, Department 
of Mathematics, University of Wisconsin, Madison, WI 53706; M. W. Pownall, Department 
of Mathematics, Colgate University, Hamilton, NY 13346. 


MEETING REPORT: CONFERENCE ON SCIENCE IN THE TWO-YEAR COLLEGE 


Thirty-two representatives of seventeen national scientific and educational 
organizations* convened in Washington on June 18-19, 1969 for a Conference on 
Science in the Two-Year College that was supported by the National Science 
Foundation and sponsored by the Commission on Undergraduate Education in 
the Biological Sciences (CUEBS). This meeting followed two earlier conferences 
called by the college science commissions, and provided an opportunity for a 
common focus of the independent two-year college efforts of all the organiza- 
tions represented. 

Following an address by Dale Tillery of the Center for Research and De- 
velopment in Higher Education, University of California at Berkeley, the con- 
ferees addressed themselves to the development of recommendations on two- 
year college science with particular reference to matters of administration, 
curriculum, and personnel. The specific consensus of recommendations, several 
of which were reached without unanimity, have been distributed to the con- 
ferees for transmittal to their respective organizations for endorsement and 
implementation. By late spring 1970, the endorsed guidelines, with a supporting 
rationale, will be disseminated widely to regional accrediting associations, pro- 
fessional organizations, two-year college administrators, teachers, and so on, for 
further implementation and to serve as a basis for continuing dialogue among 
groups concerned with science in two-year colleges. 


* American Association for the Advancement of Science, American Association of Junior 
Colleges, Advisory Council on College Chemistry, American Council on Education, American 
Chemical Society, American Geological Institute, American Institute of Biological Sciences, Amer- 
ican Institute of Physics, Commission on Education of the National Academy of Engineering, Com- 
mission on Undergraduate Education in the Biological Sciences, Council on Education in the Geo- 
logical Sciences, Committee on the Undergraduate Program in Mathematics, Mathematical Asso- 
ciation of America, National Faculty Association of Community and Junior Colleges (representing 
the National Education Association), National Science Foundation. 


PROBLEMS AND SOLUTIONS 


EDITED BY Emory P. STARKE 


ASSOCIATE EDITORS: JOSHUA BARLAZ, GRATTAN P. MURPHY. COLLABORATING EDITORS:
LEONARD CARLITZ, GULBANK D. CHAKERIAN, HASKELL COHEN, ISRAEL N. HERSTEIN,
MURRAY S. KLAMKIN, DANIEL J. KLEITMAN, ROGER C. LYNDON, MARVIN MARCUS,
ALBERT WILANSKY, OSWALD WYLER, and UNIVERSITY OF MAINE PROBLEMS GROUP:
GEORGE S. CUNNINGHAM, CLAYTON W. DODGE, HOWARD W. EVES, WILLIAM R. GEIGER,
CHARLES A. GREEN, ERIC S. LANGFORD, PHILIP M. LOCKE, JOHN C. MAIRHUBER,
EDWARD S. NORTHAM and WILLIAM L. SOULE, JR.


All problems (both elementary and advanced) proposed for inclusion in this Department
should be sent to E. P. Starke, 1000 Kensington Ave., Plainfield, N.J. 07060. Proposers of
problems are urged to enclose any solutions or information that will assist the editors.
Ordinarily, problems in well-known textbooks and results in generally accessible sources are not
appropriate for this Department. No solutions (except those accompanying proposals)
should be sent to Professor Starke.


ELEMENTARY PROBLEMS 


Solutions of Elementary Problems should be sent to Problems Group, Mathematics Depart- 
ment, University of Maine, Orono, ME 04473. To facilitate their consideration, solutions of 
Elementary Problems in this issue should be typed (with double spacing) and should be mailed 
before August 31, 1970. Contributors (in the United States) who desire acknowledgement of re- 
ceipt of their solutions are asked to enclose self-addressed stamped postcards. 


E 2235. Proposed by Gideon Nettler, Montclair (N.J.) State College 


Prove that

$\tau^2(d) = \sum_{c \mid d} \sum_{b \mid c} \sum_{a \mid b} |\mu(a)|,$


where $\mu(n)$ is the Möbius function and $\tau(n)$ is the number of divisors of $n$.


E 2236. Proposed by M. S. Klamkin, Ford Scientific Laboratory and D. J.
Newman, Yeshiva University


Show that if the integral of the reciprocal of a nonconstant polynomial is a
rational function, then the polynomial must be of the form $(ax+b)^n$.


E 2237. Proposed by Arthur Marshall, Madison, Wisconsin 


Let $n$ be an odd, composite, square-free natural number, such that no prime
factor divides the Euler totient of any other prime factor. Prove that $k\varphi(n) = n - 1$
has no solutions for any natural number $k$. (This problem was posed in
D. H. Lehmer, On Euler's totient function, Bull. A.M.S., 38 (1932) 745-751.)


E 2238. Proposed by Emanuel Vegh, U. S. Naval Research Laboratory 


Let $p$ be a prime and $t_1, t_2, \dots, t_k$ a reduced residue system mod $p - 1$.
If $g$ is a primitive root of $p$, then it is well known that the integers $g^{t_1}, g^{t_2}, \dots,
g^{t_k}$ are distinct mod $p$. (In fact, these integers are the primitive roots of $p$.) Is
there an integer $h$, not a primitive root of $p$, such that $h^{t_1}, h^{t_2}, \dots, h^{t_k}$ are
distinct mod $p$?


E 2239. Proposed by G. Sabbagh, Paris, France 


Let $R$ be a binary relation on a nonvoid set $E$ such that:

(1) For no $x \in E$ is $xRx$ true.

(2) For each pair $(x, y)$ of distinct elements of $E$ one and only one of the
following relations holds: $xRy$, $yRx$.

(3) $R$ is dense, which means: If $xRy$ then there is a $z \in E$ such that $xRz$ and $zRy$.
Must $E$ be infinite?


E 2240. Proposed by Yeong-shyeong Tsai, Taiwan Provincial Chung-Hsing 
University 

Let $n$ be an arbitrary positive integer, and let $A$ be the set $\{1, 2, 3, \dots, n\}$.
Take $n$ vertical line segments between two horizontal lines, indexing the top
points and the bottom points by the numbers $1, 2, \dots, n$. Now draw as many
horizontal line segments as you please between adjacent vertical lines, with the
restriction that all endpoints of these segments are to be distinct. Now, starting
at any upper point $i$, proceed downward until the first horizontal segment is
reached, then go across that segment and again proceed downward; continue in
this way, passing over each horizontal segment as it is met until a bottom point,
say $k_i$, is reached.

(1) Prove that the mapping $i \to k_i$ $(i = 1, 2, \dots, n)$ is a one-to-one mapping
of $A$ onto $A$.

(2) Prove that in completing the mapping $i \to k_i$ for all $i$ as described above,
every part of a vertical line is traced exactly once, and every horizontal
segment is traced twice, once in each direction.


E 2241. Proposed by Linda Pleska, Bowling Green State University 


Prove or disprove: If $\lim_{x \to x_0} f(x)$ exists for each $x_0 \in [a, b]$, then the Riemann
integral $\int_a^b f(x)\,dx$ exists.


SOLUTIONS OF ELEMENTARY PROBLEMS 


The following solutions were sent in response to the list of unsolved problems published in Vol. 76, 
No. 6, p. 711. The editors will welcome further contributions. 


Another Inequality for the Altitudes and Exradii of a Triangle 
E 1847 [1966, 81]. Proposed by J. I. Nassar, Muhlenberg College 


Let $h_i$, $r_i$ $(i = 1, 2, 3)$ be the altitudes and exradii respectively of a triangle,
and let $a_1, a_2, a_3$ be any nonnegative numbers. Prove that


$\sum h_i^{a_1} h_j^{a_2} h_k^{a_3} \le \sum r_i^{a_1} r_j^{a_2} r_k^{a_3},$


where the summation is taken over all permutations of 1, 2, 3. The equality
holds if and only if the triangle is equilateral. Here $x^a$ is taken to be the positive
number $x^a$ for $x > 0$, $a \ge 0$. This is an extension of E 1675 [1965, 187].


Solution by Anders Bager, Hjørring, Denmark. It is clear that we must avoid
the case $a_1 = a_2 = a_3 = 0$ if we wish to preserve the stated condition of equality.

In a more usual notation we prove the following slight generalization. Let
$\alpha, \beta, \gamma$ be nonnegative numbers, not all zero. Then


(1) $h_a^\alpha h_b^\beta h_c^\gamma + h_b^\alpha h_c^\beta h_a^\gamma + h_c^\alpha h_a^\beta h_b^\gamma \le r_a^\alpha r_b^\beta r_c^\gamma + r_b^\alpha r_c^\beta r_a^\gamma + r_c^\alpha r_a^\beta r_b^\gamma$

with equality if and only if $a = b = c$. By circling the other way round the triangle
we get another inequality of this sort, and by adding the two we get the stated
inequality with the condition of equality preserved.

We set $s_a = -a + b + c$, $s_b = a - b + c$, $s_c = a + b - c$ and get


(2) $\tfrac{1}{2}(s_a + s_b) = c, \quad \tfrac{1}{2}(s_b + s_c) = a, \quad \tfrac{1}{2}(s_c + s_a) = b.$


As $ah_a = bh_b = ch_c = r_as_a = r_bs_b = r_cs_c$ (double area of the triangle), (1) may be
rewritten as

(3) $a^{-\alpha} b^{-\beta} c^{-\gamma} + b^{-\alpha} c^{-\beta} a^{-\gamma} + c^{-\alpha} a^{-\beta} b^{-\gamma} \le s_a^{-\alpha} s_b^{-\beta} s_c^{-\gamma} + s_b^{-\alpha} s_c^{-\beta} s_a^{-\gamma} + s_c^{-\alpha} s_a^{-\beta} s_b^{-\gamma}.$

Clearly we need only consider the following cases:

(i) $\alpha = \beta = \gamma > 0$. By (2) and the theorem of the arithmetic and the geometric
mean we get $(s_as_b)^{1/2} \le c$ (equality if and only if $a = b$), and two analogous
inequalities. We multiply these and get

$s_as_bs_c \le abc, \qquad s_a^{-\alpha} s_b^{-\alpha} s_c^{-\alpha} \ge a^{-\alpha} b^{-\alpha} c^{-\alpha},$

which is (3) in this case (with equality if and only if $a = b = c$).

(ii) $\alpha > 0$, $\beta = \gamma = 0$. As the function $f(x) = x^{-\alpha}$ $(x > 0)$ is convex, we get
$f(s_a) + f(s_b) \ge 2f(c)$ (equality if and only if $a = b$) and two analogous inequalities.
By adding the three inequalities and dividing by 2 we get


$f(s_a) + f(s_b) + f(s_c) \ge f(a) + f(b) + f(c)$


(equality if and only if $a = b = c$), which is (3) in this case.

(iii) $\alpha > 0$, $\beta > 0$, $\gamma = 0$. Here we use Theorem 99, p. 80, in Hardy-Littlewood-
Pólya, Inequalities (Cambridge, 1964). Consider the function $\Phi(x, y) = x^{-\alpha} y^{-\beta}$
defined in the first quadrant $(x > 0, y > 0)$. We get


$\Phi_{11}(x, y) = \alpha(\alpha + 1) x^{-\alpha-2} y^{-\beta} > 0$


for all $x, y$, and

$(\Phi_{11}\Phi_{22} - \Phi_{12}^2)(x, y) = \alpha\beta(\alpha + \beta + 1) x^{-2\alpha-2} y^{-2\beta-2} > 0$

for all $x, y$. Hence


$\Phi(s_a, s_b) + \Phi(s_b, s_c) \ge 2\Phi(c, a)$

(equality if and only if $a = b = c$) and two analogous inequalities. By adding the
three inequalities and dividing by 2 we get

$\Phi(s_a, s_b) + \Phi(s_b, s_c) + \Phi(s_c, s_a) \ge \Phi(a, b) + \Phi(b, c) + \Phi(c, a)$


(equality if and only if $a = b = c$), which is (3) in this case.

(iv) $\alpha > \gamma > 0$, $\beta \ge \gamma$. By earlier cases we get


(4) $a^{-\gamma} b^{-\gamma} c^{-\gamma} \le s_a^{-\gamma} s_b^{-\gamma} s_c^{-\gamma},$

(5) $a^{-(\alpha-\gamma)} b^{-(\beta-\gamma)} + b^{-(\alpha-\gamma)} c^{-(\beta-\gamma)} + c^{-(\alpha-\gamma)} a^{-(\beta-\gamma)} \le s_a^{-(\alpha-\gamma)} s_b^{-(\beta-\gamma)} + s_b^{-(\alpha-\gamma)} s_c^{-(\beta-\gamma)} + s_c^{-(\alpha-\gamma)} s_a^{-(\beta-\gamma)}.$


We multiply (4) and (5) and get (3), with the usual condition for equality.
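
Inequality (1) can be spot-checked numerically. The sketch below is illustrative only: the function names are hypothetical, the area comes from Heron's formula, the exradius opposite side $a$ is taken as area$/(s-a)$ with $s$ the semiperimeter, and the triangles and exponents are arbitrary samples.

```python
import math

def altitudes_and_exradii(a, b, c):
    """Return ((h_a, h_b, h_c), (r_a, r_b, r_c)) for the triangle with sides a, b, c."""
    s = (a + b + c) / 2
    area = math.sqrt(s * (s - a) * (s - b) * (s - c))  # Heron's formula
    h = (2 * area / a, 2 * area / b, 2 * area / c)
    r = (area / (s - a), area / (s - b), area / (s - c))
    return h, r

def cyclic_sum(t, alpha, beta, gamma):
    """t_a^alpha t_b^beta t_c^gamma summed over the three cyclic shifts, as in (1)."""
    return sum(t[i] ** alpha * t[(i + 1) % 3] ** beta * t[(i + 2) % 3] ** gamma
               for i in range(3))

triangles = [(3, 4, 5), (2, 3, 4), (5, 5, 6), (7, 8, 9)]
exponents = [(1, 1, 1), (2, 0, 0), (1.5, 0.5, 0), (3, 2, 1), (0.5, 2, 1)]
for tri in triangles:
    h, r = altitudes_and_exradii(*tri)
    for e in exponents:
        print(tri, e, cyclic_sum(h, *e), cyclic_sum(r, *e))
```

In every printed row the altitude sum should not exceed the exradius sum, and for an equilateral triangle the two sums coincide up to rounding.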


Note. In the original statement of the problem the sign of inequality was unfortunately re- 
versed. 


Nesting Habits of the Laddered Parenthesis 
E 1903 [1966, 666]. Proposed by George Eldredge, El Cerrito, California 


Let an $n$-ladder of twos, $L_n$, be defined as follows:


$L_n = 2^{2^{\cdot^{\cdot^{\cdot^{2}}}}},$


where there are $n$ twos. Let $N_n$ be the number of distinct integers that can be
obtained from $L_n$ by the appropriate insertion of a set of unambiguous nested
parentheses. For example, $N_3 = 1$, $N_4 = 2$. Find $N_n$.


Solution by Michael Goldberg, Washington, D. C. Since

$(2^2)^2 = 2^{(2^2)} = 16,$

the value is independent of the placing of the parentheses, and $N_3 = 1$. Also


$(2^{(2^2)})^2 = ((2^2)^2)^2 = 16^2 = 256,$ while $2^{(2^{(2^2)})} = 2^{((2^2)^2)} = 2^{16}.$


Hence, $N_4 = 2$. Expressions like

$(2^2)^{(2^2)}$

are excluded since the parentheses are not nested. Hence each new two must be
added at only the top or bottom of the ladder of lower order.
For $n = 5$, we have the values

$(256)^2 = (2^8)^2 = 2^{16}, \quad (2^{16})^2 = 2^{32}, \quad 2^{256}, \quad 2^{(2^{16})}$

and $N_5 = 4$.
For larger values of $n$, the values obtained by adding the two at the top of
the ladder fall very much below the values obtained by adding the two at the
bottom of the ladder. Hence, $N_{n+1} = 2N_n$ or $N_n = 2^{n-3}$ for $n = 3, 4, 5, \dots$.
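
The counting argument can be reproduced for small $n$. Every value of the ladder is a power of 2, so the sketch below (a hypothetical helper, not from the solution) stores only the exponent $e$ of a value $2^e$ and applies the two moves described above: adding a two at the top squares the value, adding one at the bottom replaces the value $v$ by $2^v$. It is practical only up to $n = 6$, since beyond that the exponents themselves become too large to store.

```python
def ladder_exponents(n):
    """Exponents e such that 2**e is a value of the n-ladder, for 3 <= n <= 6."""
    exps = {4}  # every 3-ladder equals 2**4 = 16
    for _ in range(n - 3):
        top = {2 * e for e in exps}      # (2**e)**2 = 2**(2*e)
        bottom = {2 ** e for e in exps}  # 2**(2**e) has exponent 2**e
        exps = top | bottom
    return exps

for n in range(3, 7):
    print(n, len(ladder_exponents(n)), 2 ** (n - 3))
```

For these $n$ the count of distinct exponents can be compared with $2^{n-3}$.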


Note. This solution, originally submitted in 1966 was among a number which were inadver- 
tently lost. 


A Sequence Problem with a Unique Solution 


E 1905 [1966, 774]. Proposed by R. L. Graham and L. A. Shepp, Bell Telephone
Laboratories


Let $x_1 = x$, $x_{n+1} = x^{x_n}$ for $n = 1, 2, \dots$. If $a > e^{1/e}$, prove there is a $y$ for which
$\lim_{n \to \infty} y_n/a_{n+1}$ exists and is a positive number. Is $y$ unique?


Solution by J. H. Lindsey, II, the RAND Corporation. Fix $a > e^{1/e}$. One can
easily show by induction that for $y > 1$, $y_n$ is monotonic increasing. If $\lim_{n \to \infty} a_n$
exists, say $\alpha = \lim_{n \to \infty} a_n$, then $\lim_{n \to \infty} a_n = \alpha = \lim_{n \to \infty} a_{n+1} = \lim_{n \to \infty} a^{a_n} = a^\alpha$ and
$a = \alpha^{1/\alpha} \le e^{1/e}$, a contradiction. Therefore, the sequence $a_n$ is monotone
increasing to infinity.

Now $a_n < a_{n+1}$ and $(a_{n+1})_n > (a_{n+1})_1 = a_{n+1}$. As $y_n$ is a continuous function of
$y$ for $y > 1$, by the intermediate value theorem, we may find $f(n)$ with $a < f(n)
< a_{n+1}$ and $(f(n))_n = a_{n+1}$. If $b > c > 1$ then one can show by induction that
$b_n > c_n$. Therefore, $f(n)$ is uniquely determined by $a$ and $n$.


For any $y > 1$,

$\frac{y_{n+1}}{a_{n+2}} = \frac{y^{(y_n)}}{a^{(a_{n+1})}} = \frac{y^{(y_n)}}{y^{(\log_y a)\,a_{n+1}}} = y^{a_{n+1} c},$

where $c = (y_n/a_{n+1}) - \log_y a$. Therefore,

$\frac{y_n}{a_{n+1}} = \log_y a + \frac{1}{a_{n+1}} \log_y\left(\frac{y_{n+1}}{a_{n+2}}\right).$


In particular,


$\frac{(f(n))_{n-1}}{a_n} = \log_{f(n)} a + \frac{1}{a_n} \log_{f(n)}\left(\frac{(f(n))_n}{a_{n+1}}\right) = \log_{f(n)} a < 1.$


As $(f(n-1))_{n-1}/a_n = 1$, we have $f(n-1) > f(n)$.
Since the $f(n)$ are decreasing and bounded below by $a$, we may write
$x = \lim_{n \to \infty} f(n)$ for some $x \ge a$.
We shall show that $\lim_{n \to \infty} x_n/a_{n+1} = \log_x a$. First,

$x_{n-1}/a_n < (f(n))_{n-1}/a_n = \log_{f(n)} a < \log_x a.$
Let $\log_x a > \varepsilon > 0$. As


$\lim_{m \to \infty} (\log_{f(m)} a) = \log_x a$ and $\lim_{n \to \infty} a_n = \infty,$


there exist $M$ and $N$ so that $m > M$ and $n > N$ together imply

$[\log_x(a) - \varepsilon] - \log_{f(m)} a + \frac{1}{a_{n+1}} \left|\log_{f(m)}(\log_x(a) - \varepsilon)\right| < 0.$


For $m > n$, $m > M$, and $n > N$, we shall show successively that $(f(m))_i/a_{i+1}
> \log_x(a) - \varepsilon$ for $i = m, m-1, \dots, n$. This is true for $i = m$, since $(f(m))_m/a_{m+1}
= 1 > \log_x(a) - \varepsilon$. If the statement is true for $i + 1$ with $i \ge n$, then

$\frac{(f(m))_i}{a_{i+1}} = \log_{f(m)} a + \frac{1}{a_{i+1}} \log_{f(m)}\left(\frac{(f(m))_{i+1}}{a_{i+2}}\right)$

$> \log_{f(m)} a + \frac{1}{a_{i+1}} \left[\log_{f(m)}(\log_x(a) - \varepsilon)\right]$

$> \log_x(a) - \varepsilon.$


Therefore, for $n > N$, $m > M$, and $m > n$, $(f(m))_n/a_{n+1} > \log_x(a) - \varepsilon$. Taking the
limit of this inequality as $m \to \infty$, we have, for $n > N$, $x_n/a_{n+1} \ge \log_x(a) - \varepsilon$.
Therefore


$\lim_{n \to \infty} \frac{x_n}{a_{n+1}} = \log_x a.$

Suppose $\lim_{n \to \infty} (y_n/a_{n+1}) = c(y)$, a positive number. Then we may let $n$ approach
infinity in the equality

$\frac{y_n}{a_{n+1}} = \log_y a + \frac{1}{a_{n+1}} \log_y\left(\frac{y_{n+1}}{a_{n+2}}\right)$


and conclude $c(y) = \log_y a$. Suppose that there are two solutions $x$ and $y$ for $y$
with $x < y$. Then $1 < x$ and $c(x) = \log_x a > \log_y a = c(y)$. However,


$c(x) = \lim_{n \to \infty} \frac{x_n}{a_{n+1}} \le \lim_{n \to \infty} \frac{y_n}{a_{n+1}} = c(y),$

a contradiction from which we can conclude the uniqueness of $y$.


An Unordered Pair Undefined 
E 1917 [1966, 891]. Proposed by K. O. May, University of Toronto 


The ordered pair is customarily defined in terms of the unordered pair by 
(a, b)={ {a,b}, {a}}. Prove that it is impossible to reverse this by defining 
unordered pair in terms of ordered pair. 


Solution by D. W. Hadwin, Augustana College, South Dakota. There are two 
parts of this problem which need clarification. First I will assume that when the 
proposer says "a redefinition of the unordered pair $\{a, b\}$" he means an expression
involving $a$ and $b$ which is unaltered by interchanging $a$ and $b$. The
most ambiguous point in this problem is the phrase "in terms of." In a sense
$\{(a, b), (b, a)\}$ is an expression written in terms of the ordered pairs $(a, b)$ and
$(b, a)$ which is unaltered by interchanging $a$ and $b$. In view of this example I
am led to believe that the proposer's intentions will be expressed in the
reformulation of the problem given below.

Define a sequence of sets $\{S_n\}$ inductively by:

$$S_0 = \{a, b\}, \qquad S_{n+1} = S_n \cup \{(u, v) \mid u, v \in S_n\}.$$

Then $S_n$ is the set of all possible expressions which can be formed from $a$ and $b$
using at most $n$ ordered pairs. Let $S = \bigcup_{n=0}^{\infty} S_n$ and define a mapping $x \to \bar{x}$ from
$S$ to $S$ inductively as follows: $\bar{a} = b$, $\bar{b} = a$, and if $(u, v) \in S$, then $\overline{(u, v)} = (\bar{u}, \bar{v})$.
Note that $\bar{x}$ is merely the expression obtained from $x$ by interchanging $a$ and $b$.

The original problem can now be restated: there is no $x \in S$ for which $x = \bar{x}$.

Proof. By way of contradiction assume that there is an $x \in S$ such that
$x = \bar{x}$. Let $n$ be the smallest nonnegative integer for which there is an $x \in S_n$
such that $x = \bar{x}$ and let $x_0 \in S_n$ be such an $x$. Clearly $n \ne 0$. Since $x_0 \in S_n$ and
$x_0 \notin S_{n-1}$, there exist $u, v \in S_{n-1}$ such that $x_0 = (u, v)$. Thus $x_0 = (u, v) = \bar{x}_0 = (\bar{u}, \bar{v})$.
Hence $u = \bar{u}$, $v = \bar{v}$, contradicting the minimality of $n$.
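
The restated problem lends itself to a brute-force check on the first few levels. The following is only a sketch: it models the atoms as the strings "a" and "b" and ordered pairs as Python tuples, which is an illustrative encoding rather than part of the solution.

```python
from itertools import product


def build_levels(depth):
    """Return [S_0, ..., S_depth] with S_0 = {a, b} and
    S_{n+1} = S_n together with all ordered pairs of elements of S_n."""
    levels = [{"a", "b"}]
    for _ in range(depth):
        prev = levels[-1]
        levels.append(prev | set(product(prev, repeat=2)))
    return levels


def swap(x):
    """Interchange the atoms a and b everywhere inside the expression x."""
    if x == "a":
        return "b"
    if x == "b":
        return "a"
    u, v = x
    return (swap(u), swap(v))


def fixed_points(depth):
    """Expressions in S_depth left unchanged by interchanging a and b."""
    return [x for x in build_levels(depth)[-1] if swap(x) == x]
```

The search finds no fixed point on any level it can reach, in agreement with the minimality argument; it does not replace that argument, since $S$ is infinite.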

Also solved by Gerald Edgar, S. J. Garland, and by J. H. Lindsey, II. 


Edgar shows that the result holds if we replace “ordered pair” by any function f such that 
if f(x, y) =f(u, v), then x =u. 


Derivatives of Periodic Functions 


E 1959 [1967, 198]. Proposed by R. A. Struble, North Carolina State University

If $f(x, y) \in C^1$ where $f$ is periodic in $x$ with least (positive) period $P(y)$, then
the partial derivative $f_y(x, y)$ is periodic if and only if $P'(y) = 0$.


I. Comment by D. A. Hejhal, University of Chicago. The "only if" part is
false without some additional hypothesis. Consider the following counterexample.
Let $f(x, y) = y^2 \sin x + e^y \sin 2x$. Then $P(0) = \pi$ but $P(y) = 2\pi$ for all
sufficiently small $y \ne 0$. Thus $P(y)$ is discontinuous at $y = 0$, whence $P'(0)$ does
not exist.
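
A quick numerical look at this counterexample is sketched below. It reads the first term as $y^2 \sin x$ (an assumption, since the exponent is damaged in the scan), and it only tests whether a candidate value is a period on a sample grid; it does not prove minimality.

```python
import math


def f(x, y):
    """Counterexample function, read as y^2 sin x + e^y sin 2x."""
    return y ** 2 * math.sin(x) + math.exp(y) * math.sin(2 * x)


def is_period(p, y, samples=200, tol=1e-9):
    """Check f(x + p, y) == f(x, y) on a grid of x values in [0, 2*pi)."""
    xs = [2 * math.pi * k / samples for k in range(samples)]
    return all(abs(f(x + p, y) - f(x, y)) < tol for x in xs)
```

At $y = 0$ the value $\pi$ passes the check, while for $y = 0.1$ only $2\pi$ does, which matches the jump in $P(y)$ described in the comment.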


II. Solution of the problem amended by the assumption that $P(y)$ is continuous.
LEMMA 1: If $P(y)$ is continuous, then it is differentiable.
Proof by the proposer. For all $x$, $y$, $\Delta y$ we have

$$f(x + P(y + \Delta y), y) - f(x + P(y), y)$$
$$= f(x + P(y + \Delta y), y) - f(x, y)$$
$$= f(x + P(y + \Delta y), y) - f(x + P(y + \Delta y), y + \Delta y) + f(x, y + \Delta y) - f(x, y).$$

Thus, using the mean value theorem, we have further

$$(1)\qquad f_x(x + P(y) + \xi, y)[P(y + \Delta y) - P(y)] = -f_y(x + P(y + \Delta y), y + \eta)\Delta y + f_y(x, y + \zeta)\Delta y$$

for suitable $\eta$, $\zeta$ and $\xi$ satisfying $|\xi| \le |P(y + \Delta y) - P(y)|$, with $|\eta| \le |\Delta y|$ and
$|\zeta| \le |\Delta y|$. Note that

(2) $|\xi| \to 0$ as $|\Delta y| \to 0$ (uniformly in $x$)

since $P(y)$ is continuous. Now we consider $y$ fixed and choose $x_0$ which satisfies
$f_x(x_0, y) \ne 0$. This is possible since $f(x, y)$ has a least positive period in $x$. Moreover,
since $f_x(x, y)$ is continuous, there exists a $\delta > 0$ such that

(3) $|x_0 - z| < \delta$ implies $f_x(z, y) \ne 0$.

Because of (2), there exists a $\delta_1 > 0$ such that

(4) $|\Delta y| < \delta_1$ implies $|\xi| < \delta$ for all $x$.

Hence for each $\Delta y$ satisfying $|\Delta y| < \delta_1$, there exists a number $x_1$ satisfying

(5) $|x_1 + P(y) + \xi - x_0| < \delta$,

where $\xi = \xi(x_1, \Delta y)$ satisfies (1) with $x = x_1$. Moreover, because of (2), we may
impose the condition $x_1 \to x_0 - P(y)$ as $|\Delta y| \to 0$. Hence, using (1), (3) and (5),
for $0 < |\Delta y| < \delta_1$, we may write

$$\frac{P(y + \Delta y) - P(y)}{\Delta y} = \frac{-f_y(x_1 + P(y + \Delta y), y + \eta) + f_y(x_1, y + \zeta)}{f_x(x_1 + P(y) + \xi, y)}$$

where the right hand member possesses a limit as $|\Delta y| \to 0$. Thus $P(y)$ is
differentiable.


LEMMA 2: The assertion of the original problem is true if P(y) is differen- 
tiable. 


Proof by Paul Nelson, Jr., University of New Mexico. We may differentiate
the identity

$$f(x + P(y), y) = f(x, y)$$

with respect to $y$, obtaining

$$f_x(x + P(y), y)P'(y) = f_y(x, y) - f_y(x + P(y), y).$$

The result now follows from the observation that $f_x(x + P(y), y)$ is not identically
zero in $x$.


Some Nonregular Polyhedra 
E 2141 [1969, 82, 1067]. Proposed by H. S. Hahn, West Georgia College 


(1) Find convex nonregular equilateral (equal-edged) polyhedra with their 
n (4<n<16) vertices on a sphere. 

(2) Prove or disprove that there is no convex nonregular equilateral polyhedron
with an odd number $n$ of vertices all on a sphere, except for $n = 5, 9,
11, 15$, and $55$.


Comments by Michael Goldberg, Washington, D. C. The editorial note with
the original solution indicated that no proof or disproof of part (2) was attempted.
Johnson's paper in the Canadian Journal of Mathematics 18 (1966),
169-200, tabulated all the convex polyhedra with regular faces, but no rigorous 
proof was given. Since then, however, a rigorous and laborious proof was pub- 
lished in the U.S.S.R. It has been translated from the Russian and published 
as Convex Polyhedra with Regular Faces, V. A. Zalgaller, Consultants Bureau, 
New York, 1969. 


A Nonsingular Matrix? 


E 2184 [1969, 825]. Proposed by J. C. Nichols, Monmouth (Illinois) College 
and (independently) by J. A. Huckaba, University of Missouri 


Let $a_{ij}$ be nonnegative real numbers such that (a) $a_{ii} = 1$, (b) $a_{ij}a_{ji} < 1$, and
(c) $a_{ij}a_{jk} \le a_{ik}$ for all $i, j, k$ with $1 \le i, j, k \le n$ and $i \ne j$.
Show that the matrix $[a_{ij}]$ has a positive determinant for $n = 3$. This is
obviously true for $n = 1, 2$; can you establish this result for $n > 3$? At least
show that $[a_{ij}]$ is non-singular for all $n$.


I. Solution by E. F. Schmeichel, College of Wooster. Let $D = \det[a_{ij}]$. Multiply
the first row of $D$ by $a_{21}a_{31}$, the second row by $a_{31}$, and the third by $a_{21}$.
Now remove the factor $a_{21}a_{31}$ from the first column, subtract the second row
from each of the others, and remove the factor $a_{31}$ from the resulting first row.
This yields

$$D = \frac{1}{a_{21}}\begin{vmatrix} a_{32}a_{21} - a_{31} & a_{21} - a_{23}a_{31} \\ a_{21}a_{12} - 1 & a_{21}a_{13} - a_{23} \end{vmatrix} = \frac{1}{a_{21}}\begin{vmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{vmatrix}.$$

Note that $b_{11} \le 0$, $b_{22} \le 0$, $b_{12} \ge 0$, $b_{21} < 0$. If $b_{12} > 0$, we have $D = (b_{11}b_{22} - b_{12}b_{21})/a_{21} \ge -b_{12}b_{21}/a_{21} > 0$. If $b_{12} = 0$, then $a_{21} = a_{23}a_{31}$, and so $b_{11} = (a_{32}a_{23})a_{31} - a_{31} < 0$, since
$a_{32}a_{23} < 1$. Similarly $b_{22} < 0$. Therefore $D = (b_{11}b_{22} - 0\cdot b_{21})/a_{21} > 0$, as required.

Noting that the matrix

[3 × 3 example matrix; entries not recoverable from the source]

where $n$ is a positive integer, fulfills the given conditions, and has determinant
$1/n^2$, we see that 0 is the best possible lower bound.


II. Solution by L. J. Lange, University of Missouri. The matrix $[a_{ij}]$ may be
singular for $n = 6$.

Let $A = [a_{ij}]$ be an $n \times n$ circulant of real numbers where $a_{ij} = a_{j-i}$ and, for
$j < i$, $a_{ij} = a_{n+j-i}$. If $\omega_i$ ($i = 1, 2, \ldots, n$) are the $n$ $n$th roots of unity, it is well
known that the characteristic roots of $A$ are

$$\alpha_i = \sum_{k=0}^{n-1} a_k\omega_i^k.$$

If, in particular, $n = 6$, $\omega_i = -1$, $a_0 = 1$, $a_1 = 2/3$, $a_2 = 1/2$, $a_3 = 2/3$, $a_4 = 1/2$,
$a_5 = 2/3$, one easily sees that the conditions of the problem are met and $\alpha_i = 0$.


Also solved by J. C. Nichols. 

Editorial Note. Modification of Lange's method will yield singular matrices for all even $n$
greater than 6. For $n = 2k$, let $a_0 = 1$, $a_{2i+1} = (k-1)/k$ ($i = 0, 1, \ldots, k-1$), and $a_{2i} = (k-2)/(k-1)$
($i = 1, 2, \ldots, k-1$).
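
The circulant construction can be verified with exact rational arithmetic. The sketch below assumes condition (c) reads $a_{ij}a_{jk} \le a_{ik}$ for $i \ne j$ and that the circulant has entry $a_{(j-i) \bmod n}$ in position $(i, j)$; the function names are illustrative.

```python
from fractions import Fraction


def lange_matrix(k):
    """Circulant of order n = 2k with a_0 = 1, odd offsets (k-1)/k,
    and nonzero even offsets (k-2)/(k-1)."""
    n = 2 * k
    first_row = [Fraction(1)]
    for offset in range(1, n):
        if offset % 2 == 1:
            first_row.append(Fraction(k - 1, k))
        else:
            first_row.append(Fraction(k - 2, k - 1))
    return [[first_row[(j - i) % n] for j in range(n)] for i in range(n)]


def satisfies_conditions(a):
    """Check (a) unit diagonal, (b) a_ij * a_ji < 1, (c) a_ij * a_jk <= a_ik for i != j."""
    idx = range(len(a))
    diag = all(a[i][i] == 1 for i in idx)
    pair = all(a[i][j] * a[j][i] < 1 for i in idx for j in idx if i != j)
    chain = all(a[i][j] * a[j][k] <= a[i][k]
                for i in idx for j in idx for k in idx if i != j)
    return diag and pair and chain


def kills_alternating_vector(a):
    """True when A v = 0 for v = (1, -1, 1, -1, ...), which forces det A = 0."""
    n = len(a)
    v = [(-1) ** j for j in range(n)]
    return all(sum(row[j] * v[j] for j in range(n)) == 0 for row in a)
```

The alternating vector corresponds to the root of unity $-1$, so a zero product is the same statement as the vanishing characteristic root in Lange's argument.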


Minimal Curve for Fixed Area 
E 2185 [1969, 825]. Proposed by Michael Goldberg, Washington, D. C. 


Given a convex quadrilateral. Find the shortest curve which divides it into 
two equal areas. 


I. Solution by C. S. Ogilvy, Hamilton College. It is well known that the mini- 
mal curve which, together with two sides of an angle, encloses a fixed area, forms 
a circular sector. (Courant and Robbins, What is Mathematics?, p. 502, No. 97.) 
This implies that the solution curve is that circular arc perpendicular to two 
sides of the quadrilateral that halves the area. Let A be such an arc perpendicu- 
lar to two adjacent sides and B be such an arc perpendicular to two opposite 
sides. Depending on shape, some quadrilaterals will have solutions of type A,
some of type B. There may be transitional cases with equal solution arcs, one 
A and one B. (The square has two equal B solutions, both straight lines.) To 
characterize the shapes leading to each solution may necessitate numerical 
calculations. 
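
As a small illustration of a type A candidate, the sketch below computes the length of a circular arc centred at a vertex (hence perpendicular to the two adjacent sides) that cuts off a prescribed area. It assumes the whole sector fits inside the quadrilateral, and the function name is hypothetical.

```python
import math


def sector_arc_length(vertex_angle, enclosed_area):
    """Arc length of the sector with the given angle (radians) and area.

    Uses area = r^2 * angle / 2, so r = sqrt(2 * area / angle) and
    arc length = r * angle.
    """
    radius = math.sqrt(2.0 * enclosed_area / vertex_angle)
    return radius * vertex_angle
```

For the unit square (vertex angle $\pi/2$, half area $1/2$) this gives $\sqrt{\pi/2} \approx 1.25$, longer than the straight bisecting segment of length 1, consistent with the remark that the square has type B solutions.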


II. Solution by the proposer. The curve may be considered as a restraining 
member under tension produced by internal fluid pressure in the restricted area. 
The ends of the curve are free to slide along the sides. Hence, the curve must be 
normal to two sides of the quadrilateral. Furthermore, since the fluid pressure 
is uniform, the curve must take the form of a circular arc. With the foregoing 
conditions in mind, the various possibilities must be investigated separately for 
each shape of quadrilateral. The method may be generalized to other polygons. 
Also, it may be extended, with further complications, to nonconvex polygons. 


Also solved by M. T. Bird & V. E. Hoggatt, Jr., and Bill Sands. One unsigned solution was 
received. 


Self-inverse Functions
E 2186 [1969, 825]. Proposed by R. G. Kuller, Northern Illinois University


Let $g$ be a continuous real valued function defined on a real interval $J$.
Assume that $g$ is not the identity function $I$, but that for some integer $m \ge 2$,
$g \circ g \circ \cdots \circ g = I$, where there are $m$ $g$'s and the symbol $\circ$ denotes composition.
Show that $g \circ g = I$.


Solution by Douglas Lind, Stanford University. Suppose $g\colon J \to J$ is continuous
with $g^m = g \circ \cdots \circ g = I$. Then $g(x) = g(y)$ implies $x = g^m(x) = g^m(y) = y$, so $g$ is
injective and hence strictly monotone since $g$ is continuous. If $g$ is increasing,
then assuming $g(x) > x$ leads to $x = g^m(x) > g^{m-1}(x) > \cdots > g(x) > x$, and a similar
contradiction results upon assuming $g(x) < x$. Hence $g(x) = x$ for all $x$ in $J$,
so $g = I$. If $g$ is decreasing, then $x < y$ implies $g(x) > g(y)$, so $g^2(x) < g^2(y)$, showing
$g^2$ is increasing. Since $(g^2)^m = (g^m)^2 = I^2 = I$, $g^2 = I$ by the first case.
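
A concrete decreasing example may help: the reflection $g(x) = 1 - x$ on $[0, 1]$ is not the identity, yet every even iterate is. The sketch below is illustrative only and uses exact fractions to avoid rounding.

```python
from fractions import Fraction


def iterate(g, m):
    """Return the m-fold composition of g with itself."""
    def composed(x):
        for _ in range(m):
            x = g(x)
        return x
    return composed


def reflect(x):
    """A decreasing continuous map of [0, 1] onto itself."""
    return 1 - x
```

Here the fourth iterate is the identity and so is the second, as the solution predicts, while the third iterate is again the reflection.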

Also solved by D. M. Bloom, John Bryant & Robert Gilmer, Orin Chein, E. H. Davis, E. P.
Del Norte, N. G. Fine, M. H. Fish, Kenneth Fogarty, W. F. Fox, Gilles Gauthier, M. G. Greening
(Australia), C. V. Heuer & G. A. Heuer, A. E. Hoffman, Eleanor G. Jones, J. F. Leetch, Gesing
Leung (Hong Kong), Andrej Makowski (Poland), Dan Marcus, J. V. Michalowicz, Simeon Reich
(Israel), A. A. Sardinas, E. F. Schmeichel, D. E. Smith, R. A. Struble, A. C. Williams, Mark Yu,
and the proposer.

Gauthier shows that the assertion holds for any Darboux function on set E to Z. Makowski
observes that the result follows at once from Theorems 15.2 and 15.3 of Kuczma, Functional Equations
of a Single Variable (Monografie Matematyczne, vol. 46).


ADVANCED PROBLEMS 


All solutions of Advanced Problems should be sent to Joshua Barlaz, Rutgers—The State 
University, New Brunswick, N. J. 08903. Solutions of Advanced Problems in this issue
should be typed (with double spacing) on separate, signed sheets and should be mailed before 
August 31, 1970. Contributors (in the United States) who desire acknowledgement of receipt of 
their solutions are asked to enclose self-addressed stamped postcards. 

An asterisk (*) means neither the proposer nor the editors supplied a solution. 


5734.* Proposed by G. G. Lorentz, University of Texas 
Let $E_n$ be the degree of uniform approximation of $x^{n+1}$ on the interval $[0, 1]$
by polynomials

$$(1)\qquad P_n(x) = \sum_{k=0}^{n} a_k x^k(1 - x)^{n-k}, \qquad a_k \ge 0$$


with positive coefficients in $x$ and $(1 - x)$. In other words, let $E_n$ be the minimum of
$\max_{0 \le x \le 1} |x^{n+1} - P_n(x)|$ for all polynomials of the form (1). Prove that $\gamma = \lim_{n\to\infty} nE_n$
exists, and find $\gamma$.


5735.* Proposed by Paul Erdős and T. Motzkin, University College of Swansea,
Wales


Denote by $F(n)$ the number of pairs $1 \le a < b \le n$ for which $a$ and $b$ have the
same prime factors. Prove that $\lim_{n\to\infty} F(n)/n$ exists and is finite.
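
For experimentation, $F(n)$ can be tabulated directly. The sketch below assumes the pairs are counted with $a < b$ (the inequality sign is damaged in the scan) and groups integers by their radical, the product of their distinct primes; the helper names are illustrative.

```python
from collections import Counter


def radical(n):
    """Product of the distinct prime factors of n (radical(1) = 1)."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            result *= p
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        result *= n
    return result


def count_pairs(n):
    """Number of pairs 1 <= a < b <= n with the same set of prime factors."""
    groups = Counter(radical(k) for k in range(1, n + 1))
    return sum(c * (c - 1) // 2 for c in groups.values())
```

For $n = 10$ the qualifying pairs are (2, 4), (2, 8), (4, 8) and (3, 9), so the count is 4.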
5736. Proposed by M. S. Klamkin, Ford Scientific Laboratory 


Solve the nonlinear difference equation of rth order 


m m+1 


m+1 m m+1 m m 
D, = €,Dn—1 + @2Dn-1Dn—2 +e + + OrDn—1Dn—2 + ++ Dy—r4.1Da—r 
(m, r, a, constants). 


5737. Proposed by Simeon Reich, Israel Institute of Technology 
Let $p(z) = a_0 + a_1 z + \cdots + a_{n-1}z^{n-1} + a_n z^n$, $a_n \ne 0$, be a polynomial with
complex coefficients. Denote


{ max | z/an | } 4 
Osksn-1 


by Q. It is known (Morris Marden, The Geometry of the Zeros, AMS, 1949, p. 96) 
that all the zeros of p(z) lie in the circle 

[2] S1+ 0". 
Suppose that a,_1=0, and that Q>1. Prove that the zeros of p(z) lie also in the 
circles 
(1) lz] SQ +@+---+0™4, and 
(2) lz| s max {(1 + | ax/an| (1 + | aj/an| $4! 


OSJj,kKSn—-1,j 4k 


5738. Proposed by E. M. Reingold, Cornell University 


Prove that there are infinitely many primes whose representations in base
$b$ begin (on the left) with an arbitrary string of digits $a_n a_{n-1} \cdots a_0$ (where
$0 \le a_i < b$).
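
The statement can be explored by direct search. This is a sketch under two assumptions: the prefix is passed as a list of digits with a nonzero leading digit, and trial division is fast enough for the small cases tried. Finding examples this way says nothing about infinitude.

```python
def is_prime(n):
    """Trial-division primality test, adequate for small illustrative inputs."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def to_digits(n, base):
    """Digits of a positive integer n in the given base, most significant first."""
    digits = []
    while n > 0:
        digits.append(n % base)
        n //= base
    return digits[::-1]


def primes_with_prefix(prefix, base, count):
    """Return the `count` smallest primes whose base-`base` digits start with `prefix`."""
    head = 0
    for digit in prefix:
        head = head * base + digit
    found = []
    extra = 0  # number of digits appended after the prefix
    while len(found) < count:
        low = head * base ** extra
        high = (head + 1) * base ** extra
        for n in range(low, high):
            if is_prime(n):
                found.append(n)
                if len(found) == count:
                    break
        extra += 1
    return found
```

For example, the two smallest primes beginning with the decimal digits 123 are 1231 and 1237.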


5739. Proposed by Richard Johnsonbaugh, Morehouse College 


Find all connected, locally compact abelian groups with connected dual 
group. 


SOLUTIONS OF ADVANCED PROBLEMS 


Theorem of the Primitive Element 
5668 [1969, 423]. Proposed by P. M. Perdew, University of Hawaii


Prove the following generalization of the Theorem of the Primitive Element.
Let $K = F(\alpha, \beta)$, where $\alpha$ and $\beta$ are algebraic over $F$. Denote by $r$ and $s$ the
multiplicities of $\alpha$ and $\beta$, and set $t = \min(r, s)$. Then $(t, \operatorname{char}(F)) = 1$ implies
$K/F$ is a simple extension.


Solution by Robert Gilmer, Florida State University. If $F$ has characteristic
zero, the result follows from the Theorem of the Primitive Element. If $\operatorname{char}(F) = p \ne 0$,
then $r$ and $s$ are powers of $p$: hence $(t, p) = 1$ implies that $r$ or $s$ is $p^0 = 1$,
so that $\alpha$ or $\beta$ is separable over $F$. It is then known (B. L. van der Waerden,
Modern Algebra, vol. I, 2nd edition, Section 40) that $F(\alpha, \beta)$ is a simple extension
of $F$.


Also solved by E. P. Del Norte, E. F. Schmeichel, and the proposer. 


Polynomial Rings and Left Ore Domains 
5669 [1969, 423]. Proposed by M. E. Harris, University of Illinois in Chicago 


A left Ore domain $R$ is an associative ring without zero divisors such that the
intersection of any two nonzero left ideals is nonzero.
Prove: if $R$ is a left Ore domain then so is $R[x]$ (the polynomial ring in one
variable over $R$). How about the formal power series case?


Solution by D. Z. Djoković, University of Waterloo, Ontario. The left Ore
domain $R$ has a left quotient ring $Q$ which is a division ring (cf. N. Jacobson,
Theory of Rings, p. 118). The polynomial ring $R[x]$ is canonically embedded in
$Q[x]$ which is a noncommutative principal ideal domain. Hence, $Q[x]$ is a left
and right Ore domain (loc. cit. p. 31). If $f_1(x)$ and $f_2(x)$ are elements of $R[x]$
there exist $g_1(x)$ and $g_2(x)$ in $Q[x]$ such that $g_1(x)f_1(x) = g_2(x)f_2(x)$. If $f_1(x)$ and
$f_2(x)$ are nonzero then $g_1(x)$ and $g_2(x)$ can be chosen nonzero. The coefficients
of $g_1(x)$ and $g_2(x)$ have the form $a^{-1}b$, where $a$ and $b$ are elements of $R$ with
$a \ne 0$. Since $R$ is a left Ore domain, all the elements $a$ have a common left multiple
$m \in R$. Hence $mg_1(x)$ and $mg_2(x)$ are in $R[x]$, which proves that the intersection
of the left principal nonzero ideals $R[x]f_1(x)$ and $R[x]f_2(x)$ is nonzero.
The result is true for arbitrary nonzero left ideals of $R[x]$ since such ideals contain
principal nonzero left ideals. This proves that $R[x]$ is a left Ore domain.

The above method does not work in the case of $R[[x]]$, the ring of formal
power series in one variable over $R$.


Also solved by R. Berghout & R. Groenhout (Australia), Israel Kleiner, and the proposer. 

Note. Berghout and Groenhout provide the reference: M. A. R. Qureshi, (¢, d)-extensions of a
ring, Jour. Natur. Sci. and Math., 7 (1967) 71-83 (Math. Rev. vol. 38, #192). Kleiner provides the
reference: L. Small, Orders in Artinian rings, Jour. Alg., 4 (1966) Theorem 2.43, where generaliza- 
tions of the problem are given. However, as implied by the above, we have received no material on 
the major thrust of the problem as it relates to the ring of formal power series.


DYNAMIC TOPOLOGY
G. T. WHYBURN, University of Virginia 


1. Introduction. This title is not meant to be either laudatory or derogatory 
but simply to be in contrast with what might be called static topology. The 
paper of mine which attracted the notice of the Chauvenet Prize Committee in 
the 1930’s was called “On the structure of continua” and dealt largely with 
cyclic element [1] and other types of structure analysis of certain topological 
spaces. Even in this, however, the problem was formulated as to what type of 
mappings will preserve all topological properties, especially when applied to 
special domain spaces such as Euclidean spaces. 

Briefly put, dynamic topology refers to the body of results of a topological 
nature in which the function concept plays an essential role. The function con- 
cept is basic in nearly all fields of mathematics. We study spaces and systems by 
observing how they act when undergoing transformations effected by the action
of functions upon them. This is similar to methods used in the physical and na- 
tural sciences in studying concrete problems and phenomena of the world 
around us. In most cases movement, growth, change under natural or artificially 
induced conditions, evolution, or the like appropriate to the medium under re- 
view are involved in physical, chemical or biological studies. In mathematics 
the function concept enjoys an ever increasing role in nearly all areas, whether it 
be analysis, algebra or topology. This is especially true now and is of fairly re- 
cent vintage in topology. However, there is no lessening of interest and activity 
in the static study of structures. The Poincaré conjecture, for example, is the 
source of much fruitful effort as are also manifold structure and characterization 
and the relation between a set and its complement in Euclidean and other 
manifolds [2, 3, 4]. Even these areas of topology, however, become more and 
more pervaded by usage of the function device. How a space changes or remains 
invariant under action of various types of functions is most revealing as to its 
structure. 

Only a few phases of this subject will be dealt with in this lecture and these 
are selected more because of their relevance to my own work and interests than 
on other grounds. However, it is hoped that they will be of interest to you also, 
at least in that they are significant examples of the work going on in certain 
areas of topological research. We will be concerned largely with results and prob- 
lems related to compactness of mappings and compactifications of mappings. 
Thus an appropriate subtitle would be “Compactness and compactifications of 
mappings.” 

To simplify our base of discussion, unless otherwise indicated, all topological
spaces will be assumed to be Hausdorff spaces, i.e., distinct points lie in disjoint
open sets. A mapping $f\colon X \to Y$ of a space $X$ into another space $Y$ is a continuous
function from $X$, its domain, to $Y$, its range. The double arrow $f\colon X \Rightarrow Y$ will indicate
that the function maps $X$ onto $Y$. The boundary $\bar{U} - U$ of an open set $U$ is
denoted by $\partial U$.


2. Compactness and quasi-compactness. A mapping $f\colon X \to Y$ is compact
[5, 6] provided the inverse $f^{-1}(A)$ of every compact set $A$ in $Y$ is compact.
Since $X$ and $Y$ are Hausdorff spaces, note that all mappings $f\colon X \to Y$ are compact
when $X$ is a compact space. However, the mapping $w = e^{i\theta}$ of $0 \le \theta < 2\pi$ onto
$|w| = 1$ is not compact, for example. In general a mapping is compact if it is
closed and has compact point inverses. The converse holds precisely [7] when 
X and Y are Hausdorff spaces and Y is a k-space, i.e., when the topology in Y is 
determined by its compact subsets in the sense that a set in Y is closed if and 
only if it meets every compact set in a closed set. Assuming the above conditions, 
compactness of a mapping also is readily characterized in terms of filters or directed
families, $f\colon X \Rightarrow Y$ being compact if and only if for every directed family
$N$ in $Y$ converging to $y \in Y$ the inverse family $f^{-1}(N)$ is directed toward $f^{-1}(y)$
in the sense that every directed underfamily has a cluster point in $f^{-1}(y)$. Thus
compactness of a mapping means precisely that compactness of sets is invariant
both forward and backward, under both $f$ and $f^{-1}$. It is such a useful property
because it assures one of action on noncompact domain spaces closely resembling 
the action of a mapping on compact spaces with which so much can be done. 
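
The noncompactness of the mapping $w = e^{i\theta}$ mentioned above can be made explicit in a few lines. The sequence $\theta_k$ used here is an illustrative choice and does not come from the text.

```latex
% Sketch: why w = e^{i\theta}, 0 \le \theta < 2\pi, is not a compact mapping.
\[
  f\colon [0, 2\pi) \to \{\, |w| = 1 \,\}, \qquad f(\theta) = e^{i\theta}.
\]
% The whole circle A = \{ |w| = 1 \} is compact, but its inverse is the domain itself:
\[
  f^{-1}(A) = [0, 2\pi),
\]
% which is not compact, since the points
\[
  \theta_k = 2\pi - \tfrac{1}{k}, \qquad k = 1, 2, \ldots
\]
% have no cluster point in [0, 2\pi), although f(\theta_k) \to 1 = f(0).
```

The same sequence shows that the mapping is not closed, which is consistent with the characterization of compact mappings as closed mappings with compact point inverses.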

A mapping $f\colon X \to Y$ is quasi-compact [8] provided the image of every closed
(open) inverse set is closed (open). This property was introduced by Alexan- 
droff-Hopf [9] and called strong-continuity. More recently it has come to be 
known as the property of being a quotient mapping. It will be noted at once 
that every closed mapping is automatically quasi-compact, as is also every open 
mapping, and in particular every compact mapping, granted in the latter case 
that the domain and range spaces are Hausdorff and that the range is a k-space. 
It also readily follows that every retraction is quasi-compact. In particular note 
that the exponential mapping $w = e^z$ is quasi-compact but not compact. The
same is true of the mapping given by any transcendental entire function. 

Quasi-compactness is closely related to and is especially useful in the study 
of mappings generated by decompositions of a given space $X$. Indeed if $X$ is decomposed
into a collection $G$ of disjoint sets $\{g\}$ and a new space $Y'$ is set up
with elements of $G$ as its points and a topology introduced by defining a set in
$Y'$ to be open if and only if the union of its elements is an open set in $X$, the so-called
quotient topology, then the natural mapping $\phi\colon X \Rightarrow Y'$ which maps each
$x \in X$ into the element $g$ of $G$ containing $x$ is quasi-compact. If we begin with a
mapping $f\colon X \Rightarrow Y$ and let $X$ be decomposed into the collection $\{f^{-1}(y)\}$ of point
inverses, $y \in Y$, then letting $\phi\colon X \Rightarrow Y'$ be the natural mapping of this decomposition,
the function $h\colon Y' \Rightarrow Y$ defined by $h(y') = f\phi^{-1}(y')$ is 1-1 and continuous. Further
$h^{-1}$ will be continuous and thus $h$ will be a homeomorphism if and only if $f$ is
quasi-compact.

Openness and closedness of a mapping are nicely characterized in terms of
quasi-compactness as follows: A mapping $f\colon X \Rightarrow Y$ is open (closed) if and only if
it is quasi-compact and generates a lower (upper) semi-continuous decomposition
of $X$ into point inverses. Another remarkable property of quasi-compact
mappings is that local connectedness is invariant under all such mappings,
thus under all open or closed mappings and all retractions. The latter results
from the unexpected fact that a mapping $f\colon X \Rightarrow Y$ quasi-compact on any cross
section (i.e., any set in $X$ mapping onto $Y$) is automatically quasi-compact on $X$.

The inverse does not hold, however. Indeed, in contrast to the situation 
with open or closed mappings, it does not follow [10] that a quasi-compact 
mapping is quasi-compact when restricted to an inverse set, not even to its
kernel [= the set of all $x \in X$ for which $x = f^{-1}f(x)$]. This fact has important implications
in the case of invariance of connectedness under the inverse $f^{-1}$ of a
mapping $f\colon X \Rightarrow Y$. In case point inverses are connected then $X$ will be connected
if $Y$ is connected, provided $f$ is quasi-compact. However, it does not follow from
this that the restriction $f \mid I$ of $f$ to an inverse set $I$ will enjoy the same property,
because this restriction may fail to be quasi-compact. What is needed here is
that $f$ be hereditarily quasi-compact, i.e., quasi-compact on every inverse set.
It turns out that this latter property can be assured for all quasi-compact mappings
onto a given range space $Y$, provided $Y$ is an accessibility space [11]. A regular
Hausdorff space $Y$ is an accessibility space provided each limit point $p$ of a
set $M$ in $Y$ is accessible by closed sets in $Y$ in the sense that $M + p$ contains a closed
set C having p as a limit point. Any quasi-compact mapping onto a regular ac- 
cessibility space is automatically hereditarily quasi-compact. Thus, in particular, 
if point inverses for such a mapping are connected then connectedness is fully 
invariant under the inverse. The notion of accessibility space is readily extended 
to spaces other than regular. Thus it may be said that for mappings with con- 
nected point inverses, connectedness is invariant both forward and backward, 
provided the mapping is hereditarily quasi-compact and this is true for all quasi- 
compact mappings into accessibility range spaces. In particular it holds for such 
mappings into locally compact or first countable range spaces as these are al- 
ways accessibility spaces. We may summarize by saying: under a compact 
mapping compactness is invariant both forward and backward; and under a 
quasi-compact weakly monotone mapping onto an accessibility space, connected- 
ness is invariant both forward and backward. 


3. Monotoneity. We next consider compactness and quasi-compactness of a 
mapping and the compact part of a mapping in a somewhat more restricted 
setting involving local compactness of the domain and monotoneity of the 
mapping. A mapping $f\colon X \to Y$ is monotone, provided its point inverses $f^{-1}(y)$ are
continua, i.e., compact connected sets. The sequence of results which we indi- 
cate next leads to interesting applications for mappings on the Euclidean spaces. 


(1) If $X$ is locally compact, every quasi-compact monotone mapping $f\colon X \Rightarrow Y$ is
closed (and thus compact) and its range $Y$ is also locally compact.


To prove this, take any closed set $K$ in $X$ and any $p \in Y - f(K)$, and let
$P = f^{-1}(p)$. Then take an open set $U$ about $P$ with $\bar{U}$ compact and $\bar{U}\cdot K = \emptyset$,
using compactness of $P$ and the fact that $X$ is a Hausdorff space. Then $\partial U$ and
$C = f(\partial U)$ are compact and thus closed. Let $W = Y - C$ and $R = U\cdot f^{-1}(W)$. Then
$R$ is open, $P \subset R \subset U$, and $R = f^{-1}f(R)$ because $f^{-1}(y)$ is connected and meets $R$
for each $y \in f(R)$ but cannot meet $\partial U$. Accordingly $f(R)$ is open and contains $p$
but does not meet $f(K)$. Thus $f(K)$ is closed. That $Y$ is locally compact follows
from the invariance of local compactness under closed mappings. 


(2) If $X$ and $Y$ are noncompact but locally compact spaces, a mapping $f\colon X \Rightarrow Y$
is compact if and only if it has a continuous extension $f^*$ from the 1-point compactification
$X^*$ of $X$ to the 1-point compactification $Y^*$ of $Y$.


The extension is unique when it exists. To prove (2) we suppose $f$ is compact.
Let $X^* - X = x^*$, $Y^* - Y = y^*$ and define $f(x^*) = y^*$. Let $V$ be any open set
in $Y^*$. If $V$ contains $y^*$, $Y^* - V = Y - V$ and this set is compact. Accordingly

$$f^{-1}(Y - V) = X^* - f^{-1}(V) = X - f^{-1}(V)$$

and this set is compact since $f \mid X$ is compact. Hence $X^* - f^{-1}(V)$ is closed so that
$f^{-1}(V)$ is open in $X^*$. If $V$ doesn't contain $y^*$, $f^{-1}(V)$ is open in $X$ and thus also
in $X^*$. Hence the extension of $f$ to $X^*$ is continuous.

On the other hand, suppose $f$ has a continuous extension $g\colon X^* \to Y^*$, so that
f=g| X. Then since X* and Y* are compact but Y is noncompact, g must be an 
onto mapping and it obviously is compact. Thus f is compact, because X is an 
inverse set for g and any restriction of a compact mapping to an inverse set is 
compact. 


(2′) COROLLARY. If $X$ and $Y$ are locally compact, a monotone mapping $f\colon X \Rightarrow Y$
is compact if and only if it has a continuous monotone extension from $X^*$ to $Y^*$.


Proof for Corollary (2’). In case neither X nor Y is compact the proof for (2) 
shows that when f is compact its extension to X* is monotone when f is mono- 
tone, and that when f has a continuous extension to X*, f is automatically com- 
pact. 

There remains the case in which X or Y is compact. If X is compact, Y is 
necessarily compact and all mappings from X to Y are compact and all have 
trivial extension to $X^*$, which is the same as $X$. If $Y$ is compact, then for $f\colon X \Rightarrow Y$
compact, $X = f^{-1}(Y)$ is compact, and again $f$ is trivially extendable to $X^*$. On the
other hand, still assuming $Y$ compact, if $f$ has a monotone extension $g$ to $X^*$
and if $X^*$ were different from $X$, say $X^* - X = x^*$, we would have $g(x^*) = y_0 \in Y$
and $g^{-1}(y_0) = x^* + f^{-1}(y_0)$, a disconnected set, contrary to monotoneity of $g$.

Monotoneity is essential in (2’). For if f is the 1-1 mapping of a plane (or 
punctured sphere) obtained by identifying the two poles of a sphere and then 
deleting one of the poles from the domain of f, then f has a continuous but non- 
monotone extension to its 1-point compactification (the sphere) but f is not 
compact of course. 


(2’’) COROLLARY. Every rational function w =f(z) generates a compact mapping 
of the z-plane onto the w-plane. 


For any function $f\colon X \Rightarrow Y$ and any subset $Y'$ of $Y$, any subset $X'$ of $X$
mapping onto $Y'$ is called a trace of $Y'$ under $f$. Note that a trace of $Y$ itself is the
same as a cross section of f as used earlier in this discussion. 


(3) THEOREM. If $f\colon X \Rightarrow Y$ is a monotone mapping where $X$ is locally compact,
then any set $H$ in $Y$ which has a compact trace $K$ has a compact inverse [12].


We set $N = f^{-1}(H)$ and consider the restriction $f \mid N = g$ of $f$ to $N$. Since $K$ is
compact the mapping $g \mid K\colon K \Rightarrow H$ is compact and thus, in particular, quasi-compact.
Since $K$ is a cross section for the mapping $g\colon N \Rightarrow H$, it follows that $g$
is quasi-compact. Since $g$ is also monotone it follows by (1) that $g$ is compact.
Accordingly the set $g^{-1}(H) = N = f^{-1}(H)$ is compact.


(3′) COROLLARY. If $X$ is locally compact and $f\colon X \Rightarrow Y$ is a monotone mapping,
for any compact set $K$ in $X$, $f^{-1}f(K)$ is compact. If $K$ is a continuum so also is
$f^{-1}f(K)$.


(3′′) COROLLARY. A monotone mapping on a locally compact space is compact
if and only if each compact set in $Y$ has a compact trace.


It may be noted that the monotoneity condition in this theorem is quite es- 
sential, even though point inverses are compact. For if X is the part of the graph 
of the parabola $y^2 = x$ for which $-1 < y \le 1$, the vertical projection of $X$ onto the
interval $I = [0, 1]$ meets all conditions except monotoneity and yet $I$ has a compact
trace but a noncompact inverse.


4. An application. Invariance of the plane. One of the most outstanding
results in the field of topology asserts that if we decompose the Euclidean plane 
into an upper semi-continuous collection of disjoint continua, not separating the 
plane, the decomposition space is itself a topological plane and thus homeo- 
morphic with the original plane. This theorem (R. L. Moore [13]) still has no 
satisfactory simple extension to higher dimensional Euclidean spaces. We indi- 
cate next how a form of this fine theorem may be obtained from the sequence of 
results just discussed. 

For the purposes of this discussion we define an $S^2$ (2-sphere) as a nondegenerate
unicoherent Peano continuum containing a simple closed curve and separated
irreducibly by every such curve it contains. Similarly an $E^2$ is any locally
compact Hausdorff space whose 1-point compactification is an $S^2$ or, equivalently,
which is homeomorphic with the complement of some point on some $S^2$.
That any $S^2$ or $E^2$ is homeomorphic with an ordinary 2-sphere or plane respectively
will not be used or proved in this discussion, although this is the case as is
well known. Indeed the condition of unicoherence is redundant and imposed 
only for convenience in this discussion. 

We first sketch briefly a proof for 


(4) THEOREM. The property of being an S² is invariant under nonconstant,
nonseparating monotone mappings onto Hausdorff spaces.


It is necessary first to establish the following sequence (i)-(v) of results in
any S².

(i) JORDAN CURVE THEOREM: Every simple closed curve J on S² separates
S² into exactly two components and is the boundary of each of these. The only
thing left to prove here is that there are just two components of S² − J. This is
accomplished by constructing an auxiliary simple closed curve J′ using two
disjoint arcs of J and arcs in two of the components of S² − J. Existence of a
third component of S² − J would force all of J into the closure of a single
component of S² − J′ so that J′ could not separate S².


(ii) θ-CURVE THEOREM: For any θ-curve θ on S², S² − θ has exactly three
components and each of these is bounded by a unique pair of the edges of θ. This is
an easy consequence of the Jordan Curve Theorem.


(iii) BOUNDARY CURVE THEOREM (Torhorst): The boundary B of any
component R of the complement of a Peano continuum M on S² is itself a Peano
continuum. This is proven by showing that B is a continuum by unicoherence of
S² and, if it is not locally connected, a θ-curve θ can be constructed in M so that
each of its 3 open edges can be connected to B without meeting either of the
other two edges, thus contradicting the fact that R lies in just one of the three
complementary regions of θ on S².


(iv) SEPARATION THEOREM: If A and B are continua on S² so that A·B is
totally disconnected and both A₀ = A − A·B and B₀ = B − A·B are connected,
then S² contains a simple closed curve J with J·(A+B) = A·B which separates
A₀ and B₀ in S². Further, if B does not separate S², then, for any σ > 0, J can be
chosen so that the component of S² − J containing B₀ lies in the σ-neighborhood
of B.


This is proven by covering B₀ with the union of a sequence of regions whose
closures are Peano continua not meeting A and so that only finitely many are
not in the ε-neighborhood of A·B for each ε > 0. Then by unicoherence of S², Q is
a Peano continuum. It then follows that the boundary J of the component R of
S² − Q containing A₀ meets all our requirements. (Note that in case B doesn’t
separate S², for a given σ we may suppose A₀ contains all points of S² not in the
σ-neighborhood of B.)

To see this we first note that J must be either an arc or a simple closed curve.
For if J contains a simple closed curve C, a simple θ-curve construction along
with (ii) shows that J must reduce to C. On the other hand if J were a dendrite
it could have no branch point because again a θ-curve construction in J + R
would likewise lead to a contradiction of (ii).

The proof is then completed by applying the part of the conclusion already
established to show that J cannot be a simple arc because its end points would
be accessible from R so that J would lie on a simple closed curve in S² and hence
could not separate S². That any point p of J would be accessible from R follows
from the fact that R + J is locally connected because, by the part already proven,
p lies in an arbitrarily small region in S² bounded by an arc or a simple closed
curve C which meets J in at most two points so that R·C would have at most
three components.

(v) DYGON THEOREM. A dygon is a continuum D which is the union of two
continua A and B meeting in exactly two points x and y (the vertices of D) and
so that A₀ = A − (x+y) and B₀ = B − (x+y) (the edges of D) are connected sets.

The Dygon Theorem asserts that: Every dygon D on S² separates S²; and if the
boundary of any component of S² − D meets both edges of D it also contains both
vertices of D.


The proof is an easy application of (iv) and (ii). For (iv) yields a simple
closed curve J in S² separating the edges of D and meeting D in just its vertices
x and y. The two open arcs of J from x to y are then separated in S² by D, since
otherwise one can easily construct a θ-curve containing J and meeting D only in
x+y. This contradicts (ii).

The Dygon Theorem yields the following key lemma: 


LEMMA. Let φ: S ⇒ Σ be monotone, where S is an S². Any simple closed curve J
in Σ containing two distinct points x and y with unique inverses separates Σ; and if
the boundary of any component of Σ − J meets both open arcs xy of J, it must also
contain both x and y.


Proof of Lemma. Since φ is monotone, φ⁻¹(J) is a dygon D with vertices
x′ = φ⁻¹(x) and y′ = φ⁻¹(y). Hence D separates S so that J separates Σ by
monotoneity of φ. Likewise, if Q is any component of Σ − J whose boundary B meets
both open arcs xy of J, the boundary of φ⁻¹(Q) meets both edges of D and thus
contains both x′ and y′, so that B necessarily contains both x and y.

With this lemma we are now in position to prove (4), the Sphere Invariance 
Theorem: 


The property of being an S² is invariant under nonconstant monotone
nonseparating mappings onto a Hausdorff space.


Proof. Let f: S ⇒ Σ be such a mapping where S is an S². Then Σ is a locally
connected unicoherent continuum since these properties are invariant under
monotone mappings. Also Σ has no cut point since f is nonseparating, and thus Σ
contains a simple closed curve.

Thus if Σ is not an S² it must contain a simple closed curve J such that Σ − J
either (1) is connected or (2) contains a component whose boundary is an arc ab
of J. Let x and y be distinct points of J, chosen so that they separate a and b on
J in case (2) holds.

Decompose S into the continua f⁻¹(x), f⁻¹(y) and individual points of
S − f⁻¹(x) − f⁻¹(y). Let φ: S ⇒ S′ be the natural mapping of this decomposition.
Then φ is monotone, nonconstant and nonseparating, so that S′ is a unicoherent
locally connected continuum containing a nondegenerate simple closed curve.
Also, if J′ is any simple closed curve in S′, and a₁ and b₁ are any two points of J′,
there always exist points x₁ and y₁ with unique φ inverses so that x₁ and y₁
separate a₁ and b₁ on J′. Hence, by the lemma, J′ separates S′, and no simple arc ab
of J′ can separate S′. Accordingly S′ is an S².

Now define g: S′ ⇒ Σ by g(p) = fφ⁻¹(p) for p ∈ S′. It is verified at once that g
is monotone and nonseparating. Thus since g⁻¹(x) and g⁻¹(y) are single points of
S′, the lemma applied to g shows (a) that J separates Σ so that case (1) cannot
arise and (b) that there can be no component of Σ − J whose boundary is an arc
ab of J as required by case (2). Thus the supposition that Σ is not an S² leads to a
contradiction.

As an easy consequence we now get 


(5) THEOREM. The property of being an E² is invariant under quasi-compact
monotone nonseparating mappings onto Hausdorff spaces.


For let φ: X ⇒ Y be such a mapping where X is an E². Then φ is closed and Y
is locally compact by (1). Thus by (2), φ has a continuous extension φ*: X* ⇒ Y*
to the 1-point compactification X* of X onto that of Y. Then X* is an S² and φ*
is nonconstant, nonseparating and monotone and Y* is a Hausdorff space. Thus
by (4), Y* is an S²; and hence Y itself is an E².

REMARK. In view of the well-known fact, not proven here (see above), that
all S²’s are homeomorphic with each other, as are also all E²’s, as a consequence of
(4) and (5) we have the classical result of R. L. Moore which asserts that:


The decomposition space for any nontrivial upper semicontinuous decomposition
of the plane, or 2-sphere X, into disjoint continua not separating X, is
homeomorphic with X.


5. The compact part of a mapping. For any mapping f: X ⇒ Y we define
(i) Q as the union of the interiors of sets in Y having a compact inverse and
P = f⁻¹(Q), (ii) Q′ as the union of the interiors of the images of all compact sets
in X. It results at once that P, Q and Q′ are open sets, and that the mapping
f|P: P ⇒ Q is compact. Also, using Theorem (3) above, we get at once


THEOREM A. Let X be locally compact. If f is monotone then Q = Q′. If f is (1-1),
Q is exactly the set of points of continuity of f⁻¹.


THEOREM B. Let X be locally compact and have a countable base and suppose Y
is a complete metric space. Then Q′ is dense in Y. Further, if f is monotone Q = Q′;
and if f is 1-1, f maps P topologically onto Q.


6. One-to-one Mappings. These are always of special interest. One is natu- 
rally concerned with conditions under which they are homeomorphisms and with 
the extent to which they preserve the topological structure of their domain 
spaces even when they are not homeomorphisms. 

We have already noted that w = e^{iθ}, 0 ≤ θ < 2π, maps the half open interval
onto the circle |z| = 1 in 1-1 fashion and also that the plane can be mapped
similarly onto the pinched 2-sphere obtained by identifying its poles. In both of 
these cases the range space is compact and the domain locally compact. Of course 
if the domain space is compact, the mapping is a homeomorphism. Also any 
quasi-compact 1-1 mapping is a homeomorphism. 
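
The first of these examples is easy to probe numerically. The sketch below is illustrative only (the function names `wrap` and `unwrap` are assumptions introduced here): it shows two points of the circle that are very close together while their inverse images lie at opposite ends of the half open interval, so the inverse of the 1-1 mapping cannot be continuous.

```python
import cmath
import math

def wrap(theta):
    """The 1-1 continuous map theta -> e^{i theta} of [0, 2*pi) onto the unit circle."""
    return cmath.exp(1j * theta)

def unwrap(w):
    """Inverse map: the angle of w, normalised to lie in [0, 2*pi)."""
    return cmath.phase(w) % (2 * math.pi)

eps = 1e-6
theta_low, theta_high = eps, 2 * math.pi - eps

# Distance between the two image points on the circle (roughly 2 * eps).
circle_gap = abs(wrap(theta_low) - wrap(theta_high))

# Distance between their inverse images in the interval (roughly 2 * pi).
angle_gap = abs(unwrap(wrap(theta_low)) - unwrap(wrap(theta_high)))
```

Shrinking `eps` makes `circle_gap` as small as desired while `angle_gap` stays near 2π, which is the failure of continuity of the inverse at the point w = 1.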

In general the 1-1 continuous image of even a simple noncompact space
may be quite different in structure from that of the original. For example such
an image of the line may fail to be locally connected, as is well known. If we
consider only 1-1 mappings onto locally connected spaces, significant results have
been found by McAuley and Lelek [14] and Jones [15] in case the domain is a
line or a plane. The only possible 1-1 locally connected continuous images of the
line, for example, are the line, figure eight, noose, dumbbell or θ-curve.

In this particular case it may be noted that if we impose the additional
condition of unicoherence on the image space, then the only possible image of the
line is the topological line itself. Further, in this case, the mapping itself must
be topological. This and other related observations and results led to the
conjecture at one time that a 1-1 mapping from a locally connected locally compact
connected separable metric space onto a unicoherent space of the same sort
would necessarily be a homeomorphism. Indeed such a result was asserted [16]
and thought to be proven at one time. However, examples were given by
Kenneth Whyburn [17] and L. C. Glaser [18] showing that this is not true and,
indeed, not even when the image space is Eⁿ for n ≥ 3. The case of 1-1 mappings
of such spaces onto E² has been studied by E. Duda [19] who has shown that
these mappings are homeomorphisms provided the domain spaces satisfy some
simple additional restrictions.

A fairly simple example in which the image is a unicoherent Peano continuum
(but not a Euclidean space) may be constructed as follows. Let Δ be an
equilateral triangle ABC including its interior and let O be the center of Δ. Then
add to Δ three solid triangles A′OB, B′OC, and C′OA meeting Δ in just the
segments OB, OC, OA respectively, and meeting each other by pairs only in O.
Let X′ be the union of these 4 solid triangles, and let X′ be mapped onto a space
Y by identifying A′ with A and mapping A′B linearly onto AB; similarly B′C is
mapped onto BC and C′A onto CA. Otherwise the mapping is 1-1. Thus Y
consists of Δ plus three open pockets created by identifying points on A′B with
corresponding points on AB, and similarly for B′C and C′A. Finally, we delete
from the domain X′ of the mapping all points on the periphery of the original
triangle Δ = ABC. Let X be the resulting space X = X′ − (periphery of Δ), and
f: X ⇒ Y the restriction of the mapping to X. Then f is 1-1 and continuous, X is
locally compact, connected and locally connected, and Y is a unicoherent Peano
continuum, but of course f is not a homeomorphism. (Indeed, Y is compact but
X is not.)

It is not enough, in general, to assume X and Y homeomorphic with each 
other in order to assure that every 1-1 mapping of X onto Y will be a homeo- 
morphism, even when these spaces are locally connected generalized continua 
(=locally compact, connected separable metric spaces). It is true, however, that 
in case both X and Y are Euclidean spaces, any 1-1 mapping of X onto Y must 
be a homeomorphism. This conclusion is obtainable as a consequence of the 
Brouwer Theorem on invariance of openness in Euclidean spaces. Indeed we can 
get a somewhat better result by formulating the Brouwer Property in a broader 
setting as follows. 

A space X is said to have the Brouwer Property [12], provided any subset of
X which is homeomorphic with some open set in X is itself open in X. All
Euclidean spaces have this property by Brouwer’s Theorem, as do also all Euclidean
manifolds. However, a large class of spaces which are not Euclidean also have
this property. An example is the set obtained by taking the 1-point
compactification of a surface of infinite genus. Also the generalized closed manifolds of
Wilder have the Brouwer Property although they are not all Euclidean by any
means. A straightforward argument suffices to prove


THEOREM. If X and Y are homeomorphic locally compact spaces having the
Brouwer Property, any 1-1 mapping of X onto Y is a homeomorphism.


Now in case X and Y are Euclidean spaces one of which is mappable onto 
the other by a 1-1 mapping, it results from Theorem B in section 5 above that 
X and Y are of the same dimension and thus are homeomorphic with each other. 
Thus any 1-1 mapping of one Euclidean space onto another is a homeomorphism. 

For further examples and results in connection with the Brouwer Property 
the reader is referred to a study made by E. Duda [20]. 


7. The compactness problem. Results of this sort in the 1-1 case led the
author many years ago to formulate and study the problem of determining
conditions under which a mapping of a Euclidean space Eⁿ onto itself is necessarily
compact. The cases n = 1, 2 were solved and appeared in a paper published in
1959 [12]. For n = 1 we need only assume compactness of point inverses and for
n = 2 it is enough to assume them to be continua, i.e., assume the mapping
monotone. Higher dimensional cases of the problem present considerably greater
difficulty. We now consider these various cases in some detail.


(i) n = 1. If X and Y are lines (E¹), a mapping f: X ⇒ Y is compact if (and
only if) it has compact point inverses.


Let f have compact point inverses and take any compact set K in Y. Let ab
be an interval in X whose interior contains f⁻¹(α + β), where αβ is an interval in
Y whose interior contains K. Then neither −∞a nor b∞ can meet f⁻¹(K). For if,
say, −∞a meets f⁻¹(K), f(−∞a) lies wholly in αβ so that f(−∞a + ab) is
bounded. Thus f(b∞) would meet both −∞α and β∞ and thus would contain
α and β, which is impossible.

In terms of real valued functions this result says that if f is a real function of
a real variable continuous everywhere on E¹ which takes each real value at
least once, but only on a compact set, then

    lim_{x → ∞} f(x) = ±∞,    lim_{x → −∞} f(x) = ∓∞.
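
A concrete instance may help fix ideas. In the sketch below the cubic f(x) = x³ − 3x is an illustrative choice made here, not an example from the text: it is continuous, takes every real value at least once and at most three times (so its point inverses are compact), and therefore its inverse of a compact interval should be bounded. The search window and the sampling grid are likewise assumptions of the sketch.

```python
def f(x):
    """Illustrative map of the line onto itself with finite point inverses."""
    return x ** 3 - 3.0 * x

def preimage_hull(lo, hi, search=50.0, steps=200_001):
    """Smallest and largest sampled x in [-search, search] with lo <= f(x) <= hi."""
    xs = [-search + 2.0 * search * k / (steps - 1) for k in range(steps)]
    hits = [x for x in xs if lo <= f(x) <= hi]
    return min(hits), max(hits)

# The inverse of the compact interval [-10, 10] stays inside a bounded interval,
# even though the search window [-50, 50] is much larger.
left, right = preimage_hull(-10.0, 10.0)

# Behaviour at the two ends of the line, matching the limit statement.
far_right, far_left = f(1.0e3), f(-1.0e3)
```

By contrast, a bounded function such as the arctangent has compact point inverses but is not onto the line, so the result does not apply to it.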


(ii) n = 2. In the case of a mapping of a plane onto a plane it is not enough to
have compact point inverses in order to make the mapping compact. For the
function
i.e., w=3”, where


ny 
f= —— + iy, 

x+1 
maps the complex plane onto itself and yet f takes on each value either one or 
two times. However, as indicated earlier, it is sufficient (though not necessary, 
of course) that point inverses be continua. 


THEOREM. If X and Y are planes, any monotone mapping f: X ⇒ Y is compact
(and nonseparating).


Proof. The map f generates a decomposition of X into continua. Let Y′
denote the decomposition space and φ: X ⇒ Y′ the decomposition map. There is
the one-to-one map h: Y′ ⇒ Y with hφ = f. Now φ is quasi-compact, hence
compact by (1) of section 3. We therefore have a unique monotone map φ*: X* ⇒ Y′*
as in (2) of section 3. By the general form of Moore’s Theorem, Y′* is a cactoid.

Case 1: Y′* is a sphere. In this case Y′ is a plane. Hence, the one-to-one
map h: Y′ ⇒ Y must be open and a homeomorphism by Brouwer’s Theorem.
Hence f is compact in this case.

Case 2: Y′* contains a sphere as a proper subset. No sphere can map
one-to-one into the plane, hence Y′ must contain a plane E which has limit points of
Y′ − E. Then h maps E homeomorphically onto an open subset of Y. Since E
contains limit points of Y′ − E, it is impossible to extend h: E → Y to a continuous
one-to-one map of Y′ into Y.

Case 3: Y′* is a dendrite. By Theorem B, h must be a homeomorphism of
some nonempty open subset onto an open subset of Y. But every open subset of
a dendrite has cut points, whereas no open subset of the plane has cut points.

Hence f is compact. The mapping f*: X* ⇒ Y* is clearly nonseparating, hence
so also is f: X ⇒ Y.

Since the mapping f in this theorem turns out to be automatically nonsep- 
arating, in view of results on invariance of the plane discussed earlier we have the 


COROLLARY. Let f: X ⇒ Y be a monotone mapping where X is a plane and Y is a
Hausdorff space. Then Y is a topological plane if and only if f is compact and
nonseparating.


(iii) n ≥ 3. These cases were discussed by the author at the Georgia
Conference on Topology of 3-manifolds in 1961 (see Proceedings of this Conference,
by M. K. Fort, Prentice-Hall, 1961). It was conjectured that a 1-monotone
mapping of E³ onto E³ is always compact and, possibly, an (n−2)-monotone
mapping of Eⁿ onto Eⁿ is compact, a mapping being r-monotone provided its point
inverses have trivial homology groups in dimension ≤ r. The conjecture in this
form would reduce to exactly the above result in (ii) for n = 2 and to the
conjectured one for n = 3. This problem and conjecture attracted considerable
attention and effort on the part of those in attendance at the conference and others
who learned of it later. It was shown by Connell at that time that acyclic maps
of Eⁿ onto Eⁿ are always compact, where “acyclic” means that homology groups
of all dimensions of point inverses are trivial. See the above mentioned
Proceedings.

In 1965 a positive solution to the problem, verifying the conjecture for
general n in the form given above, was published by Väisälä, along with a number
of related results. Methods of proof required, however, are much more algebraic
and less elementary than needed in the cases n = 1, 2. An independent proof of
this same general result, that (n−2)-monotone mappings of Eⁿ onto Eⁿ are compact,
was found by Conner and Jones and distributed in manuscript form. Their
methods differ from those of Väisälä in that they depend more directly on
homology theory and exact sequences rather than on the topological index.
However, both proofs are far from elementary; and a simple direct set-theoretic type
of proof for this fine theorem for general n would be highly desirable and would
represent a valuable contribution to mathematical knowledge.

The question as to the existence of noncompact monotone (= 0-monotone)
mappings of Eⁿ onto Eⁿ for n ≥ 3 has been studied in detail by Glaser [18]. He
has shown that such maps do indeed exist for n ≥ 4 and has further results on
existence of monotone maps of E^k onto Eⁿ, where k and n are not necessarily
equal. At the 1969 Georgia Conference, R. H. Bing gave an example of a
monotone mapping of E³ onto E³ which is not compact.

Also it would be of interest to determine just what properties of Euclidean 
spaces are essential in making these results on compactness valid. Can one 
formulate sufficient conditions on a space, perhaps in terms of its homology or 
homotopy groups or contractibility or retractability into subsets which will 
suffice to yield similar results on compactness to the ones discussed here for 
Euclidean spaces? Possibly the Brouwer Property could be useful in this con- 
nection. 


8. Compactification of mappings. The existence of various forms of
compactifications of a topological space X (i.e., the topological imbedding of X in a
larger compact space in which X is dense) such as the one-point compactification,
the Stone-Cech compactification, etc., leads naturally to the question as
to whether certain types of mappings can be compactified in some similar sense.
This question arose naturally also in connection with a result of Vainstein [5] to
the effect that any closed mapping has a partial mapping (restriction) which is
compact and in which the image space is the same as the image of the original domain
space. The usual dual relationship between open and closed (sets, mappings, etc.)
led the author to anticipate that open mappings may be related in some analogous
way to compact mappings. More particularly, it was anticipated that if the
domain space were suitably augmented, the mapping could be extended so as to
become compact, so that the given mapping would be exhibited as a partial
mapping of a compact one.

This was found to be fully correct. In a paper published in 1953 [21] it was 
shown that in a suitably constructed space unifying both the domain and range 
spaces, a compact mapping can be defined which is topologically equivalent to
the given one on the prototype of the original domain space. This turned out to
be true for arbitrary mappings, not just for open ones. Thus it was shown that 
any mapping from one Hausdorff space to another is topologically equivalent to 
a partial mapping of a compact one (actually a retraction) in the unified space. 
The extended mapping is open when the original one was open; and the unified 
space is separable and metrizable when the given spaces are locally compact, 
separable and metric. More recently H. Bauer [22] had occasion to use such a 
result in his studies on measure preservation of mappings and, independently, 
he developed and applied almost exactly the same extension to show that any 
mapping between locally compact spaces is a partial mapping of a conservative 
one. 

Sketched briefly, we begin with an arbitrary mapping f: X → Y, where X and
Y are disjoint Hausdorff spaces, and define the unified space Z to consist of all
points in the union of X and Y with a topology in which a set Q ⊂ Z is open if
and only if


(i) Q·X and Q·Y are open in X and Y respectively, and
(ii) for any compact set K ⊂ Q·Y, f⁻¹(K)·(X − Q) is compact.


It turns out that the injection mappings of X and Y into Z are open and
closed respectively and thus are homeomorphisms. Further, if we define
r(x) = f(x) ∈ Z for x ∈ X and r(x) = x ∈ Z for x ∈ Y, then r is continuous and compact
and hence is a compact retraction of Z onto Y. Since r|X is f followed by the
injection of Y into Z it is therefore topologically equivalent to f. If f is an open
mapping, so also is r. If X and Y are locally compact, Z is locally compact and
Hausdorff; and, in this case, if X and Y are also separable and metric so also
is Z.

In general, the set X is not necessarily dense in the space Z, i.e., X̄ ≠ Z unless
f is everywhere noncompact, where X̄ denotes the closure of X in Z. However, X
is dense in X̄ of course and the partial mapping


r|X̄: X̄ → Y


is a compact extension of f to X̄. Note also that since r is a retraction and
X̄ − X ⊂ Y, r|(X̄ − X) is the identity mapping. The original mapping, considered
as a mapping of X into its image, i.e., f: X → f(X), is a compact mapping if and
only if both r and r⁻¹ reduce to the identity on the set X̄ − X. The important
special case of a 1-1 mapping f: X → Y which is compact relative to f(X), i.e., in
case f is a homeomorphism of X onto f(X), is of special interest here. For if f is a
homeomorphism of X onto f(X), it turns out that r extends f to a homeomorphism
of X̄ onto the closure of f(X).

The unified space for a mapping has been further studied by R. Dickman
[23] who developed, in particular, conditions for unicoherence of this space and
implications of its unicoherence so far as the mapping is concerned.

The process of compactifying a mapping has been studied from a more
general viewpoint by Dickman [24], Cain [25], and others, and quite interesting
results obtained. A compactification of a mapping f: X ⇒ Y is defined as a pair
(X*, f*) where X* is a Hausdorff space containing X as a dense subspace and
f*: X* ⇒ Y is a closed continuous extension of f from X* onto Y having compact
point inverses. Thus f* is a compact mapping and f*|X = f.

Dickman [26] has developed the concept of maximal and minimal
compactifications of spaces and, correspondingly, of mappings. Cain [27] has developed
two different procedures for characterizing all possible compactifications for a
given mapping. In each case there is associated with each possible
compactification X′ of the domain space X a compactification of the mapping f: X ⇒ Y in a
unique way so that the mapping compactification is determined by the space
compactification and so that every possible mapping compactification is
obtainable by the given process from a space compactification.

The first of these procedures employs filter space compactifications of spaces
as developed by Wagner. The second uses in a similar way the ring
compactifications of spaces provided by the structure space of the subring R of the ring of
all bounded continuous real valued functions on the domain space X. This
structure space has as points the maximal ideals in R and is a Hausdorff
compactification of X. (See Wagner [28].)


HAMILTONIAN MECHANICS AND GEOMETRY
SAUNDERS MAC LANE, University of Chicago


1. Introduction. Let me begin by thanking the organizers of the Chauvenet 
Symposium for the opportunity to talk at the dedication of Chauvenet Hall at 
the U.S. Naval Academy. It is a pleasure to honor the long tradition of devotion 
to Mathematics at the Academy, and to recognize again the impetus which 
William Chauvenet gave, in his teaching and writing, to the importance of ef- 
fective exposition. Mathematical ideas do not live fully till they are presented 
clearly, and we never quite achieve that ultimate clarity. Just as each generation 
of historians must analyse the past again, so in the exact sciences we must in 
each period take up the renewed struggle to present as clearly as we can the 
underlying ideas of mathematics. 

The thesis of this article is that classical mechanics and recent conceptual 
methods in abstract mathematics have a lot to do with each other. Many ab- 
stract ideas of pure geometry originally arose in the study of mechanics; for ex- 
ample, the cotangent bundle of a differentiable manifold appeared first as the 
phase space of Hamiltonian mechanics. Many cumbersome developments in the 
standard treatments of mechanics can be simplified and better understood when 
formulated with modern conceptual tools, as in the well-known case of the use of 
the “universal” definition of tensor products of vector spaces to simplify some of 
the notational excesses of tensor analysis as traditionally used in relativity 
theory. This article will develop a few more such cases. It has been stimulated 
by the pioneering work of Mackey [12], Smale [13], Abraham [1], Sternberg 
[14], and many others. 


2. Quantities. Newton’s Laws give differential equations for the trajectory
of a mass point moving in Euclidean 3-space or along some surface or curve in
that space. When there are several such points (say k of them) each moving in
3-space, they can be regarded as a single point in 3k-space. Hence we consider
Newton’s Laws for a point moving in any “configuration space” M. On M we
are to deal with various measurable physical “quantities” such as the distance of
a point from the origin, or the potential energy of a point of given mass, or the
x and y coordinates of a point in the plane, or the latitude and longitude of a
point on an equatorial belt U of a sphere S². Each such quantity is a continuous
real-valued function g defined on all the points of M or of some “piece” U of M.
These quantities, taken together, have a number of formal properties: To define
a quantity g on the union U′ ∪ U″ of two pieces, it suffices to define quantities
g′ on U′ and g″ on U″, so that they agree on the intersection U′ ∩ U″; if g and h
are quantities defined on U, so is any smooth function f(g, h) of g and h. These
properties (the list is not complete) which describe quantities on configuration
space have a direct mathematical interpretation. Take configuration space to be
a differentiable manifold (of class C^∞), the pieces U of the space M to be open
sets, and the quantities defined on U to be those continuous functions g: U → R
to the real numbers R which are smooth functions (of class C^∞). We recall that
such a manifold can be defined as a topological space together with a suitable
function S which assigns to each open set U of the space the set S(U) of those
continuous functions g: U → R which are the smooth functions on U. The axioms
on S include those suggested above; for example, the description of a quantity
g on U′ ∪ U″ in terms of its restrictions g′ and g″ to U′ and U″ is the essential
element in the assertion that S is a “sheaf”. Using S, one also describes the
smooth maps of M to another manifold N: A function f: M → N is continuous if
the inverse image f⁻¹V of any open set V of N is open in M; it is smooth if for
each smooth h: V → R the composite hf is smooth (i.e., hf ∈ S(f⁻¹V)).
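
The gluing property of quantities can be sketched in code. This is only an illustration under simplifying assumptions made here: the pieces U′ and U″ are open intervals of the line, agreement on the overlap is checked at finitely many sample points rather than proved, and the helper name `glue` is hypothetical.

```python
import math

def glue(g1, dom1, g2, dom2, samples=1000, tol=1e-12):
    """Glue quantities g1 on dom1 and g2 on dom2 (open intervals given as pairs)
    into one quantity on the union, after a sampled check that they agree on
    the intersection."""
    lo, hi = max(dom1[0], dom2[0]), min(dom1[1], dom2[1])
    if lo < hi:  # the pieces overlap
        for k in range(1, samples):
            x = lo + (hi - lo) * k / samples
            if abs(g1(x) - g2(x)) > tol:
                raise ValueError("quantities disagree on the intersection")

    def g(x):
        if dom1[0] < x < dom1[1]:
            return g1(x)
        if dom2[0] < x < dom2[1]:
            return g2(x)
        raise ValueError("point lies outside the union of the two pieces")

    return g

# Two descriptions of the same quantity on overlapping pieces (0, 2) and (1, 3).
g = glue(math.sin, (0.0, 2.0), lambda x: math.cos(x - math.pi / 2), (1.0, 3.0))
```

Attempting to glue quantities that differ on the overlap, for instance the sine on (0, 2) with the cosine on (1, 3), raises an error, mirroring the requirement that g′ and g″ agree on the intersection.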

The axioms for a manifold M also describe the dimension n of M in terms of
the existence of coordinates. Early in mechanics it was noted that problems are
often best treated not with the original (rectangular) coordinates x¹, ···, xⁿ,
but in terms of other “curvilinear” coordinates, say q¹, ···, qⁿ. These appear in
the axiom on M which requires for each point a ∈ M a neighborhood U (a
“coordinate neighborhood”) and n smooth functions q¹, ···, qⁿ: U → R such that the
correspondence p ↦ (q¹(p), ···, qⁿ(p)) is a diffeomorphism (one-one, onto, and
smooth in both directions) of U to an open set in Rⁿ. Here the smooth functions
f on Rⁿ are to be exactly the functions f(x¹, ···, xⁿ) of class C^∞ (continuous
derivatives of all orders). Hence the axiom states that the smooth functions g
on a coordinate neighborhood U are precisely the C^∞ functions g = f(q¹, ···, qⁿ)
of the coordinates. Many basic properties of manifolds and mechanical systems
can be understood and expressed more clearly when they are formulated in a
fashion independent of any particular choice of coordinates. Thus at the very
beginning we speak of a quantity g or of a map g: U → R, not of a smooth
function g(q¹, ···, qⁿ) of some coordinates.
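
As a small illustration of a quantity versus its coordinate expressions, the sketch below takes the open half-plane x > 0 as the coordinate neighborhood U and polar coordinates as the curvilinear coordinates (q¹, q²). The choice of U, of polar coordinates, and all function names are assumptions made here for the example.

```python
import math

def polar(p):
    """Curvilinear coordinates (q1, q2) = (radius, angle) on U = {(x, y) : x > 0}."""
    x, y = p
    return (math.hypot(x, y), math.atan2(y, x))

def polar_inverse(q):
    """Inverse correspondence from coordinate values back to the point of U."""
    r, phi = q
    return (r * math.cos(phi), r * math.sin(phi))

def distance(p):
    """The quantity g: U -> R, 'distance from the origin', defined on points."""
    return math.hypot(*p)

def distance_in_polar(q):
    """The same quantity written as a smooth function f(q1, q2) of the coordinates."""
    return q[0]

p = (3.0, 4.0)
q = polar(p)
```

The quantity `distance` is attached to the point p itself; `distance_in_polar` is merely its expression in one coordinate system, and a different choice of coordinates would give a different formula for the same quantity.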

This review points up the fact that the physicists’ use of “quantities” antic- 
ipates the mathematicians’ use of smooth function, is a starting point for the 
currently active subject of sheaf theory [15], and matches the modern emphasis 
on coordinate-free presentation of geometry. Often a book on physics will
contrast a mathematical presentation of an idea (using coordinates) with a physical
presentation (not using coordinates); in such cases the contrast is really between
two mathematical presentations, one with coordinates and one coordinate-free.


3. Velocity and tangent vectors. With Newton’s Laws we may write down
the usual equations of motion for our point of mass m moving on the
configuration space M. These equations involve both velocity and acceleration, and so are
second order differential equations; initial conditions will be given by specifying
the initial position (by coordinates q¹, ···, qⁿ) and initial velocity (by the
corresponding coordinates v¹, ···, vⁿ) (the coordinate vⁱ is that often written as
q̇ⁱ, where the dot suggests “time derivative”). Taken all together, these 2n
coordinates are not coordinates on M, but coordinates on another smooth
manifold of twice the dimension, the tangent bundle T.M of M. This bundle may be
described in physical terms as the manifold of all positions-and-velocities on M,
or in pictorial terms as the manifold of all tangent arrows attached to points a of
M, or more formally as the manifold T.M whose points are all ordered pairs (a, u)
where a is a point of M and u a tangent vector at that point. To complete its
description as a manifold, one then specifies that the smooth functions are
(locally) the smooth functions of the 2n coordinates q¹, ···, qⁿ, v¹, ···, vⁿ,
taken in a coordinate neighborhood of T.M which is composed of a coordinate
neighborhood U on M together with all tangent vectors at all of its points. With
this description of the tangent bundle T.M we note that the function indicated
by (a, u) ↦ a is a smooth map π.: T.M → M, called the projection of the tangent
bundle. Under this projection the inverse image of each point a ∈ M is the
tangent space T_aM, which consists of all vectors tangent to M at a. This is not only
a submanifold, but a linear vector space, and the full linear group (of nonsingular
n × n matrices) acts on this space. These data, present and suitably interrelated
in the tangent bundle, are essential elements in the recent treatment of many
geometric ideas by vector bundles, K-theory, fiber bundles, and fiber spaces
(see Husemoller [7]): They were really discovered and used a long time ago in
mechanics.


4. Tangent and cotangent bundles. But we are ahead of our story; to
complete the description of the tangent bundle we need an invariant description
of the tangent spaces $T_a M$. Sometimes tangent vectors (or vector fields) are
described as operators on smooth functions; we want a more symmetric treatment.
Recall [9] that every finite dimensional vector space W determines in
conceptual fashion another vector space of the same dimension, the dual space
$W^*$ consisting of all linear functions $W \to R$. If $W = T_a M$ is a tangent
space, this dual is called the cotangent space $T^a M$ to the manifold M at a.
We describe them together: Every path determines a tangent vector, every
quantity a cotangent vector. In detail, let $g: M \to R$ be a quantity defined
on M (or at least in some neighborhood of the chosen point $a \in M$) and let h
be a smooth path on M passing through the point a. If we use the coordinate t
(for “time”) on the real numbers (a manifold) R, this path can be represented
as a smooth map $h: R \to M$ sending the origin t=0 to the chosen point
$a \in M$. Thus we have two smooth maps

$$R \xrightarrow{\;h\;} M \xrightarrow{\;g\;} R;$$

their composite, differentiated at t=0, yields a real number

$$(g, h)_a = \left[ \frac{d}{dt}\,(g \circ h) \right]_{t=0}.$$

If h' is a second path through a we say that h is tangent to h' at a (in
symbols, $h \equiv_a h'$) if and only if $(g, h)_a = (g, h')_a$ for all
quantities g. The tangent vector $\tau_a h$ to h at a is then defined to be
just the equivalence class consisting of all h' with $h \equiv_a h'$. Dually,
two quantities g and g' are cotangent at a (in symbols $g \equiv_a g'$) if and
only if $(g, h)_a = (g', h)_a$ for all smooth paths h through a; then the
differential (cotangent vector) $d_a g$ to g at a is just the equivalence class
of all these g'. Let $T^a M$ be the set of all cotangent vectors at a. Now all
quantities f, g, ... defined near a form a real vector space under the
operations of addition and multiplication by real scalars. These operations,
transferred to equivalence classes $d_a f$, $d_a g$, make $T^a M$ a real vector
space. Moreover, the formula above for $(g, h)_a$ defines a pairing (or
function of two variables) which assigns to a cotangent vector $d_a g$ and a
tangent vector $\tau_a h$ a real number,

$$(d_a g, \tau_a h) \mapsto \langle d_a g, \tau_a h \rangle = (g, h)_a.$$

Addition of tangent vectors is now defined so as to make this function
bilinear. This makes the set $T_a M$ of all tangent vectors to M at a into a
real vector space, and the pairing makes $T^a M$ its dual space; indeed each
$d_a g$ is the linear function $\langle d_a g, - \rangle : T_a M \to R$.

We emphasize that this conceptual description of tangent and cotangent 
vectors does match the intuitive picture of these vectors: A tangent vector to a 
path at a is an arrow measuring the velocity along the path; a cotangent vector 
is a set of level lines (contour lines straightened up) given by a quantity g defined
near a, and the pairing of the cotangent vector with the tangent vector gives the 
rate of change of the level-lined quantity along the path. 

This invariant description also agrees with the description of tangent vectors
in terms of local coordinates $q^1, \dots, q^n$ on M: The pairing is given by
the familiar formula

$$\langle d_a g, \tau_a h \rangle = \sum_{i=1}^{n} \left( \frac{\partial g}{\partial q^i} \right)_a \left( \frac{dq^i}{dt} \right)_0 , \qquad \frac{d}{dt} \text{ on } R,$$

which gives the derivative of a composite function. In writing this formula we
are dealing with n quantities $q^i: M \to R$, the coordinates on M, and a fixed
map $h: R \to M$. By this map h the quantities $q^i$ become quantities
$q^i h: R \to R$ on the 1-dimensional manifold R, and $(dq^i/dt)_0$ in the
formula refers to the derivatives of those latter composite quantities with
respect to the coordinate t on R. Thus the tangent vector $\tau_a h$ has the n
coordinates

$$v^1 = \left( \frac{dq^1}{dt} \right)_0 , \; \dots , \; v^n = \left( \frac{dq^n}{dt} \right)_0 ;$$

the tangent vector space $T_a M$ is n-dimensional, and the whole tangent bundle
$T_\bullet M$ can be made a differentiable manifold in exactly one way so that
the 2n local coordinates are $q^1, \dots, q^n, v^1, \dots, v^n$. At the same
time in the cotangent vector space $T^a M$ each vector $d_a f$ has the n
coordinates

$$p_1 = \left( \frac{\partial f}{\partial q^1} \right)_a , \; \dots , \; p_n = \left( \frac{\partial f}{\partial q^n} \right)_a ,$$

and $T^a M$ is n-dimensional. The cotangent bundle $T^\bullet M$ has as points
all pairs $(a, d_a f)$, and can be made a differentiable manifold in exactly
one way so that the 2n quantities $q^1, \dots, q^n, p_1, \dots, p_n$ become
local coordinates.
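
The agreement between the invariant pairing and its coordinate formula can be
checked numerically. The following is a minimal sketch, assuming M is the
plane with coordinates (q1, q2); the quantity `g`, the path `h`, and the helper
`pairing` are hypothetical choices made only for illustration.

```python
import math

def pairing(g, h, eps=1e-6):
    """Estimate (g, h)_a = d/dt g(h(t)) at t = 0 by a central difference."""
    return (g(h(eps)) - g(h(-eps))) / (2.0 * eps)

# Hypothetical quantity on M = R^2 in coordinates (q1, q2): g = (q1)^2 * q2.
def g(q):
    return q[0] ** 2 * q[1]

# Hypothetical smooth path with h(0) = a = (1, 2).
def h(t):
    return (1.0 + 3.0 * t, 2.0 + math.sin(t))

# Coordinates of the cotangent vector d_a g: p_i = (dg/dq^i) at a = (1, 2).
p = (2.0 * 1.0 * 2.0, 1.0 ** 2)
# Coordinates of the tangent vector tau_a h: v^i = (dq^i/dt) at t = 0.
v = (3.0, math.cos(0.0))

# Coordinate formula: sum over i of p_i * v^i.
coordinate_value = sum(p_i * v_i for p_i, v_i in zip(p, v))
# Invariant definition: derivative of the composite g(h(t)) at t = 0.
invariant_value = pairing(g, h)
```

Both values describe the same real number, the rate of change of g along h at
a; only the second uses no coordinates on M beyond those hidden inside `g` and
`h` themselves.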

There is an evident function which sends every point $(a, d_a f)$ on the
cotangent bundle to the original point a on configuration space. This is a
smooth map $\pi^\bullet$, called the projection of the cotangent bundle,
displayed as

$$\pi^\bullet : T^\bullet M \to M \quad \text{by} \quad (a, d_a f) \mapsto a.$$

In coordinates, the point $(q^1, \dots, q^n, p_1, \dots, p_n)$ projects to
$(q^1, \dots, q^n)$.


5. Differentials and differential forms. If g is a quantity defined on an open
set U of M, it determines at each point a of U a cotangent vector $d_a g$. This
determination is really a function from U to the cotangent bundle. We call this
function the differential of g: It is a smooth map

$$dg : U \to T^\bullet M \quad \text{by} \quad a \mapsto (a, d_a g).$$

This map is a cross section of the cotangent bundle in the sense that the
composite $\pi^\bullet \circ dg : U \to U$ is the identity. This cross section
dg is just the usual “differential” of the function g.

In general any cross section $\theta$ of the cotangent bundle $T^\bullet M$
over U is called a 1-form on U. In case U is a coordinate neighborhood with
coordinates $q^1, \dots, q^n$, these 1-forms $\theta$ are precisely the things
which can be written as expressions

$$\theta = f_1\, dq^1 + \cdots + f_n\, dq^n$$

in terms of the 1-forms $dq^i$ (the differentials of the coordinates) and
arbitrary smooth functions $f_i$ on U. In this expression the formal operations
of addition (of two 1-forms) and multiplication (of a 1-form by a quantity)
have the natural invariant description: Since cotangent vectors at a can be
added, we can add two 1-forms $\theta$, $\psi$ by adding their values
$\theta_a + \psi_a$ in each $T^a M$; since cotangent vectors can be multiplied
by scalars, we can multiply $\theta$ by a quantity f by the rule
$(f\theta)_a = (fa)(\theta_a)$. Under these operations the 1-forms on U
constitute a module over the ring S(U) of quantities on U. Observe by the way
that this (classical? modern?) concept of module dominates many recent
developments in algebra [9] and homological algebra [10].
The systematic use of cotangent bundles, 1-forms, and the like is a major tool
in geometry (see, for example, Flanders [4]). In a moment we shall see that the
cotangent bundle is simply the physicists’ phase space. First we pause to
consider the dual concepts. A cross section X of the tangent bundle
$T_\bullet M$ over U is called a vector field on U because it assigns, in a
smooth way, a tangent vector $X_a$ at a to each $a \in U$. Moreover, a vector
field X operates on quantities, assigning to each quantity g on U the quantity
Xg defined at each point a by $(Xg)(a) = \langle d_a g, X_a \rangle$; this is
called the Lie derivative of g along the field X. For example, in a coordinate
neighborhood we can take at each point a the path $h_{1,a}$ “along the $q^1$
axis”; this is the path specified in coordinates as
$h_{1,a}(t) = (q^1 a + t, q^2 a, \dots, q^n a)$. Then
$a \mapsto \tau_a h_{1,a}$ is a vector field, the field of “unit” tangent
vectors along the $q^1$ axis. This vector field is usually written as
$X_1 = \partial / \partial q^1$, because the coordinate formula above for the
basic pairing shows that its action on a function g is given by
$X_1 g = \partial g / \partial q^1$. In the coordinate neighborhood every
vector field X can be written uniquely in the form

$$X = f_1 \frac{\partial}{\partial q^1} + \cdots + f_n \frac{\partial}{\partial q^n},$$

where the $f_i$ are quantities in U. In this formula, the addition and scalar
multiplication by quantities f are defined for vector fields just as for forms:
Under these operations the vector fields on U form a module over the ring S(U)
of quantities on U.


6. Lagrange’s equations. We return to mechanics, where Newton’s Law
gives the equations of motion for a point in configuration space in the
familiar form

$$m_i \frac{d^2 x^i}{dt^2} = F_i , \qquad i = 1, \dots, n,$$

where the $x^1, \dots, x^n$ are rectangular coordinates and $F_i$ is the ith
component of the force. These equations may be rewritten in more invariant form
using the quantities which represent potential and kinetic energy. The kinetic
energy T, in the familiar rectangular coordinates, has an expression such as

$$T = \tfrac{1}{2} m_1 (v^1)^2 + \cdots + \tfrac{1}{2} m_n (v^n)^2 = \tfrac{1}{2} m_1 (\dot x^1)^2 + \cdots + \tfrac{1}{2} m_n (\dot x^n)^2 .$$


In other coordinates there will be different formulas, but any such T depends
on the coordinates in the tangent bundle, so T is a quantity
$T : T_\bullet M \to R$ on that bundle. (The letter T is overused here, thanks
to separate traditions, in which plain T is a quantity (kinetic energy) and
dotted $T_\bullet$ a functor, the tangent bundle functor.) In a conservative
system, the ith component $F_i$ of the force on the particle can be expressed
as the quantity

$$F_i = - \frac{\partial V}{\partial x^i},$$

where V is the potential energy. This V, originally a quantity $V : M \to R$ on
the configuration space M, is also a quantity on the tangent bundle
$T_\bullet M$, namely the composite function $T_\bullet M \to M \to R$.
Therefore the difference

$$L = T - V : T_\bullet M \to R$$


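is also a quantity on $T_\bullet M$. It is called the Lagrangian. Now in the
rectangular coordinates above,

$$m_i \frac{d^2 x^i}{dt^2} = \frac{d}{dt} \left( \frac{\partial T}{\partial \dot x^i} \right) = \frac{d}{dt} \left( \frac{\partial L}{\partial \dot x^i} \right), \qquad F_i = - \frac{\partial V}{\partial x^i} = \frac{\partial L}{\partial x^i},$$

so the familiar equations of motion can be written as

$$\frac{d}{dt} \left( \frac{\partial L}{\partial \dot x^i} \right) - \frac{\partial L}{\partial x^i} = 0, \qquad i = 1, \dots, n.$$

Now transform to any other local coordinates $q^1, \dots, q^n$; a calculation
shows that these equations again take the form

$$\frac{d}{dt} \left( \frac{\partial L}{\partial \dot q^i} \right) - \frac{\partial L}{\partial q^i} = 0, \qquad i = 1, \dots, n.$$

The Lagrange equations have the same form in any local coordinates; we omit the
classical explanation: These are the equations for the extremal curves
(trajectories) which minimize a suitable integral.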
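
As an illustration of this coordinate independence (an example added here, not
taken from the text), consider a single particle of mass m in the plane under
an assumed central potential V(r), and pass from rectangular coordinates
(x, y) to polar coordinates $(r, \varphi)$. The symbols r, $\varphi$ and V(r)
are assumptions of this sketch.

```latex
% Rectangular to polar: x = r\cos\varphi, \; y = r\sin\varphi, so
L = \tfrac12 m(\dot x^2 + \dot y^2) - V
  = \tfrac12 m\bigl(\dot r^2 + r^2\dot\varphi^2\bigr) - V(r).

% Lagrange equation for q^1 = r:
\frac{d}{dt}\Bigl(\frac{\partial L}{\partial \dot r}\Bigr) - \frac{\partial L}{\partial r}
  = m\ddot r - m r\dot\varphi^2 + V'(r) = 0.

% Lagrange equation for q^2 = \varphi:
\frac{d}{dt}\Bigl(\frac{\partial L}{\partial \dot\varphi}\Bigr) - \frac{\partial L}{\partial \varphi}
  = \frac{d}{dt}\bigl(m r^2 \dot\varphi\bigr) = 0.
```

The second equation is conservation of angular momentum; it is obtained from
the same formal recipe as Newton's equations in rectangular coordinates.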


7. The Legendre transformation. Now we shift from the tangent bundle to
the cotangent bundle. This shift to “phase space” is accomplished by a
“Legendre transformation” which uses the quantity L to carry tangent spaces to
cotangent spaces. This transformation, which sometimes appears adventitious
when presented via coordinates, can be described invariantly, following an
observant suggestion of Sternberg [14]; we start with the case of one tangent
space (at one point $a \in M$) and get the following situation:


THEOREM. If W is a real vector space, regarded as a differentiable manifold,
then at each point $u \in W$ the tangent and cotangent spaces have canonical
isomorphisms $T_u W \cong W$, $T^u W \cong W^*$. Any quantity L on W determines
by $u \mapsto d_u L$ a smooth mapping $\mathcal{L}_L : W \to W^*$, called the
Legendre transformation for L. In particular, if L is a positive definite
quadratic form on W, making W an inner product space, then $\mathcal{L}_L$ is
the standard natural isomorphism which identifies the inner product space W
with its dual. If $\mathcal{L}_L$ is invertible, then the inverse
$(\mathcal{L}_L)^{-1}$ is also a Legendre transformation for the quantity H on
$W^*$ which is defined in terms of (linear) coordinates $v^1, \dots, v^n$ on W
and the dual coordinates $p_1, \dots, p_n$ on $W^*$ as

$$(1) \qquad H = v^1 p_1 + \cdots + v^n p_n - L.$$


Proof. Each vector $w \in W$ determines a path $h_w : R \to W$ at $u \in W$ by
$t \mapsto u + tw$; this is just the “radial” path from the point u along the
vector w. The correspondence $w \mapsto \tau_u h_w$ gives the desired
isomorphism $W \cong T_u W$; its dual is $T^u W \cong W^*$. The mapping
$\mathcal{L}_L : W \to W^*$ comes from the very definition of the differential;
if we describe differentials by coordinates as above, the ith coordinate of
$\mathcal{L}_L u$, in symbols $p_i(\mathcal{L}_L u)$, is just the value
$(\partial L / \partial v^i)_u$ of the ith partial derivative of L at this
point u. Thus we can write (with $v^i = \dot q^i$)

$$(2) \qquad p_i \circ \mathcal{L}_L = \partial L / \partial v^i = \partial L / \partial \dot q^i , \qquad i = 1, \dots, n.$$

These equalities (between quantities $W \to R$) describe the map
$\mathcal{L}_L : W \to W^*$ completely in terms of its composites with
coordinates on $W^*$. These equations are often written just as
$p_i = \partial L / \partial \dot q^i$. This coordinate description of
$\mathcal{L}_L$ is to be understood as an equation between quantities on W; we
emphasize that it is just a coordinate version of the simple conceptual
assignment $u \mapsto d_u L \in W^*$; in words, send each point u of W to the
value at u of the differential dL, where the space $T^u W$ of differentials is
identified with $W^*$ by the canonical isomorphism we have described.

Next suppose that L is a quadratic form. This is the important case for our
purposes, in which W is the tangent space $T_a M$ to a configuration space M at
some point a, while L = T - V is the classical Lagrangian. Restricted to one
tangent space $T_a M$ this L is (up to a constant) just the familiar positive
definite form $\tfrac{1}{2} \sum m_i (v^i)^2$ for kinetic energy. This also
provides an interpretation for the coordinates $p_i$, which we introduced just
as the coordinates on $W^*$ dual to the given $v^i$; with this L,
$p_i = \partial L / \partial v^i$ is just $p_i = m_i v^i$, the ith component of
the momentum. For this reason the $p_1, \dots, p_n$ are called the momentum
coordinates in $W^*$.

Now the statement that the function L is a “quadratic form” is usually
explained by saying that L has a quadratic expression in some coordinate
system, but it can be also explained in invariant fashion: L is a function
$L : W \to R$ with $L(bu) = b^2 L(u)$ for any scalar b and such that the
expression

$$(u, u') = \tfrac{1}{2} \left[ L(u + u') - L(u) - L(u') \right]$$

is a bilinear function; that is, a function linear in each of the vectors u and
$u' \in W$. (In other words, a function L is quadratic when its deviation
$L(u + u') - L(u) - L(u')$ from additivity is bilinear.) Now the formula above
gives a real inner product (u, u') for any two vectors of W; this inner product
is symmetric in u and u' by definition, and is positive definite by the
assumption on L. Hence W with L is an inner product space (a Euclidean vector
space = a finite dimensional Hilbert space) and L is given in terms of the
inner product as L(u) = (u, u).

We return to the proof of the theorem. Along the standard path $h_w$ which
we introduced above we have

$$L(u + tw) = (u + tw, u + tw) = (u, u) + 2t(u, w) + t^2 (w, w),$$

since the inner product is bilinear. Differentiating at t=0 gives

$$\langle d_u L, w \rangle = 2(u, w).$$

In other words (up to the factor 2), $d_u L$ is that linear form on W given by
the function $(u, -) : W \to R$. This correspondence $u \mapsto d_u L$ is thus
precisely the familiar map $u \mapsto (u, -)$ sending W to $W^*$, for W an
inner product space.

The map $\mathcal{L}_L : W \to W^*$ will be invertible (locally) provided the
determinant of the familiar Jacobian matrix
$\| \partial^2 L / \partial v^i \partial v^j \|$ is nonzero at every point of
W. When it is invertible, the equation

$$H = \sum_{i=1}^{n} v^i p_i - L \quad \text{on } W^* \text{ along } W \xleftarrow{\;\mathcal{L}_L^{-1}\;} W^*$$

does define a (smooth) quantity H on $W^*$. Specifically, in this equation the
coordinates $p_i$ on $W^*$ are given as quantities on $W^*$, while the
coordinates $v^i$ and L, originally given as quantities on W, become quantities
on $W^*$ by composition with $\mathcal{L}_L^{-1}$. The equation is thus short
for the more explicit equation

$$H = \sum_{i=1}^{n} (v^i \mathcal{L}_L^{-1})\, p_i - L \mathcal{L}_L^{-1}$$

(and we emphasize this point because it explains a general method of reading an
equation in quantities defined on different spaces in terms of quantities on
one space along a given diagram $W \leftarrow W^*$ of arrows between those
spaces). From this formula we may now calculate the Legendre transformation
$\mathcal{L}_H : W^* \to W$. The coordinate formula for $\mathcal{L}_H$ (as in
(2) above) is $v^j \mathcal{L}_H = \partial H / \partial p_j$, with partial
derivative calculated on $W^*$. By the usual rules for the derivative of a
product

$$\frac{\partial H}{\partial p_j} = v^j + \sum_i \frac{\partial v^i}{\partial p_j}\, p_i - \sum_i \frac{\partial L}{\partial v^i} \frac{\partial v^i}{\partial p_j};$$

the two sums cancel because $p_i \circ \mathcal{L}_L$ is
$\partial L / \partial v^i$, by the definition of $\mathcal{L}_L$. This formula
proves that $v^j \mathcal{L}_H \mathcal{L}_L = v^j$, so
$\mathcal{L}_H \mathcal{L}_L = 1$ and $\mathcal{L}_H$ is indeed (locally) the
inverse of $\mathcal{L}_L$. This completes the proof.
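
The statement that the Legendre transformation for H inverts the one for L can
be tested numerically on a one-dimensional example. The sketch below assumes a
hypothetical non-quadratic Lagrangian L(v) = m v^2/2 + v^4/4 on W = R; the
function names, the value of m, and the use of bisection to invert p = dL/dv
are illustrative choices, not part of the text.

```python
def L(v, m=2.0):
    """Hypothetical convex Lagrangian on the one-dimensional space W = R."""
    return 0.5 * m * v * v + 0.25 * v ** 4

def legendre_L(v, m=2.0):
    """Legendre transformation for L: v -> p = dL/dv."""
    return m * v + v ** 3

def inverse_legendre_L(p, m=2.0, lo=-100.0, hi=100.0):
    """Invert p = dL/dv by bisection (dL/dv is strictly increasing here)."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if legendre_L(mid, m) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def H(p, m=2.0):
    """H = v*p - L, with v read as a quantity on W* through the inverse map."""
    v = inverse_legendre_L(p, m)
    return p * v - L(v, m)

def legendre_H(p, m=2.0, eps=1e-5):
    """Legendre transformation for H: p -> v = dH/dp (central difference)."""
    return (H(p + eps, m) - H(p - eps, m)) / (2.0 * eps)

v0 = 1.5
p0 = legendre_L(v0)        # carry v0 in W to p0 in W*
v_back = legendre_H(p0)    # carry p0 back to W using H
```

Up to discretization error, `v_back` returns the starting velocity `v0`, which
is the one-dimensional content of the identity proved above.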

The formula for H may be stated in invariant terms, for any $y \in W^*$, as

$$H y = \langle y, \mathcal{L}_L^{-1} y \rangle - L \mathcal{L}_L^{-1} y.$$

Here $\mathcal{L}_L^{-1} y \in W$ and $\langle \, , \rangle$ denotes the
pairing $W^* \times W \to R$ which makes $W^*$ the dual of W. With coordinates
$p_1, \dots, p_n$ for y and dual coordinates $v^1, \dots, v^n$ for
$\mathcal{L}_L^{-1} y$, the value of this pairing is exactly the sum
$\sum p_i v^i$ of the previous formula. The above calculation that
$\mathcal{L}_H$ is $\mathcal{L}_L^{-1}$ may also be put in invariant form, but
the translation to this form appears to be cumbersome.

The notation for this theorem has been chosen so as to fit the natural
extension from one tangent space to the whole tangent bundle of any manifold M.
Let L be a quantity on $T_\bullet M$, and identify each $T_u(T_a M)$ with
$T_a M$ and $T^u(T_a M)$ with $(T_a M)^* = T^a M$, as above. A point of
$T_\bullet M$ is then a pair (a, u) for $u \in T_a M$, and the quantity L
determines, by restriction to each tangent space, a quantity $L | T_a M$ on
that space. Then we define

$$\mathcal{L}_L : T_\bullet M \to T^\bullet M, \qquad (a, u) \mapsto (a, d_u(L | T_a M)),$$

a smooth mapping of the tangent bundle $T_\bullet M$ to the cotangent bundle
$T^\bullet M$. This mapping is sometimes called the fiber derivative of L (see
Abraham [1]) because it arises by taking the differential of L restricted to
each fiber. If we put in the projections of these two bundles, the diagram

$$\begin{array}{ccc}
T_\bullet M & \xrightarrow{\;\mathcal{L}_L\;} & T^\bullet M \\
\downarrow {\scriptstyle \pi_\bullet} & & \downarrow {\scriptstyle \pi^\bullet} \\
M & = & M
\end{array}$$

commutes; one says that the Legendre transformation carries fibers (tangent
spaces) to fibers (cotangent spaces). In coordinates, the formulas for
$\mathcal{L}_L$ are just those already given in (2), plus the equations
$q^i \mathcal{L}_L = q^i$ on the position coordinates. The case where L
restricted to each fiber is a positive definite quadratic form is still
applicable as above. Indeed, a manifold M with a smooth function assigning to
each tangent space a quadratic form there is just a manifold with a Riemann
metric. In other words, the kinetic energy in a classical mechanical system may
be regarded as a Riemann metric on the configuration space.


8. Hamilton’s Equations. This Legendre transformation $\mathcal{L}_L$ is now
applied to the trajectories of the given mechanical system. These trajectories
are originally the paths $h : R \to M$ on configuration space which satisfy the
Lagrange equations. Now at each point h(t) of M on such a path h the path
itself determines a tangent vector $\tau_{h(t)} h$ to M, and the map
$t \mapsto (h(t), \tau_{h(t)} h) \in T_\bullet M$ “lifts” the path h to a path
$\bar h : R \to T_\bullet M$ on the tangent bundle; this is just the path
giving position and velocity at any time $t \in R$. The image under
$\mathcal{L}_L$ of these lifted trajectories will be the trajectories in
$T^\bullet M$ of the mechanical system. A straightforward calculation from the
Lagrange equations shows that these trajectories
$\tilde h = \mathcal{L}_L \circ \bar h$ on $T^\bullet M$ are the solutions of
the system of equations

$$\frac{dq^i}{dt} = \frac{\partial H}{\partial p_i}, \qquad \frac{dp_i}{dt} = - \frac{\partial H}{\partial q^i}, \qquad i = 1, \dots, n,$$

on R, evaluated along $\tilde h : R \to T^\bullet M$. Here “evaluated along
$\tilde h$” means (as in a previous case) that each of the $q^i$, the $p_i$,
and the partial derivatives of H, all originally quantities on $T^\bullet M$,
are to be regarded as quantities on R (that is, as functions of t) by
composition with $\tilde h$.

These 2n equations are Hamilton’s equations. They describe the trajectories
on phase space $T^\bullet M$ by a system of first order differential equations.
These equations depend on the quantity H on $T^\bullet M$; this quantity is the
Hamiltonian defined above in terms of the Lagrangian. The formulation of these
equations also seems to depend on the fact that phase space is the manifold of
“positions and momenta” (mathematically, that phase space is a cotangent
bundle). This means that we can get these equations in this form for any system
of 2n local coordinates on $T^\bullet M$, provided these coordinates are split
into two sets: position coordinates $q^1, \dots, q^n$ and the corresponding
momentum coordinates $p_1, \dots, p_n$. Any position coordinates will do,
provided that they are matched with the corresponding momentum coordinates.
There is another mystery beyond the use of such pairs of coordinates. A system
of first order differential equations on a manifold $T^\bullet M$ is usually
given by a vector field X on $T^\bullet M$; the solutions of the equations may
then be described as those paths on the manifold which thread through the
vector field, with tangent vectors at each point b exactly that specified as
$X_b$ by the vector field at that point. But this system of equations is
determined by the quantity H, and what a quantity H gives naturally is a
covector field dH, not a vector field.
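
To see Hamilton's equations as a first order system in practice, one can
integrate them numerically. The sketch below assumes the one-dimensional
harmonic oscillator Hamiltonian H = p^2/(2m) + k q^2/2 and uses a leapfrog
(Störmer-Verlet) scheme; the Hamiltonian, the parameter values, and the
integrator are illustrative assumptions and are not discussed in the text.

```python
import math

def H(q, p, k=1.0, m=1.0):
    """Assumed Hamiltonian: harmonic oscillator on phase space R^2."""
    return p * p / (2.0 * m) + 0.5 * k * q * q

def dH_dq(q, p, k=1.0, m=1.0):
    return k * q

def dH_dp(q, p, k=1.0, m=1.0):
    return p / m

def leapfrog(q, p, dt, steps):
    """Integrate dq/dt = dH/dp, dp/dt = -dH/dq with half/full/half updates."""
    for _ in range(steps):
        p -= 0.5 * dt * dH_dq(q, p)   # half step in momentum
        q += dt * dH_dp(q, p)         # full step in position
        p -= 0.5 * dt * dH_dq(q, p)   # half step in momentum
    return q, p

# Start at (q, p) = (1, 0) and follow the trajectory up to time t = 1.
q1, p1 = leapfrog(1.0, 0.0, dt=0.001, steps=1000)
energy_drift = abs(H(q1, p1) - H(1.0, 0.0))
```

For these parameter values the exact trajectory is q = cos t, p = -sin t, so
`q1` and `p1` can be compared with `math.cos(1.0)` and `-math.sin(1.0)`; the
value of H along the computed path stays close to its initial value.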

So we reconsider the properties of phase space. The formula
$H = \sum \dot q^i p_i - L$ used to define H suggests the expression

$$\theta = \sum_i p_i\, dq^i .$$

Now $p_i$ and $q^i$ are quantities on $T^\bullet M$, so each $dq^i$ is a 1-form
on $T^\bullet M$, and so is each scalar multiple $p_i\, dq^i$ and hence
$\theta$ itself. Now if we make any change in the local coordinates $q^i$ and
the corresponding change in the momenta $p_i$, a calculation shows that this
1-form $\theta$ is unchanged. Indeed, this 1-form can be described in invariant
fashion, using only the fact that $T^\bullet M$ is a cotangent bundle. As a
1-form on $T^\bullet M$, $\theta$ must be a cross section of the cotangent
bundle of $T^\bullet M$; that is, a smooth map
$T^\bullet M \to T^\bullet(T^\bullet M)$ which is a right inverse of the
projection $\pi^\bullet : T^\bullet(T^\bullet M) \to T^\bullet M$. So take any
point $b = (a, d_a g)$ on $T^\bullet M$ and define $\theta'$ as the map

$$b = (a, d_a g) \mapsto ((a, d_a g), d_b(g \pi^\bullet)).$$

Here $\pi^\bullet : T^\bullet M \to M$ is the projection of the given cotangent
bundle, so $g \pi^\bullet$ is a quantity on $T^\bullet M$, and the expression
on the right in this formula is a point of $T^\bullet(T^\bullet M)$ with first
component $(a, d_a g)$ the given point b. Thus this formula does define a
1-form $\theta'$. Moreover, $\theta' = \theta$: For
$\theta = \sum p_i\, dq^i : T^\bullet M \to T^\bullet(T^\bullet M)$ is a linear
combination of the 1-forms $dq^i$, so is given by the formula (definition of
the linear operations)

$$\theta b = \Bigl( b, \sum_i p_i(b)\, d_b q^i \Bigr).$$

Now we can write $b = (a, d_a g) = (a, \sum_i k_i\, d_a q^i)$ with n scalars
$k_i$; these scalars being exactly the $p_i$ coordinates $k_i = p_i b$.
Therefore

$$\theta(b) = \theta(a, d_a g) = \Bigl( b, \sum_i k_i\, d_b q^i \Bigr) = \Bigl( b, \sum_i k_i\, d_b(q^i \pi^\bullet) \Bigr) = \theta'\Bigl( a, \sum_i k_i\, d_a q^i \Bigr) = \theta'(a, d_a g),$$

so $\theta = \theta'$, as claimed. On every cotangent bundle there is an
invariant 1-form $\theta$.


9. Forms and symplectic manifolds. On any manifold S we now consider
1-forms $\theta$ and 2-forms $\omega$. In terms of local coordinates
$x^1, x^2, \dots$ these are expressions (in quantities $f_i$, $g_{ij}$) like

$$\theta = f_1\, dx^1 + f_2\, dx^2 + \cdots, \qquad \omega = g_{12}\, dx^1 \wedge dx^2 + g_{13}\, dx^1 \wedge dx^3 + \cdots ;$$

and any 1-form $\theta$ determines a 2-form $d\theta$ by formal application of
the differential d as

$$d\theta = \left( \frac{\partial f_2}{\partial x^1} - \frac{\partial f_1}{\partial x^2} \right) dx^1 \wedge dx^2 + \cdots .$$

The conceptual presentation uses the exterior algebra of the cotangent spaces
$W^* = T^b S$ of S (see, for example [9]). First, the exterior square
$W^* \wedge W^*$ is the vector space generated by all formal products
$e \wedge e'$ of the vectors $e, e' \in W^*$, where the exterior product
$\wedge$ is required to be bilinear and alternating
($e \wedge e' = - e' \wedge e$, hence $e \wedge e = 0$). If $W^*$ has a basis
of n vectors $e^i$, then $W^* \wedge W^*$ has a basis of n(n-1)/2 vectors
$e^i \wedge e^j$ for i<j. Over the manifold S we can construct the exterior
bundle $T^\bullet S \wedge T^\bullet S$ whose fiber over each point b is the
vector space $T^b S \wedge T^b S$. A 2-form $\omega$ on S is then defined to be
a smooth cross section of this bundle $T^\bullet S \wedge T^\bullet S \to S$.
Since the $dx^i$ for local coordinates $x^i$ form a module basis of the
1-forms, the exterior algebra above shows that the $dx^i \wedge dx^j$ for i<j
form a module basis for the 2-forms. This means exactly that any 2-form can be
written uniquely in the form $\sum g_{ij}\, dx^i \wedge dx^j$ displayed above.

In particular, on any cotangent bundle $S = T^\bullet M$, our invariant 1-form
$\theta = \sum p_i\, dq^i$ yields by exterior differentiation a 2-form
$\omega = d\theta$ with

$$\omega = \sum_{i=1}^{n} dp_i \wedge dq^i .$$

It is a closed 2-form ($d\omega = 0$ because $d\omega = dd\theta$) and is of
maximal rank. The latter means that its exterior power
$\omega^n = \omega \wedge \cdots \wedge \omega$ to n factors is nonzero; indeed
$\omega^n = \pm k\, dp_1 \wedge \cdots \wedge dp_n \wedge dq^1 \wedge \cdots \wedge dq^n$
is (up to a factor k) just the familiar “element of 2n-dimensional volume” on
$T^\bullet M$.

Now an even dimensional manifold S equipped with a specified 2-form $\omega$,
with $d\omega = 0$, of maximal rank is called a symplectic manifold. We have
just seen that every cotangent bundle (every phase space $T^\bullet M$) is a
symplectic manifold. One reason for naming such manifolds is precisely that the
Hamilton equations above can be written down directly for any quantity H on any
symplectic manifold. This can be seen by choices of coordinates: Given any
2-form $\omega$, with $d\omega = 0$ and of maximum rank, a theorem due to
Darboux states that in some neighborhood of each point one can successively
choose coordinates $q^1, \dots, q^n$ and $p_1, \dots, p_n$ so that $\omega$
takes on the standard form $\omega = \sum dp_i \wedge dq^i$. With these
coordinates we can write down Hamilton’s equations. Calculation then shows that
a different choice of coordinates with the same properties yields the same
trajectories for Hamilton’s equations.

There is a better and more conceptual proof which uses the 2-form $\omega$
directly. If $W^*$ is the dual of the vector space W, then each $e \in W^*$ is
a linear function $W \to R$. Because of this, we shall show that each element
$t \in W^* \wedge W^*$ yields a linear function

$$t^\flat : W \to W^* .$$

Since any t is a linear combination of vectors $e \wedge e'$ for e and
$e' \in W^*$, it will suffice to take $t = e \wedge e'$ (and use the
corresponding sums of functions $t^\flat$). Thus we define
$(e \wedge e')^\flat$ on each $w \in W$ to be an element of $W^*$; that is, to
be the linear function of $w' \in W$ given by the formula

$$[(e \wedge e')^\flat w](w') = e(w) e'(w') - e'(w) e(w'),$$

where e(w), e'(w'), e'(w), and e(w') are real numbers, since
$e, e' : W \to R$; the second term makes the right side alternating in e and
e', as it must be if $(e \wedge e')^\flat$ is to depend only on $e \wedge e'$.
Now consider the 2-form $\omega$ on S; at each point $b \in S$,
$\omega_b \in T^b S \wedge T^b S$ gives a map $T_b S \to T^b S$ of tangent
space to cotangent space. Put together at all points, this is a smooth map

$$\begin{array}{ccc}
T_\bullet S & \xrightarrow{\;\omega^\flat\;} & T^\bullet S \\
\downarrow & & \downarrow \\
S & = & S
\end{array}$$

of the tangent bundle to the cotangent bundle. When $\omega$ is of maximal rank
on S, this map is one-one and onto, so its inverse is a smooth map of the
cotangent bundle back to the tangent bundle. In particular, given any 1-form on
S, such as the differential dH of the Hamiltonian, this map gives a vector
field $(\omega^\flat)^{-1} \circ dH$ on S. This is (up to a sign fixed by the
ordering conventions for $\omega$) the vector field which is used in Hamilton’s
equations; therefore these equations arise directly from the quantity H and the
basic 2-form $\omega$ on the symplectic manifold S.
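
A short coordinate computation may make this construction concrete. The sketch
below takes n = 1 with $\omega = dp \wedge dq$ and assumes the alternating
convention $[(e \wedge e')^\flat w](w') = e(w)e'(w') - e'(w)e(w')$; the
component names a, b of the vector field X are introduced only for this
illustration, and the opposite ordering convention would change the overall
sign.

```latex
% A vector field in the coordinates (q, p):
X = a\,\frac{\partial}{\partial q} + b\,\frac{\partial}{\partial p}.

% Apply \omega^\flat with \omega = dp \wedge dq:
\omega^\flat X = dp(X)\,dq - dq(X)\,dp = b\,dq - a\,dp.

% Compare with dH = \frac{\partial H}{\partial q}\,dq + \frac{\partial H}{\partial p}\,dp:
b = \frac{\partial H}{\partial q}, \qquad a = -\frac{\partial H}{\partial p}.

% Hence, under this convention,
(\omega^\flat)^{-1}\, dH
  = -\Bigl(\frac{\partial H}{\partial p}\,\frac{\partial}{\partial q}
          - \frac{\partial H}{\partial q}\,\frac{\partial}{\partial p}\Bigr).
```

The field in parentheses is the one whose integral curves satisfy
$dq/dt = \partial H/\partial p$, $dp/dt = -\partial H/\partial q$, so Hamilton's
equations are recovered from H and $\omega$ alone, with the sign depending only
on the convention chosen.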


10. Canonical transformations and generating functions. Specific problems
in mechanics are often treated by picking coordinates convenient to the problem
in hand; in the symplectic formulation of mechanics, this means picking on S
new local coordinates, conventionally written
$Q^1, \dots, Q^n, P_1, \dots, P_n$, in such a way that the 2-form $\omega$ is
still expressed as $\omega = \sum dP_i \wedge dQ^i$. Classically, this is
called a canonical transformation from the canonical coordinates $p_i$, $q^i$
to the new canonical coordinates $P_i$, $Q^i$. These new coordinates need no
longer have a direct physical interpretation by momentum and position.

Certain “generating functions” F are classically used to give such canonical
transformations. The traditional presentation ([6], [16]) goes about as
follows. Take a smooth “generating function” F of 2n coordinates
$q^1, \dots, q^n, P_1, \dots, P_n$, and assume the determinant

$$\det \left\| \frac{\partial^2 F}{\partial q^i\, \partial P_j} \right\| \neq 0$$

everywhere. Now the definitions

$$q^i = q^i, \qquad p_i = \partial F / \partial q^i, \qquad i = 1, \dots, n,$$

give 2n functions which we regard as coordinates, while the parallel
definitions

$$Q^i = \partial F / \partial P_i, \qquad P_i = P_i, \qquad i = 1, \dots, n,$$

again give 2n coordinates. The determinant condition implies that we can solve
for the $Q^1, \dots, Q^n, P_1, \dots, P_n$ as functions of the
$q^1, \dots, q^n, p_1, \dots, p_n$ or conversely. A calculation (to be done
below) shows that this transformation carries the form $\sum dP_i \wedge dQ^i$
to the form $\sum dp_i \wedge dq^i$. Thus F has “generated” a transformation
from one set of canonical coordinates to another.

This situation could also be described as follows. Let S be a manifold of
even dimension 2n which is given as a product $S = S_1 \times S_2$ of two
n-dimensional manifolds $S_1$ and $S_2$, and let F be a quantity on S. If
$\alpha$ is any form on S (a 2-form, a 1-form, or a 0-form; that is, a quantity
such as F), then the exterior derivative $d\alpha$ can be written in a natural
way as a sum $d\alpha = d_1 \alpha + d_2 \alpha$, where $d_1$ is the
differential along $S_1$ and $d_2$ that along $S_2$. In particular, $d_1 F$ is
a 1-form on S, and $d_2(d_1 F) = - d_1(d_2 F)$ is a 2-form on S. Our basic
assumption now reads: F is a quantity on a manifold $S = S_1 \times S_2$ such
that the 2-form $\omega = d_2 d_1 F$ has maximal rank. This assumption means
exactly that S with the 2-form $\omega$ is a symplectic manifold. Indeed
$d\omega = 0$ becomes

$$d\omega = d(d_2 d_1 F) = (d_1 + d_2)(d_2 d_1 F) = d_1 d_2 d_1 F + d_2 d_2 d_1 F = 0,$$

since $d_1 d_1 = 0 = d_2 d_2$ and $d_1 d_2 = - d_2 d_1$. Choose any local
coordinates $q^1, \dots, q^n$ on $S_1$ and $P_1, \dots, P_n$ on $S_2$. Then
$q^1, \dots, q^n, P_1, \dots, P_n$ are 2n local coordinates on
$S = S_1 \times S_2$, and

$$d_1 F = \frac{\partial F}{\partial q^1}\, dq^1 + \cdots + \frac{\partial F}{\partial q^n}\, dq^n ;$$

hence, applying $d_2$ in the evident way,

$$\omega = d_2 d_1 F = \sum_{i=1}^{n} \sum_{j=1}^{n} \frac{\partial^2 F}{\partial q^i\, \partial P_j}\, dP_j \wedge dq^i .$$

This is the formula by which the quantity F defines a 2-form. Calculating the
exterior product $\omega \wedge \cdots \wedge \omega$, to n factors, the
condition that it is nonzero becomes exactly the condition that the determinant
above is nonzero at all points of S.

From the data $S = S_1 \times S_2$ and F we have thus defined a symplectic
manifold $(S, \omega = d_2 d_1 F)$. In this manifold (and in these coordinates)
the basic 2-form does not have the usual canonical form. But there are other
local coordinates on S; for example, the determinant condition above implies
that the quantities $q^1, \dots, q^n, p_1, \dots, p_n$ with
$p_i = \partial F / \partial q^i$ are also local coordinates. In these
coordinates we may calculate the 2-form $\omega_0 = \sum_i dp_i \wedge dq^i$ by
the usual formula

$$\omega_0 = \sum_i dp_i \wedge dq^i = \sum_i \sum_j \frac{\partial^2 F}{\partial q^i\, \partial P_j}\, dP_j \wedge dq^i + \sum_i \sum_j \frac{\partial^2 F}{\partial q^i\, \partial q^j}\, dq^j \wedge dq^i .$$

The first double summation is $\omega = d_2 d_1 F$, while the second contains
terms with $dq^i \wedge dq^i = 0$, and terms with
$dq^j \wedge dq^i = - dq^i \wedge dq^j$ and $i \neq j$, which cancel in pairs;
therefore $\omega_0 = \omega$. In the coordinates $q^i$, $p_i$, $\omega$
becomes canonical. An exactly analogous calculation shows that $\omega$ is
canonical in the coordinates $P_i$, $Q^i = \partial F / \partial P_i$. These
two calculations together give the calculations previously suggested which show
that the transformation from the coordinates $q^i$, $p_i$ to $Q^i$, $P_i$ takes
the canonical 2-form to itself. But the calculation is best understood by going
through the form $\omega$ expressed in terms of the intermediate (or original)
coordinates $q^1, \dots, q^n, P_1, \dots, P_n$.
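
For n = 1 the statement that a generating function yields a canonical
transformation reduces to the statement that the Jacobian determinant of
(q, p) -> (Q, P) equals 1, since then dP ∧ dQ = dp ∧ dq. The sketch below
checks this numerically for the assumed generating function
F(q, P) = qP + a q^2 P; the function F, the parameter `A`, and the helper names
are hypothetical choices made for illustration.

```python
A = 0.3  # hypothetical parameter in the assumed generating function

def transform(q, p, a=A):
    """Map old canonical coordinates (q, p) to new ones (Q, P).

    From F(q, P) = q*P + a*q**2*P: p = dF/dq = P*(1 + 2*a*q), Q = dF/dP.
    The determinant condition d^2F/dq dP = 1 + 2*a*q != 0 is assumed to hold.
    """
    P = p / (1.0 + 2.0 * a * q)   # solve p = dF/dq for P
    Q = q + a * q * q             # Q = dF/dP
    return Q, P

def jacobian_det(q, p, eps=1e-6):
    """Central-difference estimate of dP/dp * dQ/dq - dP/dq * dQ/dp."""
    Q_qp, P_qp = transform(q + eps, p)
    Q_qm, P_qm = transform(q - eps, p)
    Q_pp, P_pp = transform(q, p + eps)
    Q_pm, P_pm = transform(q, p - eps)
    dQ_dq = (Q_qp - Q_qm) / (2.0 * eps)
    dP_dq = (P_qp - P_qm) / (2.0 * eps)
    dQ_dp = (Q_pp - Q_pm) / (2.0 * eps)
    dP_dp = (P_pp - P_pm) / (2.0 * eps)
    return dP_dp * dQ_dq - dP_dq * dQ_dp

det = jacobian_det(0.7, -1.2)
```

A determinant equal to 1 (up to discretization error) at the sample point is
consistent with the 2-form being carried to itself by the transformation.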

The same intermediate step can be exhibited in other classical cases (see 
[6], [16]) of generating functions for canonical transformations. The presenta- 
tion of these ideas in classical books is sometimes impoverished because the 
notion of a differential form (other than a 1-form) is wholly avoided. The gen- 
eral methods of E. Cartan for invariant integrals use the calculus of differential 
forms extensively and go far beyond what we have presented here. 

We have considered only changes of coordinates. Dually, there are symplectic
transformations between two symplectic manifolds $(S, \omega)$ and $(S', \omega')$ of the
same dimension. These are the smooth maps $f: S \to S'$ which carry the form
$\omega'$ on $S'$ back to the form $\omega$ on $S$ (in the natural sense in which forms go
backwards).


11. Hamilton-Jacobi Equations. Many problems of classical mechanics can
be solved by first obtaining solutions of a certain first order partial differential
equation, the Hamilton-Jacobi equation. We sketch the situation very briefly.
Given the Hamiltonian $H: T^{\bullet}M \to R$, each quantity $S$ on $M$ determines a
1-form $dS: M \to T^{\bullet}M$ which is a cross section of the cotangent bundle. The
composite of these two functions is a quantity on $M$; setting it equal to 0 gives
an equation

$$H \circ dS = 0 \quad \text{on } M$$

called the Hamilton-Jacobi partial differential equation. If we write the
Hamiltonian $H$ as a function $H(q^1, \ldots, q^n, p_1, \ldots, p_n)$ of local coordinates and
the differential $dS: M \to T^{\bullet}M$ as the map $a \mapsto (a, d_a S)$, where $a$ has the
coordinates $q^1, \ldots, q^n$, this equation appears as

$$H\Bigl(q^1, \ldots, q^n, \frac{\partial S}{\partial q^1}, \ldots, \frac{\partial S}{\partial q^n}\Bigr) = 0;$$

it is a partial differential equation of the first order in $S$.

On the other hand, for any path $h: R \to M$, giving the quantity $S$ on $M$ allows
us to lift this path to $\tilde h = dS \circ h: R \to T^{\bullet}M$, a path on the cotangent bundle. The
role of the Hamilton-Jacobi partial differential equations may then be formulated
in the following theorem (which I take from some lecture notes of George
Mackey, who treated the more general case of a "time dependent" Hamiltonian):

THEOREM. Let $S$ be a solution of the Hamilton-Jacobi partial differential
equations for a Hamiltonian quantity $H$ on a cotangent bundle $T^{\bullet}M$. Then, if a
path $h$ on $M$ satisfies

$$\frac{dq^i}{dt} = \frac{\partial H}{\partial p_i}, \qquad i = 1, \ldots, n \ \text{ on } R \text{ along } h \text{ and } \tilde h,$$

(the first $n$ Hamilton equations) it also satisfies the second $n$ equations

$$\frac{dp_i}{dt} = -\frac{\partial H}{\partial q^i}, \qquad i = 1, \ldots, n \ \text{ on } R \text{ along } h \text{ and } \tilde h.$$

Conversely, if $S$ is a quantity on $M$ which has this property for all $h$, then $S$ satisfies
the Hamilton-Jacobi partial differential equation.
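
The direct half of the theorem can be illustrated on a one-dimensional example. The sketch below assumes the Hamiltonian $H(q, p) = p^2/2 - q^2/2$ and the quantity $S(q) = q^2/2$, which satisfies $H(q, S'(q)) = 0$; both are hypothetical choices made for illustration, not taken from the text. It integrates the first Hamilton equation along the lifted path and measures how far the second equation is from holding.

```python
def H(q, p):
    # Hypothetical Hamiltonian chosen for illustration.
    return 0.5 * p * p - 0.5 * q * q


def dS(q):
    # S(q) = q^2 / 2, so dS/dq = q and H(q, dS(q)) = 0 identically.
    return q


def dH_dp(q, p):
    return p


def dH_dq(q, p):
    return -q


def integrate_first_equation(q0, t_end, steps):
    # Solve dq/dt = dH/dp(q, dS(q)) by classical fourth-order Runge-Kutta.
    rhs = lambda q: dH_dp(q, dS(q))
    dt = t_end / steps
    qs = [q0]
    q = q0
    for _ in range(steps):
        k1 = rhs(q)
        k2 = rhs(q + 0.5 * dt * k1)
        k3 = rhs(q + 0.5 * dt * k2)
        k4 = rhs(q + dt * k3)
        q += dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        qs.append(q)
    return dt, qs


def second_equation_residual(q0=1.0, t_end=1.0, steps=1000):
    # Largest value of |dp/dt + dH/dq| along the lifted path p = dS(q),
    # with dp/dt estimated by a central difference.
    dt, qs = integrate_first_equation(q0, t_end, steps)
    ps = [dS(q) for q in qs]
    worst = 0.0
    for i in range(1, steps):
        dp_dt = (ps[i + 1] - ps[i - 1]) / (2 * dt)
        worst = max(worst, abs(dp_dt + dH_dq(qs[i], ps[i])))
    return worst


if __name__ == "__main__":
    print("max residual of the second Hamilton equation:",
          second_equation_residual())
```

The residual is limited only by the finite-difference step, which is consistent with the theorem for this particular $S$ and $H$.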


There are many further connections with the theory of characteristics of
first order partial differential equations. This beautiful theory, well presented,
say, in [3] and [5], is normally treated analytically, with heavy use of
coordinates. It should be possible and useful to have a coordinate-free and more
geometrical presentation for this and similar "analytic" theories.


12. Summary. With these lines we have just begun the exploration of the 
relation between ideas arising in classical mechanics and conceptual methods of 
modern differential geometry. Much more could be said in differential geometry 
(see [2], [14]) or in the translation of the classical presentation of mechanics 
([6], [16], [18], etc.) into geometrical language. Some portions of this task are
carried out at an elementary level in [8] and [11], and at a more advanced level 
in [1], [12], [17]. We have not touched on the active research in the related 
topics of structural stability and qualitative dynamics. We have emphasized the 
idea that quantities are functions and that equations hold on a manifold along a 
diagram of functions. More use of categorical ideas may be indicated. There has 
been a suggestion (F. W. Lawvere) that “categorical dynamics” will replace the 
category of differentiable manifolds by other more flexible categories, axiomati- 
cally characterized so as to bring out the tangent bundle and infinitesimal 
constructions. Here, as throughout mathematics, conceptual methods should 
penetrate deeper to give clearer understanding. 
ON SOME PROBABILISTIC ASPECTS OF CLASSICAL ANALYSIS
MARK KAC, Rockefeller University 


1. When one looks upon the numerous connections between probability 
theory and analysis one notes that the interactions between the two fall into 
two categories. 

On the one hand we have problems whose very formulation depends on 
concepts and terminology of probability theory, e.g., what is the probability 
that a power series 


$$\sum_{n=0}^{\infty} \pm c_n z^n \tag{1.1}$$


with random signs has its circle of convergence as a natural boundary? 

On the other hand we have a more subtle and less direct intervention of 
probability theory in the situation in which neither the formulation nor the 
solution of the problem has anything to do with probability. 

Today I should like to illustrate this second category of interactions by dis- 
cussing a number of points in the classical theory of the logarithmic potential. 


2. I begin with a cursory review of the Wiener measure in the space of plane 
curves, or equivalently, of the theory of two-dimensional Brownian motion. 

Let $O$ be an origin chosen in the plane and denote by $\vec r(t)$, $0 \le t < \infty$,
continuous plane curves originating at $O$.
There is a completely additive (Lebesgue) measure $P$ in the space of these
curves which has the following fundamental property.
Let $0 \le t_1 < t_2 < \cdots < t_n$ and let $\Omega_1, \Omega_2, \ldots$ be plane Borel sets, then

$$P\{\vec r(t_1) \in \Omega_1, \vec r(t_2) \in \Omega_2, \ldots, \vec r(t_n) \in \Omega_n\}
= \int_{\Omega_1} \cdots \int_{\Omega_n} p(O\,|\,\vec r_1; t_1)\, p(\vec r_1\,|\,\vec r_2; t_2 - t_1) \cdots p(\vec r_{n-1}\,|\,\vec r_n; t_n - t_{n-1})\, d\vec r_1 \cdots d\vec r_n, \tag{2.1}$$

where the integration with respect to $\vec r_i$ is over $\Omega_i$ and

$$p(\vec\rho\,|\,\vec r; t) = \frac{\exp[-\|\vec r - \vec\rho\|^2/2t]}{2\pi t} \tag{2.2}$$

($\|\vec a\|$ denotes the Euclidean length of $\vec a$). The integral with respect to the
(Wiener) measure $P$ will be denoted by $E$ to conform to standard probabilistic
terminology.

It should perhaps be noted that (2.1) (with p given by (2.2)) is taken from 
the physical theory of plane Brownian motion. Wiener’s great achievement was 
to show that assigning the measure (2.1) to sets 


$$\{\vec r(t_1) \in \Omega_1, \ldots, \vec r(t_n) \in \Omega_n\} \tag{2.3}$$


of continuous curves can be extended to a vastly richer class of sets. There is, in 
fact, a complete analogy between what Wiener did to the space of all continuous 
curves and what Lebesgue did to the real line: when starting by assigning a mea- 
sure to intervals (sets (2.3) are the analogues of intervals and were called 
“quasi-intervals” by Wiener) he extended it to a large class of sets of real num- 
bers. 


3. Let $\Omega$ be a bounded, closed plane region and let $A$ be an open set in the
plane. Denoting by $V_\Omega$ the indicator function of $\Omega$ (and similarly by $V_A$ the
indicator function of $A$) we consider the expression

$$\int_0^\infty e^{-st}\, E\Bigl\{\exp\Bigl[-u \int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau\Bigr]\, V_A(\vec y + \vec r(t))\Bigr\}\, dt, \qquad u > 0,\ s > 0. \tag{3.1}$$

The motivation for considering (3.1) comes from the fact that

$$\int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau$$

is the time, up to $t$, spent by the Brownian particle (which starts from $\vec y$) in the
set $\Omega$ and the joint probability

$$P\Bigl\{\int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau < \alpha,\ V_A(\vec y + \vec r(t)) = 1\Bigr\} \tag{3.2}$$

that this occupation time is less than $\alpha$ and that the path ends up at $t$ in $A$, is
clearly of interest.

Expression (3.1) is a double Laplace transform of (3.2) (with respect to $\alpha$
and $t$), and for technical reasons is easier to study than (3.2).

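It is now quite easy to show that

$$\int_0^\infty e^{-st}\, E\Bigl\{\exp\Bigl[-u \int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau\Bigr]\, V_A(\vec y + \vec r(t))\Bigr\}\, dt = \int_A F(\vec r; u, s)\,d\vec r, \tag{3.3}$$

where $F(\vec r; u, s)$ satisfies the integral equation

$$F(\vec r; u, s) = \frac{1}{\pi} K_0(\sqrt{2s}\,\|\vec y - \vec r\|) - \frac{u}{\pi}\int_\Omega K_0(\sqrt{2s}\,\|\vec\rho - \vec r\|)\, F(\vec\rho; u, s)\,d\vec\rho, \tag{3.4}$$

where $K_0$ is the familiar Bessel function which makes its appearance through
the formula

$$\int_0^\infty e^{-st}\, \frac{\exp[-\|\vec r - \vec y\|^2/2t]}{2\pi t}\, dt = \frac{1}{\pi} K_0(\sqrt{2s}\,\|\vec r - \vec y\|). \tag{3.5}$$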
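
Formula (3.5) can be checked numerically with nothing beyond the standard library. The sketch below is an illustration under stated assumptions: it evaluates $K_0$ through the classical integral representation $K_0(x) = \int_0^\infty \exp(-x\cosh w)\,dw$ (a standard identity, not quoted in the text), substitutes $t = e^v$ in the Laplace integral, and uses a plain trapezoid rule for both. The function names, cutoffs, and step counts are arbitrary choices.

```python
import math


def trapezoid(f, a, b, n):
    # Composite trapezoid rule with n equal subintervals on [a, b].
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for k in range(1, n):
        total += f(a + k * h)
    return total * h


def bessel_k0(x):
    # K_0(x) = integral over w in [0, inf) of exp(-x cosh w);
    # the integrand is negligible well before w = 20 for moderate x.
    return trapezoid(lambda w: math.exp(-x * math.cosh(w)), 0.0, 20.0, 20000)


def laplace_of_density(s, dist):
    # Left side of (3.5): integral over t of
    # exp(-s t) * exp(-dist^2 / (2 t)) / (2 pi t), computed with t = exp(v),
    # so that dt / t = dv and the range becomes the whole real line.
    def integrand(v):
        t = math.exp(v)
        return math.exp(-s * t - dist * dist / (2.0 * t)) / (2.0 * math.pi)
    return trapezoid(integrand, -40.0, 40.0, 40000)


if __name__ == "__main__":
    s, dist = 1.0, 1.0
    left = laplace_of_density(s, dist)
    right = bessel_k0(math.sqrt(2.0 * s) * dist) / math.pi
    print(left, right, abs(left - right))
```

The two sides agree to many digits for any positive $s$ and distance tried this way, which is a useful sanity check on the constants $1/2\pi t$ and $1/\pi$ before they are carried through (3.4).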


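Let me derive (3.4) formally so that you will see a little of the kind of
manipulations one has to perform.
We have, expanding the exponential,

$$E\Bigl\{\exp\Bigl[-u \int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau\Bigr]\, V_A(\vec y + \vec r(t))\Bigr\}
= \sum_{k=0}^{\infty} \frac{(-1)^k u^k}{k!}\, E\Bigl\{\Bigl(\int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau\Bigr)^{k} V_A(\vec y + \vec r(t))\Bigr\} \tag{3.6}$$

and note that

$$\begin{aligned}
E\Bigl\{\Bigl(\int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau\Bigr)^{k} V_A(\vec y + \vec r(t))\Bigr\}
&= k! \int\!\cdots\!\int_{0 \le t_1 \le t_2 \le \cdots \le t_k \le t} P\{\vec y + \vec r(t_1) \in \Omega, \ldots, \vec y + \vec r(t_k) \in \Omega,\ \vec y + \vec r(t) \in A\}\, dt_1 \cdots dt_k \\
&= k! \int\!\cdots\!\int_{0 \le t_1 \le t_2 \le \cdots \le t_k \le t} \int_\Omega \cdots \int_\Omega \int_A p(\vec y\,|\,\vec r_1; t_1)\, p(\vec r_1\,|\,\vec r_2; t_2 - t_1) \cdots \\
&\qquad\qquad p(\vec r_{k-1}\,|\,\vec r_k; t_k - t_{k-1})\, p(\vec r_k\,|\,\vec r; t - t_k)\, d\vec r_1 \cdots d\vec r_k\, d\vec r\, dt_1\, dt_2 \cdots dt_k,
\end{aligned} \tag{3.7}$$

where the basic formula (2.1) has been used.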


Performing the integrations on the $t$'s first, and using the familiar convolution
theorem for Laplace transforms we get

$$F(\vec r; u, s) = \frac{1}{\pi} K_0(\sqrt{2s}\,\|\vec r - \vec y\|)
+ \sum_{k=1}^{\infty} \frac{(-1)^k u^k}{\pi^{k+1}} \int_\Omega \cdots \int_\Omega K_0(\sqrt{2s}\,\|\vec y - \vec r_1\|) \cdots K_0(\sqrt{2s}\,\|\vec r - \vec r_k\|)\, d\vec r_1 \cdots d\vec r_k \tag{3.8}$$

and (3.4) follows almost at once.

There are, of course, questions of convergence. For example (3.8) holds only
for sufficiently small $u$. But a simple argument based on analytic continuation
("permanence of formulas") establishes (3.4) for all positive $u$.


4. Using the familiar asymptotic formula

$$K_0(x) = \log\frac{2}{x} - \gamma + o(1), \qquad x \to 0 \quad (\gamma \text{ is Euler's constant}) \tag{4.1}$$

we have

$$K_0(\sqrt{2s}\,\|\vec r - \vec\rho\|) = \log\sqrt{2/s} + \log\frac{1}{\|\vec r - \vec\rho\|} - \gamma + o(1), \tag{4.2}$$

where $o(1)$ is uniform in $\vec r$ and $\vec\rho$, provided both are restricted to a compact set.
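We can now rewrite (3.4) in the form

$$\begin{aligned}
F(\vec r) = {} & \frac{1}{\pi}\bigl\{K_0(\sqrt{2s}\,\|\vec y - \vec r\|) - \log\sqrt{2/s}\bigr\} \\
& - \frac{u}{\pi}\int_\Omega \bigl\{K_0(\sqrt{2s}\,\|\vec\rho - \vec r\|) - \log\sqrt{2/s}\bigr\}\, F(\vec\rho)\,d\vec\rho \\
& + \frac{1}{\pi}\log\sqrt{2/s}\,\Bigl\{1 - u\int_\Omega F(\vec\rho)\,d\vec\rho\Bigr\}.
\end{aligned} \tag{4.3}$$

I shall now ask you to believe me that one can prove that as $s \downarrow 0$, $F(\vec r) = F(\vec r; u, s)$
remains bounded and since as $s \downarrow 0$, $F(\vec r; u, s)$ is monotonically
increasing, the existence of the limit

$$\lim_{s \downarrow 0} F(\vec r; u, s) = f(\vec r; u) \tag{4.4}$$

follows. From (4.3) and (4.2) we conclude that

$$\lim_{s \downarrow 0} \log\sqrt{2/s}\,\Bigl\{1 - u\int_\Omega F(\vec\rho; u, s)\,d\vec\rho\Bigr\} = g_\Omega(\vec y; u) \tag{4.5}$$

exists.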
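5. Let

$$G(\vec y; u, s) = s\int_0^\infty e^{-st}\, E\Bigl\{\exp\Bigl[-u \int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau\Bigr]\Bigr\}\, dt \tag{5.1}$$

and note first that

$$G(\vec y; u, s) = 1 - u\int_\Omega F(\vec\rho; u, s)\,d\vec\rho. \tag{5.2}$$

A formal derivation (which can, however, be rigorized) of (5.2) is as follows.
By (3.3) (with $A = \Omega$) we have

$$\begin{aligned}
\int_\Omega F(\vec\rho; u, s)\,d\vec\rho
&= \int_0^\infty e^{-st}\, E\Bigl\{\exp\Bigl[-u \int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau\Bigr]\, V_\Omega(\vec y + \vec r(t))\Bigr\}\, dt \\
&= -\frac{1}{u}\int_0^\infty e^{-st}\, \frac{d}{dt}\, E\Bigl\{\exp\Bigl[-u \int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau\Bigr]\Bigr\}\, dt,
\end{aligned}$$

and (5.2) follows by integration by parts if only

$$\lim_{t \to \infty} E\Bigl\{\exp\Bigl[-u \int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau\Bigr]\Bigr\} = 0, \qquad u > 0. \tag{5.3}$$

The truth of (5.3) follows at once from the well-known and relatively
elementary fact that in the plane

$$\lim_{t \to \infty} \int_0^t V_\Omega(\vec y + \vec r(\tau))\,d\tau = \infty. \tag{5.4}$$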

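It is also easy to show (the formal steps are essentially the same as in Section
3) that $G$ satisfies the integral equation

$$G(\vec y; u, s) = 1 - \frac{u}{\pi}\int_\Omega K_0(\sqrt{2s}\,\|\vec\rho - \vec y\|)\, G(\vec\rho; u, s)\,d\vec\rho \tag{5.5}$$

which we rewrite in the form

$$\begin{aligned}
G(\vec y; u, s) = {} & 1 - \frac{u}{\pi}\int_\Omega \bigl(\log\sqrt{2/s}\bigr)\, G(\vec\rho; u, s)\,d\vec\rho \\
& - \frac{u}{\pi}\int_\Omega \bigl\{K_0(\sqrt{2s}\,\|\vec\rho - \vec y\|) - \log\sqrt{2/s}\bigr\}\, G(\vec\rho; u, s)\,d\vec\rho.
\end{aligned} \tag{5.6}$$

Multiplying both sides of (5.6) by $\log\sqrt{2/s}$ and recalling that by (4.5) the
limit

$$\lim_{s \downarrow 0} \log\sqrt{2/s}\; G(\vec y; u, s) = \lim_{s \downarrow 0} \log\sqrt{2/s}\,\Bigl(1 - u\int_\Omega F(\vec r; u, s)\,d\vec r\Bigr) = g_\Omega(\vec y; u) \tag{5.7}$$

exists, we obtain

$$g_\Omega(\vec y; u) = \tilde P_\Omega(u) - \frac{u}{\pi}\int_\Omega \Bigl(\log\frac{1}{\|\vec\rho - \vec y\|} - \gamma\Bigr)\, g_\Omega(\vec\rho; u)\,d\vec\rho, \tag{5.8}$$

where

$$\tilde P_\Omega(u) = \lim_{s \downarrow 0} \log\sqrt{2/s}\,\Bigl(1 - \frac{u}{\pi}\int_\Omega \bigl(\log\sqrt{2/s}\bigr)\, G(\vec\rho; u, s)\,d\vec\rho\Bigr), \tag{5.9}$$

the existence of the limit in (5.9) being part of the assertion.
It follows, in particular, that

$$\frac{u}{\pi}\int_\Omega g_\Omega(\vec\rho; u)\,d\vec\rho = 1, \tag{5.10}$$

and

$$g_\Omega(\vec y; u) = P_\Omega(u) - \frac{u}{\pi}\int_\Omega \log\frac{1}{\|\vec\rho - \vec y\|}\; g_\Omega(\vec\rho; u)\,d\vec\rho, \tag{5.11}$$

where

$$P_\Omega(u) = \tilde P_\Omega(u) + \gamma. \tag{5.12}$$
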
6. It is a trivial observation that if

$$\Omega' \subset \Omega$$

then

$$g_\Omega(\vec y; u) \le g_{\Omega'}(\vec y; u).$$

It is somewhat less trivial but still only an exercise in classical analysis to
calculate $g_C(\vec y; u)$ where $C$ is a circle of radius $a$ and center at the origin.
The answer is, for $\vec y$ outside the circle,

$$g_C(\vec y; u) = \frac{1 + 2u\int_0^a (\log 1/r)\, I_0(\sqrt{2u}\,r)\, r\,dr}{2u\int_0^a I_0(\sqrt{2u}\,r)\, r\,dr} - \log\frac{1}{\|\vec y\|}, \tag{6.1}$$

where $I_0$ is again a familiar Bessel function (of first kind and imaginary argument,
thus the companion of $K_0$).

We can use $g_C(\vec y; u)$ as an upper estimate for $g_\Omega(\vec y; u)$ if $C \subset \Omega$. Note now that
by comparing (6.1) with (5.11) we have for $\vec y \in C$

$$g_C(\vec y) = \frac{I_0(\sqrt{2u}\,\|\vec y\|)}{2u\int_0^a I_0(\sqrt{2u}\,r)\, r\,dr}. \tag{6.2}$$
Letting $u \uparrow \infty$ we find that

$$\lim_{u \uparrow \infty} g_C(\vec y; u) = \log\frac{1}{a} - \log\frac{1}{\|\vec y\|}, \qquad \vec y \notin C, \tag{6.3a}$$

and

$$\lim_{u \uparrow \infty} g_C(\vec y; u) = 0, \qquad \vec y \in C, \tag{6.3b}$$

and therefore, for the circle at least,

$$U_C(\vec y) = \lim_{u \uparrow \infty} g_C(\vec y; u) \tag{6.4}$$

is the logarithmic potential of $C$.

Without going into too much detail, let me merely state that for quite a general
class of regions the limit of $g_\Omega(\vec y; u)$ as $u \uparrow \infty$ is the logarithmic potential of
the region.

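If we go back to formula (5.11) we see that it can be rewritten in the form

$$g_\Omega(\vec y; u) = P_\Omega(u) - \int_\Omega \log\frac{1}{\|\vec\rho - \vec y\|}\; \mu_u(d\vec\rho), \tag{6.5}$$

where the measure $\mu_u$ is defined by the formula

$$\mu_u(B) = \frac{u}{\pi}\int_{B \cap \Omega} g_\Omega(\vec\rho; u)\,d\vec\rho, \tag{6.6}$$

and in view of (5.10) we have

$$\mu_u(\Omega) = 1. \tag{6.7}$$

Letting $u \uparrow \infty$ along an appropriate subsequence we have

$$\mu_u(B) \to \mu(B) \tag{6.8}$$

in the usual sense of convergence of measures and finally

$$U_\Omega(\vec y) = \lim_{u \uparrow \infty} g_\Omega(\vec y; u) = R(\Omega) - \int_\Omega \log\frac{1}{\|\vec\rho - \vec y\|}\; \mu(d\vec\rho), \tag{6.9}$$

where $R(\Omega)$ is clearly the Robin constant of the region $\Omega$.
We also have

$$R(\Omega) = \lim_{u \uparrow \infty} P_\Omega(u) \tag{6.10}$$

although so far the limit is taken along an appropriate subsequence.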


7. It is at this point that my story really begins.
Since one knows from classical theory that the equilibrium distribution $\mu$ is
unique it should be possible to dispense with the proviso that the limit $u \uparrow \infty$ be
taken along an appropriate subsequence.
It should, in particular, be possible to prove (6.10) as it stands.


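To do this note first that $P_\Omega(u)$ is bounded for $u$'s bounded away from 0.
To see this integrate (5.11) over $\Omega$ obtaining (with the use of (5.10))

$$|\Omega|\, P_\Omega(u) = \frac{\pi}{u} + \frac{u}{\pi}\int_\Omega g_\Omega(\vec\rho; u) \int_\Omega \log\frac{1}{\|\vec\rho - \vec y\|}\,d\vec y\,d\vec\rho \tag{7.1}$$

($|\Omega|$ as usual denotes the Lebesgue measure of $\Omega$). Since clearly there is an $M$
such that

$$\Bigl|\int_\Omega \log\frac{1}{\|\vec\rho - \vec y\|}\,d\vec y\Bigr| < M \tag{7.2}$$

for all $\vec\rho \in \Omega$, we have (again using (5.10)) that

$$|\Omega|\, P_\Omega(u) < (\pi/u) + M. \tag{7.3}$$

Next let $\lambda_1, \lambda_2, \lambda_3, \ldots$ be the eigenvalues and $\phi_1, \phi_2, \phi_3, \ldots$ normalized
eigenfunctions of the kernel

$$\frac{1}{\pi}\log\frac{1}{\|\vec\rho - \vec y\|}, \qquad \vec\rho, \vec y \in \Omega. \tag{7.4}$$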
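Equations (5.11) and (5.10) yield easily that

$$(P_\Omega(u))^{-1} = \frac{1}{\pi}\sum_j \frac{\bigl(\int_\Omega \phi_j(\vec\rho)\,d\vec\rho\bigr)^2}{(1/u) + \lambda_j} \tag{7.5}$$

and it follows almost at once that

$$\frac{1}{R(\Omega)} = \frac{1}{\pi}\sum_j \frac{\bigl(\int_\Omega \phi_j(\vec\rho)\,d\vec\rho\bigr)^2}{\lambda_j}. \tag{7.6}$$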


When I discovered this formula more than fifteen years ago I had already
proved a similar formula for the capacity of a three-dimensional region, namely

$$C(\Omega) = \frac{1}{2\pi}\sum_j \frac{\bigl(\int_\Omega \phi_j(\vec\rho)\,d\vec\rho\bigr)^2}{\lambda_j}, \tag{7.7}$$

where the $\lambda$'s and $\phi$'s are the eigenvalues and normalized eigenfunctions of the
kernel

$$\frac{1}{2\pi}\,\frac{1}{\|\vec\rho - \vec y\|}, \qquad \vec\rho, \vec y \in \Omega. \tag{7.8}$$

The kernel (7.8) is well known to be positive definite but this could clearly
not always be true of the kernel (7.4) because the Robin constant can be negative
(e.g., for a circle of radius $a > 1$). Thus, for some regions at least, not all the
$\lambda$'s could be positive and the question of negative eigenvalues had to be looked
into.
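
The remark about the circle can be made concrete. The sketch below is an illustration resting on a classical fact that is assumed here rather than taken from the text: the equilibrium distribution of a circle of radius $a$ is the uniform distribution on its circumference. Averaging $\log(1/\|\vec x - \vec\rho\|)$ over the circumference should then give $\log(1/a)$ at every interior point $\vec x$, which is the Robin constant and is negative for $a > 1$, and $\log(1/\|\vec x\|)$ at exterior points, in agreement with (6.3a). Function names and sample points are arbitrary.

```python
import math


def circle_potential(x, y, a, n=2000):
    # Average of log(1 / distance) from (x, y) to n equally spaced points
    # on the circle of radius a centred at the origin (midpoint rule).
    total = 0.0
    for k in range(n):
        theta = 2.0 * math.pi * (k + 0.5) / n
        dx = x - a * math.cos(theta)
        dy = y - a * math.sin(theta)
        total += -0.5 * math.log(dx * dx + dy * dy)
    return total / n


if __name__ == "__main__":
    a = 2.0
    inside = circle_potential(0.5, 0.3, a)
    outside = circle_potential(5.0, 1.0, a)
    print("interior value:", inside, " log(1/a):", math.log(1.0 / a))
    print("exterior value:", outside,
          " log(1/|x|):", -math.log(math.hypot(5.0, 1.0)))
```

With $a = 2$ the interior value is negative, so the kernel (7.4) over such a circle cannot be positive definite.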

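Now, one look at (7.5) will show that if $\lambda_j < 0$ then either

$$\text{(a)}\ \ P_\Omega(-1/\lambda_j) = 0 \qquad \text{or} \qquad \text{(b)}\ \ \int_\Omega \phi_j(\vec\rho)\,d\vec\rho = 0.$$

Now, (b) can be excluded by the following argument: The kernel

$$\frac{1}{\pi} K_0(\sqrt{2s}\,\|\vec\rho - \vec y\|), \qquad \vec\rho, \vec y \in \Omega,$$

is positive definite for all $s > 0$ and therefore we have for every $\phi \in L_2(\Omega)$

$$\frac{1}{\pi}\int_\Omega\!\int_\Omega \phi(\vec\rho)\, K_0(\sqrt{2s}\,\|\vec\rho - \vec y\|)\, \phi(\vec y)\,d\vec\rho\,d\vec y \ge 0.$$

If however

$$\int_\Omega \phi(\vec\rho)\,d\vec\rho = 0, \tag{7.9}$$

then

$$\frac{1}{\pi}\int_\Omega\!\int_\Omega \phi(\vec\rho)\, K_0(\sqrt{2s}\,\|\vec\rho - \vec y\|)\, \phi(\vec y)\,d\vec\rho\,d\vec y
= \frac{1}{\pi}\int_\Omega\!\int_\Omega \phi(\vec\rho)\,\bigl\{K_0(\sqrt{2s}\,\|\vec\rho - \vec y\|) - \log\sqrt{2/s} + \gamma\bigr\}\, \phi(\vec y)\,d\vec\rho\,d\vec y \ge 0$$

and letting $s \downarrow 0$ we obtain

$$\frac{1}{\pi}\int_\Omega\!\int_\Omega \phi(\vec\rho)\, \log\frac{1}{\|\vec\rho - \vec y\|}\; \phi(\vec y)\,d\vec\rho\,d\vec y \ge 0. \tag{7.10}$$

In other words, the kernel (7.4) is positive definite on the subspace of $L_2(\Omega)$
defined by (7.9).
If an eigenfunction $\phi_j$ belonging to a negative eigenvalue $\lambda_j$ did satisfy (7.9)
we would have a contradiction since (7.10) would give $\lambda_j \ge 0$.
Having thus excluded (b) there remains only (a). In other words if $\lambda_j < 0$
then $P_\Omega(-1/\lambda_j) = 0$.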
I shall now prove that $P_\Omega(u) = 0$ cannot have two distinct roots. For if $u_1$ and
$u_2$ were such roots ($u_1 \ne u_2$) then by (5.11) $g_\Omega(\vec y; u_1)$ and $g_\Omega(\vec y; u_2)$ would be
eigenfunctions of (7.4) belonging to distinct eigenvalues and hence orthogonal to each
other. However, both $g_\Omega(\vec y; u_1)$ and $g_\Omega(\vec y; u_2)$ are nonnegative and hence the only
way they could be orthogonal would be if they were to vanish on complementary
subsets of $\Omega$. But, if, e.g., $u_2 > u_1$, then $g_\Omega(\vec y; u_2) \le g_\Omega(\vec y; u_1)$ and $g_\Omega(\vec y; u_2)$ would
have to vanish wherever $g_\Omega(\vec y; u_1)$ did, which is clearly impossible.

Thus $P_\Omega(u) = 0$ cannot have two distinct roots. It cannot have a multiple root
either because by a slight perturbation of the region the degeneracy would be
resolved and we would again have a contradiction.

Thus the kernel

$$\frac{1}{\pi}\log\frac{1}{\|\vec\rho - \vec y\|}, \qquad \vec\rho, \vec y \in \Omega,$$

can have at most one negative eigenvalue.


8. It remains to find a convenient criterion for deciding whether there is or 
there is not a negative eigenvalue. 

If $R < 0$ there clearly must be a negative eigenvalue as seen by looking at
(7.6).

Assume now that $R \ge 0$ and that e.g., $\lambda_1 < 0$.

It is clear that the graph of $(P(u))^{-1}$ versus $u$ must look like

[Figure: sketch of the graph of $(P(u))^{-1}$ versus $u$]

and that therefore $(P(u))^{-1}$ must vanish for some $u > -1/\lambda_1$, which contradicts
the boundedness of $P(u)$.

You could inquire whether it is possible to have an eigenvalue which is equal 
to 0. The answer is no and the proof is easy. 

If 0 were an eigenvalue there would exist a nontrivial function $\phi \in L_2(\Omega)$
such that

$$\frac{1}{\pi}\int_\Omega \phi(\vec y)\, \log\frac{1}{\|\vec\rho - \vec y\|}\,d\vec y = 0, \qquad \vec\rho \in \Omega. \tag{8.1}$$

By taking the Laplacian of both sides of (8.1) we obtain $\phi = 0$ almost everywhere
contrary to the assumption.

Summarizing we can say that a necessary and sufficient condition for the
kernel (7.4) to have a negative eigenvalue is that the Robin constant be negative.
Since zero is never an eigenvalue we are faced with an amusing situation
when we vary the size of the region continuously (keeping, e.g., its shape
invariant) so that the Robin constant goes from being negative through zero to
being positive. The sole negative eigenvalue does pass through zero (or to be
more precise it approaches zero as $R \to 0$) but since zero is not an eigenvalue
something strange must happen to the corresponding eigenfunction. In fact it
"vanishes" in the sense of mystery stories. However, a careful analysis of Dr. Z.
Ciesielski shows that in a certain sense this eigenfunction approaches as $R \to 0$
the equilibrium mass distribution of the region with $R = 0$. So it doesn't really
"vanish" but is transfigured.


9. It would be nice if I could tell you that the striking properties of the eigen- 
values of the logarithmic kernel (7.4) could be neither found nor proved without 
the intervention of probability theory. 

That indeed they can has been demonstrated quite recently by Dr. J. L.
Troutman ([1], [2]) who gave purely classical proofs of the facts concerning
the negative eigenvalues of the kernel (7.4). Since I never published my proofs 
(hoping to include them in a joint book with Dr. Ciesielski but which, for a 
variety of reasons, will probably never be completed) though I discussed them 
in a number of lectures, Dr. Troutman’s discovery is entirely independent of the 
whole probabilistic development. 

There is, however, a slight “fallout” of the probabilistic approach which per- 
haps is worthy of mention. 

Hidden in the calculations is the heart of the probabilistic proof, namely, that
the logarithmic kernel

$$\frac{1}{\pi}\log\frac{1}{\|\vec r - \vec\rho\|}$$

is a finite part of the Laplace transform

$$\int_0^\infty e^{-st}\, \frac{\exp[-\|\vec r - \vec\rho\|^2/2t]}{2\pi t}\,dt$$

of the basic transition probability density of the plane Brownian motion.
This immediately suggests looking for analogies and the simplest one is the 
ordinary random walk on a line with equiprobable steps of +1 or —1. 
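Here is a table indicating the analogies, each entry for plane Brownian motion
being followed by its counterpart for the simple random walk:

Plane Brownian motion: region $\Omega$.
Simple random walk: finite set $m_1, m_2, \ldots, m_r$ of integers.

Plane Brownian motion: $p(\vec\rho\,|\,\vec r; t) = \dfrac{\exp(-\|\vec r - \vec\rho\|^2/2t)}{2\pi t}$.
Simple random walk: $p(n\,|\,m; t) = \dfrac{1}{2\pi}\displaystyle\int_0^{2\pi} (\cos\xi)^t \exp[i\xi(m - n)]\,d\xi$ = prob. that the displacement in $t$ steps is $m - n$.

Plane Brownian motion (Laplace transform): $\displaystyle\int_0^\infty e^{-st}\, p(\vec\rho\,|\,\vec r; t)\,dt = \frac{1}{\pi} K_0(\sqrt{2s}\,\|\vec r - \vec\rho\|)$.
Simple random walk (generating function): $\displaystyle\sum_{t=0}^{\infty} s^t\, p(n\,|\,m; t) = \frac{1}{2\pi}\int_0^{2\pi} \frac{\cos (m - n)\xi}{1 - s\cos\xi}\,d\xi$.

Plane Brownian motion (finite part of the Laplace transform): $\dfrac{1}{\pi}\log\dfrac{1}{\|\vec r - \vec\rho\|}$.
Simple random walk (finite part of the generating function): $-\dfrac{1}{2\pi}\displaystyle\int_0^{2\pi} \frac{1 - \cos (m - n)\xi}{1 - \cos\xi}\,d\xi = -|m - n|$.

Plane Brownian motion: logarithmic kernel over $\Omega$.
Simple random walk: matrix $-((|m_i - m_j|))$, $1 \le i, j \le r$.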


It is now obvious what the theorem is. 

The matrix $-((|m_i - m_j|))$ has at most one negative eigenvalue and since it
is finite and has trace zero it must have exactly one negative eigenvalue.

Integers $m_i$ can be replaced by rational numbers and then, by continuity, by
real numbers.

We have thus finally the following theorem (see [3]):

If $a_1, a_2, \ldots, a_r$ are distinct real numbers then the matrix $((|a_i - a_j|))$ has
exactly one positive eigenvalue.

This theorem can also be proved in a more direct way (as was shown to me 
some years ago by Professor Pélya) but you must agree with me that only 
through probability theory does one see that this theorem and the theorem that 
the logarithmic kernel has at most one negative eigenvalue are really one and 
the same. And this, I hope you will again agree with me, is quite remarkable. 
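
The theorem on the matrix of absolute differences is easy to test numerically. The following is a minimal sketch, not a proof: it builds the matrix for a hypothetical sample of distinct reals and computes its eigenvalues with the classical cyclic Jacobi rotation method, written out in plain Python so that no external library is assumed. The sample points, tolerances, and function names are illustrative choices.

```python
import math


def distance_matrix(points):
    # Symmetric matrix with entries |a_i - a_j| and zero diagonal.
    return [[abs(a - b) for b in points] for a in points]


def jacobi_eigenvalues(matrix, sweeps=50, tol=1e-22):
    # Cyclic Jacobi method for a real symmetric matrix: repeatedly apply
    # plane rotations J so that J^T A J has a zero (p, q) entry.
    n = len(matrix)
    a = [row[:] for row in matrix]
    for _ in range(sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if abs(a[p][q]) < 1e-300:
                    continue
                theta = 0.5 * math.atan2(2.0 * a[p][q], a[q][q] - a[p][p])
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):      # rotate columns p and q
                    akp, akq = a[k][p], a[k][q]
                    a[k][p] = c * akp - s * akq
                    a[k][q] = s * akp + c * akq
                for k in range(n):      # rotate rows p and q
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k] = c * apk - s * aqk
                    a[q][k] = s * apk + c * aqk
    return sorted(a[i][i] for i in range(n))


if __name__ == "__main__":
    sample = [-2.0, 0.0, 1.0, 3.0, 7.5]   # hypothetical distinct reals
    eigs = jacobi_eigenvalues(distance_matrix(sample))
    print("eigenvalues:", eigs)
    print("positive:", sum(1 for e in eigs if e > 1e-9),
          "negative:", sum(1 for e in eigs if e < -1e-9))
```

Since the diagonal is zero the eigenvalues sum to zero, so the single positive eigenvalue balances all the negative ones, exactly as in the trace argument given above for integer points.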
GEOMETRIC EMBEDDING OF COMPLEXES
R. A. DUKE, University of Washington 


Introduction. It is well known that each finite $n$-dimensional simplicial complex
can be embedded in the $(2n+1)$-dimensional Euclidean space $E^{2n+1}$ and is,
in fact, geometrically embeddable there. (A simplicial complex is geometrically
embeddable in $E^k$ provided that there exists an embedding such that each of its
simplices is a convex, rectilinear simplex in $E^k$.) This result is the best possible
in the sense that for each integer $n \ge 1$ there exists a finite $n$-dimensional
simplicial complex which is not even topologically embeddable in $E^{2n}$, that is, not
homeomorphic to any subset of $E^{2n}$. It is also known that for $k < 2n$ an
$n$-dimensional simplicial complex which is topologically embeddable in $E^k$ is not
necessarily geometrically embeddable there.

Beyond this, however, little is known concerning either the topological or 
geometric embeddability of specific finite-dimensional simplicial complexes in
finite-dimensional Euclidean spaces. In Section 2 we will review some of the 
established results as well as state some conjectures which have been made con- 
cerning the embeddability of such complexes. 

Many questions remain unanswered even for the case of 2-dimensional com- 
plexes. One such deals with triangulations of 2-manifolds. By a triangulation of a 
manifold M we shall mean a finite simplicial complex homeomorphic to M. Each 
triangulation of an orientable 2-dimensional manifold is topologically embeddable
in $E^3$, and a theorem of Steinitz [18] concerning convex 3-polytopes has as
a corollary that each triangulation of the 2-sphere can even be embedded
geometrically in $E^3$. Grünbaum [10, Chpt. 13] has asked whether geometric
embeddings in $E^3$ exist for every triangulation of each orientable 2-manifold. In
Section 3 we will discuss the results which have been obtained concerning this 
more restricted, yet apparently quite difficult problem, which is still open even 
for the case of the torus. 


2. General Results. Henceforth $n$-complex will mean a finite, $n$-dimensional,
simplicial complex. The geometric embeddability of an arbitrary $n$-complex $C$ in
$E^{2n+1}$ follows from the fact that the images of the vertices of $C$ may be chosen to
be a set of points which are in general position in $E^{2n+1}$, that is, a set of points in
$E^{2n+1}$ such that each subset of at most $2n+2$ members is affinely independent.
For in this case, the image of each simplex of $C$ may be taken to be the convex
hull of the images of those vertices which it contains. Since $C$ is of dimension $n$,
none of its simplices has more than $n+1$ vertices, and for any pair of simplices of
$C$ there exists a larger simplex in $E^{2n+1}$ (not necessarily in the image of $C$) having
the images of the given pair among its faces. Thus the images of any two simplices
of $C$ are either disjoint or intersect in a simplex which is a common face of
both. (This result has been generalized by M. A. Perles to include certain
complexes whose cells are not necessarily simplices [see 10, p. 204].)
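
One concrete way to obtain points in general position, offered here as an illustration and not as the construction used in the sources cited, is to place the vertices on the moment curve $(t, t^2, \ldots, t^d)$ with $d = 2n+1$. The sketch below checks with exact integer arithmetic that every $d+1$ of the chosen points are affinely independent; smaller subsets are then affinely independent as well. The function names and the integer parameters are hypothetical.

```python
from itertools import combinations


def det(m):
    # Exact determinant of a small integer matrix by cofactor expansion
    # along the first row.
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        if entry == 0:
            continue
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def moment_point(t, d):
    # Point (t, t^2, ..., t^d) on the moment curve in E^d.
    return [t ** k for k in range(1, d + 1)]


def in_general_position(points):
    # True if every subset of d + 1 points in E^d is affinely independent,
    # i.e. the d difference vectors from one of them are linearly independent.
    d = len(points[0])
    for subset in combinations(points, d + 1):
        base = subset[0]
        diffs = [[x - b for x, b in zip(p, base)] for p in subset[1:]]
        if det(diffs) == 0:
            return False
    return True


if __name__ == "__main__":
    n = 1                      # dimension of the complex
    d = 2 * n + 1              # dimension of the ambient space
    vertices = [moment_point(t, d) for t in range(7)]
    print("general position in E^%d:" % d, in_general_position(vertices))
```

With vertices placed this way, taking convex hulls of the vertex sets of the simplices gives the geometric embedding described in the paragraph above.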

The first examples of $n$-complexes not embeddable in $E^{2n}$ are due to van
Kampen [13] and Flores [8, 9]. In order to describe these examples as well as
generalizations due to Grünbaum [10, 11] and Wu [20], we will need the following
additional terms. The complete $n$-complex with $k$ vertices, $C(n, k)$, is the
$n$-dimensional simplicial complex having $k$ vertices and with the property that each
subset of at most $n+1$ of its vertices determines a simplex in the complex. The
join of two disjoint simplices $T$ and $S$ having vertices $v_1, \ldots, v_n$ and $w_1, \ldots, w_m$
respectively is the simplex having $\{v_1, \ldots, v_n, w_1, \ldots, w_m\}$ as its set of
vertices, and is written $T \vee S$. For disjoint simplicial complexes $K$ and $C$, the
join $K \vee C$ is the complex consisting of all simplices of the type $T \vee S$, where $T$ is a
simplex of $K$ and $S$ a simplex in $C$. Here the empty set $\emptyset$ is considered to be a
($(-1)$-dimensional) simplex in each complex. If $K$ is an $n$-complex and $C$ an
$m$-complex, then $K \vee C$ is an $(n+m+1)$-complex. The operation of forming the join of
disjoint complexes is both associative and commutative in the sense that for any
pairwise disjoint complexes $K_1$, $K_2$, and $K_3$, there exist isomorphisms between
$K_1 \vee K_2$ and $K_2 \vee K_1$ and between $(K_1 \vee K_2) \vee K_3$ and $K_1 \vee (K_2 \vee K_3)$. Using this
terminology we have the following result.


THEOREM 1. Let $n_i$, $m_i$, and $p$ be nonnegative integers such that $n_i + 3 \le m_i \le 2n_i + 3$,
for $i = 1, 2, \ldots, p$. Then the $(n_1 + n_2 + \cdots + n_p + p - 1)$-complex

$$C(n_1, m_1) \vee C(n_2, m_2) \vee \cdots \vee C(n_p, m_p)$$

is not topologically embeddable in Euclidean $(m_1 + m_2 + \cdots + m_p - p - 2)$-space,
but is even geometrically embeddable in Euclidean $(m_1 + \cdots + m_p - p - 1)$-space.


The proof of this theorem, patterned on the original treatment of Flores, is 
rather long and makes use of his notion of “self-entangled complexes” and the 
Borsuk-Ulam theorem on antipodes (see [11]). 

In particular, if $n = n_1 + \cdots + n_p + p - 1$ and $m_i = 2n_i + 3$ for $i = 1, 2, \ldots, p$,
the $n$-complexes obtained are not embeddable in $E^{2n}$. (For $n = 1$, one obtains
the two "Kuratowski skew curves" which can be used to characterize nonplanar
graphs. For $p = 1$ and $m_1 = 2n_1 + 3$, we have the result of Flores and van Kampen
that the complex $C(n, 2n+3)$, which may be interpreted as the $n$-skeleton of the
$(2n+2)$-dimensional simplex, is not topologically embeddable in $E^{2n}$.)

Concerning the $n$-complexes with $m_i = 2n_i + 3$ for $i = 1, 2, \ldots, p$, Grünbaum
[11] has proved the following (see also [21]):


THEOREM 2. Let $n, p, n_1, \ldots, n_p$ be nonnegative integers such that
$n = n_1 + \cdots + n_p + p - 1$. Then every proper subcomplex of the simplicial $n$-complex

$$C(n_1, 2n_1 + 3) \vee C(n_2, 2n_2 + 3) \vee \cdots \vee C(n_p, 2n_p + 3)$$

is geometrically embeddable in $E^{2n}$.


The "minimality" property of this theorem does not hold for all of the complexes
of Theorem 1 for which $m_i < 2n_i + 3$ for some $i$. These results suggest the
following, also due to Grünbaum.


CONJECTURE. Each finite $n$-dimensional simplicial complex which is topologically
embeddable in $E^{2n}$ is also geometrically embeddable there.


For $k < 2n$, an $n$-complex which is topologically embeddable in $E^k$ need not be
geometrically embeddable there. In [4], Cairns gave an example of a 3-complex
homeomorphic to the 3-simplex but not geometrically embeddable in $E^3$. This
example was modified by van Kampen in [14]. In connection with the enumeration
of a certain class of convex 4-polytopes, Grünbaum and Sreedharan [12]
discovered a 3-complex $M$ which is topologically equivalent to the 3-sphere and
which has the property that its 2-skeleton $C$, that is, the complex $C$ consisting
of all 0, 1, and 2-dimensional simplices of $M$, is not geometrically embeddable in
$E^3$. Barnette [2] has found a second 3-sphere with similar properties. By taking
joins of the form $C \vee \{v_1\} \vee \cdots \vee \{v_{k-2}\}$, where $C$ is the 2-skeleton of $M$ and
each $v_i$ is a single point, one can establish the following:

THEOREM 3. For each $k \ge 2$, there exists a $k$-complex which is topologically
embeddable in $E^{k+1}$, but which is not geometrically embeddable in $E^{k+1}$.


That no such complexes exist for k =1 is a result of the well-known theorem 
on the straight line representation of planar graphs due to Wagner [19] and 
Fary [7]. 

In the most general form, the question raised by these results might be put
as follows. For what integers $k$, $n$, and $m$ does there exist a simplicial $k$-complex
which is topologically embeddable in $E^n$ but which is not geometrically
embeddable in $E^m$? Solutions for the cases $k+2 = n = m$ and $m = n+1$ would be of
particular interest, even for $k = 2$.

By taking the "dual" complex (see [10]) of the 3-sphere $M$, one obtains a
3-dimensional cell-complex $M^*$, not a simplicial complex, which is also
homeomorphic to the 3-sphere and is such that neither $M^*$ nor its 2-skeleton is
geometrically embeddable in any finite-dimensional Euclidean space. ($M^*$ was first
described by Brückner [3] who listed it, erroneously, among the boundary
complexes of certain convex 4-dimensional polytopes.) The existence of complexes
such as $M^*$ makes the following conjecture even more interesting.


CONJECTURE. For each integer $n \ge 1$, every finite, simplicial $n$-dimensional complex
homeomorphic to the $n$-dimensional sphere is geometrically embeddable in $E^{n+1}$.


3. Triangulated 2-manifolds. As noted above, little is known concerning 
even the geometric embeddability of 2-complexes. Some information can be de- 
rived in this case, however, from a theorem of Steinitz [18] on convex 3-poly- 
topes. A subset P of a finite-dimensional Euclidean space is called a convex poly- 
tope, provided that P is the intersection of a finite number of closed halfspaces 
and is bounded. In this case P is the convex hull of a finite set of points, its 
vertices. The graph of P is the 1-dimensional complex consisting of the vertices 
and edges of P. The theorem of Steinitz, one of the most important results in the 
study of convex polytopes, implies that an abstract 1-complex $G$ is the graph of
some 3-dimensional convex polytope if and only if $G$ is planar and 3-connected
(that is, $G$ can be embedded in $E^2$ and has at least 4 vertices, any two of which
can be joined by at least 3 independent paths). It follows from this theorem that
if $C$ is any 2-dimensional cell-complex whose point set is homeomorphic to the
2-sphere, then there exists a one-to-one incidence-preserving correspondence
between $C$ and the boundary complex consisting of the proper faces of some
3-dimensional convex polytope. Hence such a complex $C$ is geometrically
embeddable in $E^3$. (Compare this result with the case of cell-complexes homeomorphic to
the 3-sphere, which includes the complex $M^*$ of Section 2.)

The question then arises whether each 2-dimensional cell-complex homeomorphic
to some closed, connected, orientable 2-manifold is geometrically
embeddable in $E^3$. In his book on convex polytopes [10, Exercise 13.2.3], Grünbaum
has pointed out that the answer to this question is negative since a 2-dimensional
cell-complex homeomorphic to such a manifold $M$ and such that each
vertex is the endpoint of exactly 3 edges is geometrically embeddable in $E^3$ if
and only if $M$ is the 2-sphere. Each simplicial 2-complex which is a triangulation
of an orientable 2-manifold is topologically embeddable in $E^3$ (see [18]), and
Grünbaum has asked whether geometric embeddings in $E^3$ exist for all such
triangulations. Even when restricted to the case of the torus, the orientable
2-manifold of lowest positive genus, this question seems to be quite difficult.

An easy argument involving the Euler characteristic shows that a triangulation
of the torus must have at least 7 vertices and it can be checked that there
exists only one such triangulation, $T_7$, having exactly this many vertices. In $T_7$
each two distinct vertices are joined by an edge. It has been known for some time
that a geometric embedding of the 2-complex $T_7$ in $E^3$ is possible. Möbius [16]
described this complex as an example of a “polyhedron of the second class”
(that is, of genus 1) and stated without proof that it could be constructed by
fitting together 7 tetrahedra. A complete description of a geometric embedding
has been given by Császár [6].
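
The Euler characteristic argument mentioned above can be sketched in a few lines of Python. For a triangulated torus one has V - E + F = 0 and 3F = 2E, hence E = 3V, while a simplicial complex on V vertices has at most V(V-1)/2 edges. The triangle list used below for the 7-vertex torus (triangles {i, i+1, i+3} and {i, i+2, i+3}, indices mod 7) is a commonly used labelling and is an assumption of this sketch; it is not given in the text.

```python
from itertools import combinations
from math import comb


def min_torus_vertices():
    """Smallest V not ruled out by the counting argument for a triangulated torus."""
    # V - E + F = 0 and 3F = 2E give E = 3V; a simplicial complex has
    # at most C(V, 2) edges, so we need 3V <= C(V, 2).
    v = 3
    while 3 * v > comb(v, 2):
        v += 1
    return v


# Assumed labelling of the 7-vertex torus (hypothetical, not from the text):
# triangles {i, i+1, i+3} and {i, i+2, i+3}, indices taken mod 7.
TRIANGLES = [
    frozenset(((i + a) % 7, (i + b) % 7, (i + c) % 7))
    for i in range(7)
    for (a, b, c) in ((0, 1, 3), (0, 2, 3))
]


def edge_counts(triangles):
    """Map each edge to the number of triangles that contain it."""
    counts = {}
    for tri in triangles:
        for edge in combinations(sorted(tri), 2):
            counts[edge] = counts.get(edge, 0) + 1
    return counts


EDGES = edge_counts(TRIANGLES)
EULER = 7 - len(EDGES) + len(set(TRIANGLES))

print(min_torus_vertices(), len(EDGES), len(set(TRIANGLES)), EULER)
```

In this labelling every pair of vertices is joined by an edge and every edge lies in exactly two triangles, in agreement with the properties of $T_7$ stated above.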

Another geometric embedding of $T_7$ can be obtained by making use of the
notion of a cyclic polytope. A cyclic d-dimensional polytope is a convex polytope
with d+1 or more vertices each lying on the moment curve $M_d$ in $E^d$, where
$M_d = \{(t, t^2, \ldots, t^d) : t \text{ a real number}\}$. The cyclic 2d-polytopes provide the
simplest examples of d-neighborly polytopes, where a polytope P is said to be
n-neighborly provided that each subset of n vertices of P is the set of vertices of
an (n-1)-dimensional face of P (which must therefore be an (n-1)-dimensional
simplex). (The existence of neighborly polytopes, although rediscovered only
recently, appears to have been known to Carathéodory [5] in 1911 and, in $E^4$, to
Brückner [3] in 1909. For more on their interesting history see Klee [15].) The
cyclic 4-polytope $P_7$ with 7 vertices is such that its 2-skeleton contains a
subcomplex isomorphic to $T_7$. Thus by selecting any 3-dimensional face F of a copy
of $P_7$ in $E^4$ and projecting $P_7$ onto the affine hull of F from a point of $E^4$ not in
$P_7$ but sufficiently close to the relative interior of F, one obtains a 3-complex, a
Schlegel diagram of $P_7$, which is geometrically embedded in $E^3$ and combinatorially
equivalent to the boundary complex of $P_7$ with F deleted. This Schlegel diagram
contains a copy of $T_7$ geometrically embedded in $E^3$.
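
The combinatorial claims about $P_7$ can be checked by machine. The sketch below lists the facets of a cyclic polytope by Gale's evenness condition (a standard characterization, not stated in the text) and then tests that $P_7$ is 2-neighborly and that a copy of $T_7$ sits in its 2-skeleton. The labelling of $T_7$ by triangles {i, i+1, i+3} and {i, i+2, i+3} mod 7 is an assumption made for illustration.

```python
from itertools import combinations


def cyclic_polytope_facets(n, d):
    """Facets of the cyclic d-polytope with n vertices (Gale's evenness condition).

    Vertices are labelled 0..n-1 in their order along the moment curve.
    A d-subset S is a facet iff, for any two vertices i < j outside S,
    the number of elements of S strictly between i and j is even.
    """
    facets = []
    for subset in combinations(range(n), d):
        chosen = set(subset)
        outside = [v for v in range(n) if v not in chosen]
        if all(
            sum(1 for s in subset if i < s < j) % 2 == 0
            for i, j in combinations(outside, 2)
        ):
            facets.append(frozenset(subset))
    return facets


FACETS = cyclic_polytope_facets(7, 4)


def is_face(vertices):
    """Cyclic polytopes are simplicial, so a vertex set is a face iff it lies in a facet."""
    return any(set(vertices) <= facet for facet in FACETS)


# 2-neighborliness: every pair of vertices spans an edge.
ALL_PAIRS_ARE_EDGES = all(is_face(pair) for pair in combinations(range(7), 2))

# Assumed labelling of T_7 (hypothetical, not taken from the text).
T7 = [
    {(i + a) % 7, (i + b) % 7, (i + c) % 7}
    for i in range(7)
    for (a, b, c) in ((0, 1, 3), (0, 2, 3))
]
T7_IN_2_SKELETON = all(is_face(tri) for tri in T7)

print(len(FACETS), ALL_PAIRS_ARE_EDGES, T7_IN_2_SKELETON)
```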

In the same way, any triangulation of a 2-manifold which is contained in the
2-skeleton of a 4-polytope in this sense could be embedded geometrically in $E^3$.
While for each nonnegative integer g there exists a convex 4-polytope which
contains a triangulation of a 2-manifold of genus g, it can be shown [1] that the
only triangulated manifolds contained in the 2-skeleton of a cyclic 4-polytope
are the 2-sphere and the torus. Moreover, a study of the structure of the cyclic
polytopes shows that a triangulation of the torus is contained in a cyclic
4-polytope if and only if each has the same number of vertices and there exists a
simple closed path through the vertices of the triangulation such that every
triangle of the triangulation has exactly one edge on this path. This criterion can
be used to show that one of the 7 triangulations of the torus with 8 vertices cannot
be found in a cyclic 4-polytope. It is not known, however, whether each
triangulation of the torus, or even of each orientable 2-manifold, can be found in a
4-polytope, perhaps always in some neighborly 4-polytope. Altshuler [1] has used the
simple closed path criterion to show that each triangulation of the torus in which
each vertex is an endpoint of exactly 6 edges can be embedded geometrically in $E^3$,
and has considered another special class of 4-polytopes, the “stacked polytopes.”

Another approach, suggested by a proof of a special case of the theorem of 
Steinitz (see [10, Exercise 13.1.3]), consists of reducing the list of all triangula- 
tions of the torus to certain “minimal” triangulations by successively eliminating 
certain pairs of triangular 2-cells (and hence also some vertices and edges). By 
such a procedure one can obtain a finite list of minimal triangulations of the 
torus, each with 10 or fewer vertices. It can be shown, both directly and from the 
fact that each is contained in some convex 4-polytope (although not necessarily 
a cyclic one), that these minimal triangulations can be embedded geometrically 
in $E^3$. Attempts to use these facts to obtain the result for all triangulations of the
torus, however, have been unsuccessful. 

Barnette (unpublished) has shown that geometric embeddings exist for a 
large number of triangulations which are obtainable by subdividing certain non- 
simplicial 2-complexes homeomorphic to the torus. He has also established the 
existence of geometric embeddings in $E^3$ for all triangulations of the “pinched
torus,” that is, those 2-complexes resulting from the identification of two non- 
adjacent vertices of a triangulated 2-sphere. 

It appears that no counterexamples are known to the following conjecture 
which includes the last conjecture of Section 2 as well as the principal question 
of this section. 


CONJECTURE. For any positive integers k and n, each triangulated k-dimensional
manifold which is topologically embeddable in $E^n$ is geometrically embeddable there.

AN ANALYTIC PROOF OF YOUNG’S INEQUALITY


J. B. DIAZ, Rensselaer Polytechnic Institute AND 
F. T. METCALF, University of California, Riverside


The main result of the paper is the analytic proof of Young’s inequality, 
given in section 2. While the proof in section 2 may be read entirely indepen- 
dently of the considerations of section 1, the discussion in section 1 is essential 
for a complete overall understanding of the subject, both historically and 
mathematically. 


1. Introduction (or, Geometry versus Analysis). 
1.A. The inequality referred to in the title of this paper appears, for the first
time, on page 226 of W. H. Young [1], who expresses himself as follows: “If 


v= U(u) 


is a positive monotone increasing function of a positive real variable u, possess- 
ing a differential coefficient U’(u) which is positive (>0), and V(v) is the inverse 
function such that 


u = V(v),

then

\[ uv - ab \le \int_a^u U(z)\,dz + \int_b^v V(z)\,dz, \tag{1} \]

where

\[ b = U(a). \]


“In fact the function of u represented by $uv - ab - \int_a^u U(z)\,dz$ has a negative
second differential coefficient, while its first differential coefficient vanishes when
v = U(u), that is when u = V(v). It has accordingly a maximum whose value is

\[ vV(v) - ab - \int_{V(b)}^{V(v)} U(z)\,dz, \quad \text{that is} \quad \int_{V(b)}^{V(v)} zU'(z)\,dz; \]

or, what is the same thing, $\int_b^v V(z)\,dz$. Hence, the inequality in question follows.”

Two comments, relative to this quotation of W. H. Young, are appropriate 
at this point: 

In the first place, it is clear that, in order for Young’s argument to hold, one 
must suppose something else about the derivative U’(z), besides merely that it 
exists and is positive. Because, if nothing further is assumed, then the definite 
integral $\int_{V(b)}^{V(v)} zU'(z)\,dz$, which is used in Young’s proof, need not exist at all.

In the second place, if the derivative U’(z) is assumed, further, to be contin- 
uous (just existence is not enough), then one can write, with integration in the 
Riemann-Stieltjes sense, that 


\[ \int_{V(b)}^{V(v)} zU'(z)\,dz = \int_{V(b)}^{V(v)} z\,dU. \]


In view of this, it seems clear how later writers (see below) were led to consider 
Stieltjes integration in their proofs of Young’s inequality. 


1.B. The inequality of W. H. Young [1] is given in the book of Hardy,
Littlewood, and Pólya [2, p. 111] as follows:

“Suppose that $\phi(0) = 0$, and that $\phi(x)$ is continuous and strictly increasing for
$x \ge 0$; that $\psi(x)$ is the inverse function, so that $\psi(x)$ satisfies the same conditions;
and that $a \ge 0$, $b \ge 0$. Then

\[ ab \le \int_0^a \phi(x)\,dx + \int_0^b \psi(x)\,dx. \]

There is equality only if $b = \phi(a)$.

“The theorem becomes intuitive if we draw the curve $y = \phi(x)$ or $x = \psi(y)$,
and the lines x = 0, x = a, y = 0, y = b, and consider the various areas bounded by
them. A formal proof is included in that of the more general theorems which
follow.”

The formal proof alluded to, in this quotation from Hardy, Littlewood, and
Pólya [2], which is contained on pp. 112-113 of [2], involves several, at first
glance, seemingly extraneous notions; for example, the notion of a function of
bounded variation, and of Riemann-Stieltjes integration (in particular, the
formula for integration by parts in Riemann-Stieltjes integrals). The same
remark applies to the proof given in E. C. Francis and J. E. Littlewood [3, pp.
47-48]. (Added October 31, 1969: The same remark also applies to the proof
given on p. 70 of the recent book Lectures on Measure and Integration, by H.
Widom, to which our attention was kindly drawn by the referee.)


1.C. In his book on trigonometrical series, Zygmund [4, pp. 64-65] dismisses
any analytic attempt at proof, as follows: “Let $\phi(u)$, $u \ge 0$, $\psi(v)$, $v \ge 0$, be
two functions, continuous, vanishing at the origin, strictly increasing, tending
to $\infty$, and inverse to each other. Then, for $a, b \ge 0$, we have the inequality, due to
Young,

\[ ab \le \Phi(a) + \Psi(b), \quad \text{where} \quad \Phi(x) = \int_0^x \phi\,du, \quad \Psi(y) = \int_0^y \psi\,dv. \tag{1} \]

The geometrical proof is obvious. It is also easy to see that the sign $\le$ can be
replaced by = if and only if $b = \phi(a)$.”

The easily constructed geometrical pictures, which form the basis for such a 
proof, can be found, for example, in Tolsted [5]. 


1.D. In his book on integration, McShane [6, pp. 131-132, p. 216], seemingly
unwilling to rely on a purely geometric proof, gives an analytic proof of Young’s
inequality, patterned after the original proof of Young, quoted above; which,
however, in its best version, involves the extra assumption that $\phi$ is absolutely
continuous on every finite interval. (For another comment on McShane’s
analytic proof, see the remark at the end of section 2.)


1.E. Geometric proofs of mathematical propositions may sometimes lead
one astray; compare, for example, the well-known geometric “proof” that every
plane triangle is isosceles. Therefore, no matter how convincing any geometric
proof of Young’s inequality may be, it is, nevertheless, essential to have a valid
analytic proof. However, the construction of a valid analytic proof (see some of
the references quoted above) does not appear, by far, to be as trivial a matter as
the construction of a geometric proof. Be that as it may, it is the purpose of the
present paper to provide a direct analytic proof of Young’s inequality, valid for
continuous functions $\phi$, which is not beset with the difficulties of
Riemann-Stieltjes integration, absolute continuity on every finite interval, etc., which
were mentioned in the preceding discussion. Moreover, the proof given here
lends itself to generalizations of Young’s inequality, as will be developed in full
elsewhere.


2. Analytic Proof of Young’s Inequality. 
The aforementioned analytic derivation of Young’s inequality will now be
presented (f now replaces the $\phi$ of section 1).

THEOREM. Suppose that f is a real-valued function defined for all $0 \le t < \infty$,
and is such that

(i) $f(0) = 0$,
(ii) f is continuous for $0 \le t < \infty$ (from the right, at t = 0),
(iii) f is strictly increasing for $0 \le t < \infty$, and $\lim_{t \to +\infty} f(t) = +\infty$.

(Under these circumstances, f has an inverse function $f^{-1}$, which is defined for
$0 \le t < \infty$, and which obeys (i), (ii), and (iii), with f replaced by $f^{-1}$, of course.)
Then, for any $a \ge 0$, $b \ge 0$, one has

\[ ab \le \int_0^a f(t)\,dt + \int_0^b f^{-1}(t)\,dt, \]

with equality if and only if $b = f(a)$.
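
As a numerical illustration of the statement (not part of the proof), the following sketch approximates the quantity $\int_0^a f + \int_0^b f^{-1} - ab$ by the midpoint rule. The choice $f(t) = t^2$, the helper names, and the step count are assumptions made only for this example.

```python
import math


def midpoint_integral(g, lo, hi, steps=20000):
    """Midpoint-rule approximation of the integral of g over [lo, hi]."""
    if hi == lo:
        return 0.0
    h = (hi - lo) / steps
    return h * sum(g(lo + (k + 0.5) * h) for k in range(steps))


def young_gap(f, f_inv, a, b):
    """Approximate int_0^a f(t) dt + int_0^b f^{-1}(t) dt - a*b."""
    return midpoint_integral(f, 0.0, a) + midpoint_integral(f_inv, 0.0, b) - a * b


def square(t):
    """Illustrative choice f(t) = t^2 (an assumption, not from the text)."""
    return t * t


# The gap should vanish (up to quadrature error) when b = f(a) and be
# positive otherwise.
for b in (1.0, 4.0, 9.0):
    print(b, young_gap(square, math.sqrt, 2.0, b))
```

For this particular f the gap has the closed form $a^3/3 + \tfrac{2}{3} b^{3/2} - ab$, which can be used to cross-check the quadrature.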


Proof. Define, for simplicity in writing, for $0 \le a < \infty$ and $0 \le b < \infty$,

\[ \Delta(a, b) = \int_0^a f(t)\,dt + \int_0^b f^{-1}(t)\,dt - ab. \]

Then, the inequality to be shown is just $0 \le \Delta(a, b)$.
(I). It will first be shown that

\[ \Delta(a, f(a)) \le \Delta(a, b) \]

for $0 \le a < \infty$ and $0 \le b < \infty$, with equality if and only if $b = f(a)$. For any such
a and b, one has

\[ \Delta(a, b) - \Delta(a, f(a)) = \int_{f(a)}^{b} [f^{-1}(t) - a]\,dt = \int_{b}^{f(a)} [a - f^{-1}(t)]\,dt. \]

There are two cases to consider. The first case is $b \ge f(a)$. Here, whenever
$b \ge t \ge f(a)$, one has $f^{-1}(b) \ge f^{-1}(t) \ge f^{-1}[f(a)] = a$. Consequently,

\[ \Delta(a, b) - \Delta(a, f(a)) = \int_{f(a)}^{b} [f^{-1}(t) - a]\,dt \ge 0. \]

Since $f^{-1}(t) - a$ is continuous, and strictly increasing, for $f(a) \le t \le b$, equality will
hold if and only if $b = f(a)$. The second case is $b \le f(a)$. Here, whenever
$b \le t \le f(a)$, one has $f^{-1}(b) \le f^{-1}(t) \le f^{-1}[f(a)] = a$. Consequently,

\[ \Delta(a, b) - \Delta(a, f(a)) = \int_{b}^{f(a)} [a - f^{-1}(t)]\,dt \ge 0. \]

Since $a - f^{-1}(t)$ is continuous, and strictly decreasing, for $b \le t \le f(a)$, equality
will hold if and only if $b = f(a)$.
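(II). Putting, for brevity,

\[ \delta(a) = \Delta(a, f(a)) = \int_0^a f(t)\,dt + \int_0^{f(a)} f^{-1}(t)\,dt - a f(a), \]

for $0 \le a < \infty$, it will now be shown that

\[ \delta(a) = 0 \]

for all $0 \le a < \infty$ (this will be done by showing that the function $\delta$, which is
continuous for $0 \le a < \infty$, possesses a derivative which is zero for all $0 < a < \infty$;
while, obviously, $\delta(0) = 0$). Then, for any $a > 0$, and $-a < \epsilon < \infty$, one has

\[
\begin{aligned}
\delta(a + \epsilon) - \delta(a)
&= \int_a^{a+\epsilon} [f(t) - f(a + \epsilon)]\,dt + \int_{f(a)}^{f(a+\epsilon)} [f^{-1}(t) - a]\,dt \\
&= \int_{a+\epsilon}^{a} [f(a + \epsilon) - f(t)]\,dt + \int_{f(a+\epsilon)}^{f(a)} [a - f^{-1}(t)]\,dt.
\end{aligned}
\]

The cases $0 < \epsilon < \infty$ and $-a < \epsilon < 0$ must be considered separately:
Case (i): $0 < \epsilon < \infty$. In this case, it follows that, since the functions f and $f^{-1}$
are strictly increasing,

\[
\delta(a + \epsilon) - \delta(a) \ge \int_a^{a+\epsilon} [f(a) - f(a + \epsilon)]\,dt
+ \int_{f(a)}^{f(a+\epsilon)} [f^{-1}(f(a)) - a]\,dt = -\epsilon [f(a + \epsilon) - f(a)];
\]

and also that

\[
\delta(a + \epsilon) - \delta(a) \le \int_a^{a+\epsilon} [f(a + \epsilon) - f(a + \epsilon)]\,dt
+ \int_{f(a)}^{f(a+\epsilon)} [f^{-1}(f(a + \epsilon)) - a]\,dt = \epsilon [f(a + \epsilon) - f(a)].
\]

Case (ii): $-a < \epsilon < 0$. In this case, it follows that

\[
\delta(a + \epsilon) - \delta(a) \ge \int_{a+\epsilon}^{a} [f(a + \epsilon) - f(a)]\,dt
+ \int_{f(a+\epsilon)}^{f(a)} [a - f^{-1}(f(a))]\,dt = -\epsilon [f(a + \epsilon) - f(a)];
\]

and also that

\[
\delta(a + \epsilon) - \delta(a) \le \int_{a+\epsilon}^{a} [f(a + \epsilon) - f(a + \epsilon)]\,dt
+ \int_{f(a+\epsilon)}^{f(a)} [a - f^{-1}(f(a + \epsilon))]\,dt = \epsilon [f(a + \epsilon) - f(a)].
\]

Thus, for any $a > 0$, and $-a < \epsilon < \infty$, with $\epsilon \ne 0$, one obtains

\[ -|f(a + \epsilon) - f(a)| \le \frac{\delta(a + \epsilon) - \delta(a)}{\epsilon} \le |f(a + \epsilon) - f(a)|. \]

Since f is continuous at a, it follows that

\[ \lim_{\epsilon \to 0} \frac{\delta(a + \epsilon) - \delta(a)}{\epsilon} = 0 \]

for $0 < a < \infty$. In other words, $\delta'(a) = 0$ for $0 < a < \infty$; while, obviously, $\delta(0) = 0$.
This means that $\delta(a) = 0$ for all $0 \le a < \infty$.
Thus, one has completed the proof that

\[ \Delta(a, b) \ge \Delta(a, f(a)) = 0, \]

with equality if and only if $b = f(a)$, as desired.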
REMARK 1: Part (I) of the preceding proof, that

\[ \Delta(a, f(a)) \le \Delta(a, b), \]

is immediate, for any continuous function f. However, part (II) of the preceding
proof, which involves showing that the derivative of the function

\[ \delta(a) = \Delta(a, f(a)) = \int_0^a f(t)\,dt + \int_0^{f(a)} f^{-1}(t)\,dt - a f(a) \]

exists and is identically zero for $0 < a < \infty$, is not quite so obvious, for a
continuous function f. This is because, for a function f which is merely continuous
(rather than continuously differentiable), it is not possible to differentiate
directly, term by term, using the so-called “Leibniz rule for differentiating definite
integrals with variable limits of integration,” in order to obtain the derivative
$\delta'(a)$. It is for this reason, of being able to differentiate term by term in
obtaining $\delta'(a)$, that McShane [6, pp. 131-132] assumes first that f is continuously
differentiable. Later in his book, McShane [6, p. 216; see also footnote * on p. 132]
indicates that his proof can be extended to functions f which are absolutely
continuous on every finite closed interval. But, since there are strictly increasing
continuous functions which are not absolutely continuous, McShane’s proof does
not cover the case of an arbitrary continuous f. (For an example of such a
function, simply “add x” to the “Cantor function” discussed in McShane [6, pp.
48-49].)

It is also true that one could start with the argument of McShane [6, pp.
131-132] for continuously differentiable f, and seek to extend it to cover the case
of functions f which are merely continuous, by employing an approximation
theorem (for example, Weierstrass’ approximation theorem of continuous
functions by polynomials), which guarantees that any continuous f is the limit of a
sequence of continuously differentiable functions. But, all in all, one has the
feeling that, after everything is thus said and done, the resulting line of
reasoning will certainly not possess the strictly elementary character of the proof given
in this paper.

REMARK 2: It is not strictly necessary to require that $\lim_{t \to +\infty} f(t) = +\infty$, as
is done in hypothesis (iii) of the theorem; this is required here only to simplify
somewhat the writing in the proof. In case $\lim_{t \to +\infty} f(t)$ is not infinite, but
positive, then the inverse function $f^{-1}$ is defined only on the finite interval
$0 \le b < \lim_{t \to +\infty} f(t)$, and the same proof as given above applies, except that, both in the
statement of the theorem and in the proof, the inequality $0 \le b < +\infty$ must be
replaced, wherever it appears, by the clumsier looking inequality
$0 \le b < \lim_{t \to +\infty} f(t)$. Thus, the proof given above, suitably interpreted, covers fully both cases,
that of $\lim_{t \to +\infty} f(t) = +\infty$, and that of $\lim_{t \to +\infty} f(t)$ positive.

REMARK 3: The letters $\Delta$ and $\delta$, employed in the course of the proof, are,
really, when one comes right down to it, entirely superfluous; these letters are
merely introduced in order to simplify the writing of the argument. In order to
make evident the simplification of the writing involved, it is, perhaps, useful to
state explicitly that, in part (I), one is seeking to show that

\[ \int_0^a f(t)\,dt + \int_0^{f(a)} f^{-1}(t)\,dt - a f(a) \le \int_0^a f(t)\,dt + \int_0^b f^{-1}(t)\,dt - ab, \]

for all $a \ge 0$, $b \ge 0$, with equality if and only if $b = f(a)$; whereas, in part (II), one
is seeking to show that

\[ 0 = \int_0^a f(t)\,dt + \int_0^{f(a)} f^{-1}(t)\,dt - a f(a), \]

for all $a \ge 0$.

Furthermore, it is to be noticed that these two parts of the analytic proof
correspond exactly to the two types of geometric pictures which are drawn in the
geometric “proof” of Young’s inequality: Part (I) corresponds to the figure
drawn when $b \ne f(a)$, while part (II) corresponds to the figure drawn when
$b = f(a)$.

ON A RESULT OF LIBRI AND LEBESGUE


K. S. WILLIAMS, Carleton University, Ottawa


1. Introduction. In this note we observe how a simple property of a primitive
nth root of unity provides us with a counting function for the number of
solutions of a congruence $f(x_1, \ldots, x_k) \equiv 0 \pmod{n}$. We illustrate the idea by
taking $f(x_1, \ldots, x_k) = x_1^l + \cdots + x_k^l$ and a prime $n = p \equiv 1 \pmod{l}$. We are led
naturally to the q-nomial periods of the pth roots of unity where $q = (p-1)/l$
[2]. We express the number of solutions of $x_1^l + \cdots + x_k^l \equiv 0 \pmod{p}$ in terms of
these periods, rediscovering an old result due to Libri and Lebesgue [1]. An
alternative form of this result is also proved which provides a generalization of
one due to the author when l = 3 (see [4]). The formula of Libri and Lebesgue
has been generalized by Weil [3]. The material in this note is not new but we
hope that perhaps the presentation is.


2. Two properties of $\omega(n)$. For any integer $n \ge 2$ let $\omega(n) = \exp(2\pi i/n)$. A
well-known property possessed by $\omega(n)$ is the following:

LEMMA 1. If m is an integer, then

\[
\sum_{r=0}^{n-1} \{\omega(n)\}^{mr} =
\begin{cases}
n, & \text{if } m \equiv 0 \pmod{n}, \\
0, & \text{if } m \not\equiv 0 \pmod{n}.
\end{cases}
\]

Proof. The left-hand side is just a geometric progression.
This property of $\omega(n)$ guarantees that any complex-valued function f(m)
(m an integer) which is periodic with period n has a finite Fourier series.

LEMMA 2. If f(m) is periodic in m with period n, then

\[ f(m) = \sum_{r=0}^{n-1} a(r) \{\omega(n)\}^{mr}, \]

where $a(r) = (1/n) \sum_{s=0}^{n-1} f(s) \{\omega(n)\}^{-rs}$.

Proof. We have, using Lemma 1,

\[
\sum_{r=0}^{n-1} a(r) \{\omega(n)\}^{mr}
= \frac{1}{n} \sum_{r=0}^{n-1} \sum_{s=0}^{n-1} f(s) \{\omega(n)\}^{(m-s)r}
= \frac{1}{n} \sum_{s=0}^{n-1} f(s) \sum_{r=0}^{n-1} \{\omega(n)\}^{(m-s)r} = f(m).
\]

3. Counting function. Lemma 1 provides us with a counting function for
congruences modulo n, for if $f(x_1, \ldots, x_k)$ is a polynomial with integral
coefficients, then the number of solutions $(x_1, \ldots, x_k)$ of $f(x_1, \ldots, x_k) \equiv 0 \pmod{n}$
satisfying $0 \le x_i < n$ is given by

\[
\sum_{x_1, \ldots, x_k = 0}^{n-1} \left\{ \frac{1}{n} \sum_{r=0}^{n-1} \{\omega(n)\}^{r f(x_1, \ldots, x_k)} \right\}
= \frac{1}{n} \sum_{r=0}^{n-1} \sum_{x_1, \ldots, x_k = 0}^{n-1} \{\omega(n)\}^{r f(x_1, \ldots, x_k)}.
\]

This can be simplified if $f(x_1, \ldots, x_k)$ is separable in the variables $x_1, \ldots, x_k$.
We consider an application where this is so.
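
The counting function of Section 3 can be tried out directly. The sketch below, in which the function names and the sample polynomial $x^2 + y^2$ modulo 5 are illustrative assumptions, evaluates the root-of-unity sum in floating point and compares it with a brute-force count.

```python
import cmath
from itertools import product


def count_by_roots_of_unity(f, n, k):
    """Count solutions of f(x_1, ..., x_k) = 0 (mod n) with Lemma 1 as an indicator."""
    omega = cmath.exp(2j * cmath.pi / n)
    total = 0j
    for xs in product(range(n), repeat=k):
        value = f(*xs) % n
        # (1/n) * sum_r omega^(r * value) is 1 if value = 0 (mod n), else 0.
        total += sum(omega ** (r * value) for r in range(n)) / n
    return round(total.real)


def count_by_brute_force(f, n, k):
    """Direct enumeration of the solutions, for comparison."""
    return sum(1 for xs in product(range(n), repeat=k) if f(*xs) % n == 0)


def sum_of_squares(x, y):
    """Sample polynomial (an assumption of this example)."""
    return x * x + y * y


print(count_by_roots_of_unity(sum_of_squares, 5, 2),
      count_by_brute_force(sum_of_squares, 5, 2))
```

The floating-point sum is rounded at the end; for larger n or k an exact approach would be preferable.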
4. Application to $x_1^l + \cdots + x_k^l$. We take $f(x_1, \ldots, x_k) = x_1^l + \cdots + x_k^l$ and
a prime $n = p \equiv 1 \pmod{l}$, and use the law of exponents: $\omega^{a+b} = \omega^a \omega^b$. Then the
number $N_p(k, l)$ of solutions $(x_1, \ldots, x_k)$ of $x_1^l + \cdots + x_k^l \equiv 0 \pmod{p}$ is given
by

\[ N_p(k, l) = \frac{1}{p} \sum_{r=0}^{p-1} \sum_{x_1, \ldots, x_k = 0}^{p-1} \{\omega(p)\}^{r(x_1^l + \cdots + x_k^l)}. \]

Let us write $S_p(l, r) = \sum_{x=0}^{p-1} \{\omega(p)\}^{r x^l}$. We note that $S_p(l, r)$ is periodic in r
with period p, and

\[ S_p(l, 0) = \sum_{x=0}^{p-1} 1 = p, \]

so that $p N_p(k, l) - p^k = \sum_{r=1}^{p-1} \{S_p(l, r)\}^k$. In the summation, r takes on the
values $1, 2, \ldots, p-1$. These are $g^0, g^1, g^2, \ldots, g^{p-2}$ (taken modulo p) in some
order, where g is a primitive root modulo p. As $S_p(l, r)$ is periodic with period p,
we have

\[ p N_p(k, l) - p^k = \sum_{s=0}^{p-2} \{S_p(l, g^s)\}^k. \tag{4.1} \]

We next show that $S_p(l, g^s)$ is periodic in s with period l.

LEMMA 3. For all integers s, $S_p(l, g^s) = S_p(l, g^{s+l})$.

Proof. We have

\[ S_p(l, g^{s+l}) = \sum_{x=0}^{p-1} \{\omega(p)\}^{g^{s+l} x^l} = \sum_{x=0}^{p-1} \{\omega(p)\}^{g^s (gx)^l}. \]

Now the mapping $x \to g^{-1}x$ (so that $gx \to x$) taken modulo p is a bijection on
$\{0, 1, \ldots, p-1\}$, so that

\[ \sum_{x=0}^{p-1} \{\omega(p)\}^{g^s (gx)^l} = \sum_{x=0}^{p-1} \{\omega(p)\}^{g^s x^l}, \]

that is, $S_p(l, g^{s+l}) = S_p(l, g^s)$ as required.
As $p - 1 \equiv 0 \pmod{l}$ this periodicity implies

\[ \sum_{s=0}^{p-2} \{S_p(l, g^s)\}^k = q \sum_{s=0}^{l-1} \{S_p(l, g^s)\}^k, \]

so that (4.1) becomes

\[ p N_p(k, l) - p^k = q \sum_{s=0}^{l-1} \{S_p(l, g^s)\}^k. \]

Now let us examine $S_p(l, g^s)$ (for $s = 0, 1, \ldots, l-1$). We have

\[
\begin{aligned}
S_p(l, g^s) &= \sum_{x=0}^{p-1} \{\omega(p)\}^{g^s x^l} = 1 + \sum_{x=1}^{p-1} \{\omega(p)\}^{g^s x^l} \\
&= 1 + \sum_{t=0}^{p-2} \{\omega(p)\}^{g^{s+lt}} = 1 + \sum_{r=0}^{l-1} \sum_{u=0}^{q-1} \{\omega(p)\}^{g^{s+l(u+qr)}}.
\end{aligned}
\]

But $g^{p-1} \equiv 1 \pmod{p}$, so we have

\[ S_p(l, g^s) = 1 + \sum_{r=0}^{l-1} \sum_{u=0}^{q-1} \{\omega(p)\}^{g^{s+lu}} = 1 + l \sum_{u=0}^{q-1} \{\omega(p)\}^{g^{s+lu}}. \]

The expressions $\sum_{u=0}^{q-1} \{\omega(p)\}^{g^{s+lu}}$ are called the q-nomial periods of the pth roots
of unity [2]. We write

\[ \eta_s = \sum_{u=0}^{q-1} \{\omega(p)\}^{g^{s+lu}} \]

so that we have the result of Libri and Lebesgue [1]:

THEOREM 1. The number $N_p(k, l)$ of solutions of $x_1^l + \cdots + x_k^l \equiv 0 \pmod{p}$ is
given by $N_p(k, l) = p^{k-1} + (q/p) \sum_{s=0}^{l-1} \{1 + l\eta_s\}^k$.
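
Theorem 1 lends itself to a quick numerical check. The following sketch, whose function names and brute-force primitive-root search are assumptions made for illustration, computes the periods in floating point and compares the right-hand side of Theorem 1 with a direct count of solutions.

```python
import cmath
from itertools import product


def primitive_root(p):
    """Smallest primitive root modulo the odd prime p (brute-force search)."""
    for g in range(2, p):
        if len({pow(g, e, p) for e in range(p - 1)}) == p - 1:
            return g
    raise ValueError("no primitive root found")


def count_via_periods(p, l, k):
    """Right-hand side of Theorem 1: p^(k-1) + (q/p) * sum_s (1 + l*eta_s)^k."""
    if (p - 1) % l != 0:
        raise ValueError("Theorem 1 assumes p = 1 (mod l)")
    q = (p - 1) // l
    g = primitive_root(p)
    omega = cmath.exp(2j * cmath.pi / p)
    total = 0j
    for s in range(l):
        # q-nomial period eta_s = sum over u of omega^(g^(s + l*u)).
        eta_s = sum(omega ** pow(g, s + l * u, p) for u in range(q))
        total += (1 + l * eta_s) ** k
    return p ** (k - 1) + (q / p) * total


def count_by_brute_force(p, l, k):
    """Number of (x_1, ..., x_k) mod p with x_1^l + ... + x_k^l = 0 (mod p)."""
    return sum(
        1
        for xs in product(range(p), repeat=k)
        if sum(pow(x, l, p) for x in xs) % p == 0
    )


print(count_by_brute_force(7, 3, 3), count_via_periods(7, 3, 3))
```

The value returned by `count_via_periods` is a complex floating-point number whose imaginary part should be negligible; only its real part is meaningful.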

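5. An alternative expression for $N_p(k, l)$. We can apply Lemma 2 to $S_p(l, g^s)$
(as it is periodic in s with period l) to obtain a different expression for $S_p(l, g^s)$
and thus a different expression for $N_p(k, l)$. By Lemma 2 we have
$S_p(l, g^s) = \sum_{r=0}^{l-1} a(r) \{\omega(l)\}^{rs}$, where

\[
\begin{aligned}
a(r) &= \frac{1}{l} \sum_{s=0}^{l-1} S_p(l, g^s) \{\omega(l)\}^{-rs} \\
&= \frac{1}{l} \sum_{s=0}^{l-1} \left\{ 1 + l \sum_{u=0}^{q-1} \{\omega(p)\}^{g^{s+lu}} \right\} \{\omega(l)\}^{-rs} \\
&= \frac{1}{l} \sum_{s=0}^{l-1} \{\omega(l)\}^{-rs} + \sum_{s=0}^{l-1} \sum_{u=0}^{q-1} \{\omega(p)\}^{g^{s+lu}} \{\omega(l)\}^{-rs} \\
&= \frac{1}{l} \sum_{s=0}^{l-1} \{\omega(l)\}^{-rs} + \sum_{t=0}^{p-2} \{\omega(p)\}^{g^t} \{\omega(l)\}^{-rt} \\
&= \begin{cases}
1 + \sum_{t=0}^{p-2} \{\omega(p)\}^{g^t}, & r = 0, \\
\sum_{t=0}^{p-2} \{\omega(p)\}^{g^t} \{\omega(l)\}^{-rt}, & r = 1, 2, \ldots, l-1,
\end{cases} \\
&= \begin{cases}
0, & r = 0, \\
\sum_{t=0}^{p-2} \{\omega(p)\}^{g^t} \{\omega(l)\}^{-rt}, & r \ne 0.
\end{cases}
\end{aligned}
\]

Writing $\tau_r = \sum_{t=0}^{p-2} \{\omega(p)\}^{g^t} \{\omega(l)\}^{-rt}$, where r is any integer, we have

\[ S_p(l, g^s) = \sum_{r=1}^{l-1} \{\omega(l)\}^{rs} \tau_r, \]

giving the following theorem:

THEOREM 2. The number $N_p(k, l)$ of solutions of $x_1^l + \cdots + x_k^l \equiv 0 \pmod{p}$
is given by

\[ N_p(k, l) = p^{k-1} + \frac{q}{p} \sum_{s=0}^{l-1} \left\{ \sum_{r=1}^{l-1} \{\omega(l)\}^{rs} \tau_r \right\}^k. \]

This generalizes a result of the author [4] when l = 3, viz.,

\[ N_p(k, 3) = p^{k-1} + [(p-1)/3p] [(\tau_1 + \tau_2)^k + (\omega\tau_1 + \omega^2\tau_2)^k + (\omega^2\tau_1 + \omega\tau_2)^k], \]

where $\omega = \omega(3) = \tfrac{1}{2}(-1 + \sqrt{-3})$.

LIMIT POINTS OF SEQUENCES IN METRIC SPACES
M. D. AŠIĆ AND D. D. ADAMOVIĆ, University of Belgrade, Yugoslavia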


The aim of this paper is to generalize both statements of the following
theorem [1]:

THEOREM A. Let $C(\xi)$ be the set of limit points of the bounded complex sequence
$\xi$. Then $C(\xi)$ is connected if and only if there exists a subsequence $\eta = (y_n)$ of $\xi$ such
that $C(\eta) = C(\xi)$ and $y_{n+1} - y_n \to 0$ $(n \to \infty)$.

The “if part” of this theorem is due to H. G. Barone [2], and the “only if
part” to Paul Schaeffer [1].
A generalization of the first statement is


THEOREM 1. Let X be a compact metric space, $\xi = (x_n)$ a sequence of its points,
and $C(\xi)$ the set of limit points of $\xi$. Assume

\[ \lim_{n \to \infty} d(x_n, x_{n+1}) = 0. \tag{1} \]

Then $C(\xi)$ is a connected set.

Proof. Suppose that $C(\xi)$ is not connected, i.e., that

\[ C(\xi) = A \cup B \quad \text{and} \quad A \cap B = \emptyset, \tag{2} \]

with A and B nonempty closed sets. Put

\[ \epsilon = \inf\{d(t, u) : t \in A,\ u \in B\}. \tag{3} \]

Since the space X is compact, there exist $a \in A$ and $b \in B$ such that $d(a, b) = \epsilon$.
In view of the second condition of (2), we have $\epsilon > 0$.

The sets

\[ O_1 = \bigcup_{t \in A} K(t, \epsilon/3) \quad \text{and} \quad O_2 = \bigcup_{t \in B} K(t, \epsilon/3), \tag{4} \]

where $K(t, \delta)$ is the open sphere whose center is $t \in X$ and radius is $\delta$, are then
disjoint. Owing to the compactness of X, the set $O_1 \cup O_2$ contains all members of
the sequence $\xi$ with large enough indices. This last fact together with (1) implies
that there is a natural number $n_0$ such that

\[ x_n \in O_1 \cup O_2 \quad (n \ge n_0) \tag{5} \]

and

\[ d(x_n, x_{n+1}) < \epsilon/3 \quad (n \ge n_0). \tag{6} \]

Since A and B are nonempty sets of limit points of the sequence $\xi$, upon taking
into account (4) and (5), we see there exists a natural number $m \ge n_0$ such that
$x_m \in O_1$ and $x_{m+1} \in O_2$, and so there exist $t \in A$ and $u \in B$ such that

\[ d(t, x_m) < \epsilon/3, \quad d(u, x_{m+1}) < \epsilon/3. \tag{7} \]

Then, by (6) and (7),

\[ d(t, u) \le d(t, x_m) + d(x_m, x_{m+1}) + d(x_{m+1}, u) < \epsilon, \]

which contradicts the choice of $\epsilon$, and thus proves the theorem.
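
A small numerical experiment can make Theorem 1 more concrete, although it proves nothing. The sequence $x_n = (1 + \sin\sqrt{n})/2$ in the compact space [0, 1] is an illustrative choice, not taken from the text; its consecutive distances tend to zero, and the sketch below checks that a stretch of the tail visits every cell of a fine grid on [0, 1], as one would expect if the set of limit points is the whole (connected) interval.

```python
import math


def x(n):
    """Term of an illustrative sequence in the compact space [0, 1]."""
    return (1.0 + math.sin(math.sqrt(n))) / 2.0


def tail_statistics(start, stop, cells=20):
    """Largest consecutive step and the set of grid cells visited by x(start..stop)."""
    largest_step = max(abs(x(n + 1) - x(n)) for n in range(start, stop))
    visited = {min(int(x(n) * cells), cells - 1) for n in range(start, stop + 1)}
    return largest_step, visited


# sqrt(n) runs over more than one full period of sin on this range.
STEP, VISITED = tail_statistics(10_000, 11_400)
print(STEP, len(VISITED))
```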

For the proof of our second theorem, which generalizes the second statement
of Theorem A, we shall need the following lemma (see [3], pp. 169-170, T, (a),
where the supposition of compactness of X is not necessary).

LEMMA. Let E be a connected part of a metric space. Then for any two elements
t and u of E and for every $\epsilon > 0$ there is a finite number of elements $t_0, t_1, \ldots, t_k$ of
E, such that $t_0 = t$, $t_k = u$ and $d(t_{\nu-1}, t_\nu) < \epsilon$ $(\nu = 1, \ldots, k)$.

THEOREM 2. Let X be a metric space and $\xi = (x_n)$ a sequence of its points, for
which $C(\xi)$ is non-empty and connected. Then there exists a subsequence $\eta = (y_n)$ of
the sequence $\xi$ with the properties $\lim_{n \to \infty} d(y_n, y_{n+1}) = 0$ and $C(\eta) = C(\xi)$.

Proof. For each $\delta > 0$, there exists an (at most) countable cover $\Sigma$ of the
set $C(\xi)$ which consists of spheres of diameter $\delta$ whose intersections with $C(\xi)$ are
not empty. Namely, $\Sigma$ can be formed from all those spheres with centers in
$x_n$ $(n = 1, 2, 3, \ldots)$ and with diameter $\delta$ which have nonempty intersections with
$C(\xi)$. If $\Sigma_\nu$ denotes such a collection of spheres for $\delta = \nu^{-1}$ $(\nu = 1, 2, 3, \ldots)$, the
collection of spheres $\Sigma_0 = \bigcup_{\nu=1}^{\infty} \Sigma_\nu$ is countable, so that we can write
$\Sigma_0 = \{K_1, K_2, K_3, \ldots\}$. Besides, for each t in $C(\xi)$ there exists a sphere from $\Sigma_0$
with an arbitrarily small diameter which contains t. Let $t_i \in K_i \cap C(\xi)$ $(i = 1, 2,
3, \ldots)$. According to the lemma, there exist positive integers $1 = k_1 < k_2 < \cdots
< k_i < \cdots$ and points $u_n \in C(\xi)$ $(n = 1, 2, 3, \ldots)$ such that

\[ u_{k_i} = t_i \ (i = 1, 2, 3, \ldots); \quad d(u_{n-1}, u_n) < i^{-1} \ (k_{i-1} < n \le k_i;\ i = 2, 3, \ldots). \]

It is clear that

\[ \lim_{n \to \infty} d(u_n, u_{n+1}) = 0. \tag{8} \]

Furthermore, let $p_1$ be the smallest positive integer such that $x_{p_1} \in K_1$
and $d(x_{p_1}, u_1) < 1$; if the natural numbers $p_\nu$ $(\nu = 1, \ldots, n-1)$ are determined,
then let $p_n$ be the smallest natural number with the properties:

\[ p_n > p_{n-1}; \quad d(x_{p_n}, u_n) < n^{-1}; \quad \text{if } n = k_i, \text{ then } x_{p_n} \in K_i. \]

According to the construction of the sequence $(u_n)$, an infinite and strictly
monotonic sequence of natural numbers $(p_n)$ has been defined by the above procedure.
The sequence $\eta = (y_n)$, where $y_n = x_{p_n}$ $(n = 1, 2, 3, \ldots)$, has, therefore, members
with arbitrarily large indices in each sphere $K_i$ $(i = 1, 2, 3, \ldots)$. Therefore,
$C(\eta) \supset C(\xi)$, and since the opposite inclusion clearly holds, we have $C(\eta) = C(\xi)$.
On the other hand, according to (8),

\[
\begin{aligned}
d(y_n, y_{n+1}) = d(x_{p_n}, x_{p_{n+1}}) &\le d(x_{p_n}, u_n) + d(u_n, u_{n+1}) + d(u_{n+1}, x_{p_{n+1}}) \\
&< n^{-1} + d(u_n, u_{n+1}) + (n+1)^{-1} \to 0 \quad (n \to \infty).
\end{aligned}
\]


A few additional remarks concerning Theorem 1.

1. Compactness of the space X cannot be simply omitted from Theorem 1.
This can be proved by choosing a suitable sequence $\xi$ in the two-dimensional
Euclidean space $X = R^2$ (for example, such that $C(\xi)$ is the union of a curve and
its asymptote).

2. On the other hand, compactness of the space X is not necessary for the
holding of the implication

\[ \lim_{n \to \infty} d(x_n, x_{n+1}) = 0 \implies C(\xi) \text{ is a connected set}, \tag{9} \]

which is proved by the example of the one-dimensional Euclidean space X = R.

3. Another example for the same statement is a metric space X with a
non-denumerable amount of points in which $d(x, y) = 1$ $(x \ne y)$.

4. In view of the above, we can raise the question of replacing the compactness
of X in Theorem 1 by a weaker condition. Such a condition certainly could
not be $\sigma$-compactness (the property of a space to be represented as a
denumerable union of compact parts). Moreover, according to Remarks 1 and 3,
$\sigma$-compactness of the space X is neither a necessary nor a sufficient condition for
implication (9) to hold.

RESEARCH PROBLEMS
EDITED BY VICTOR KLEE 


In this Department the Monthly presents easily stated research problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied 
by relevant references (if any are known to the author) and by a brief description of known 
partial results. Material should be sent to Victor Klee, Department of Mathematics, University
of Washington, Seattle, WA 98105. 


MUST A COMPACT ENDSET HAVE MEASURE ZERO? 
VICTOR KLEE AND MICHAEL MARTIN, University of Washington


Statement of the problem. For any collection $\mathcal{S}$ of line segments in Euclidean
d-space $E^d$, let $P(\mathcal{S})$ denote the set of all endpoints of the members of $\mathcal{S}$. Any
set of two or more points has the form $P(\mathcal{S})$ for some $\mathcal{S}$, but a subset M of $E^d$ is
here called an endset if and only if $M = P(\mathcal{S})$ for some collection $\mathcal{S}$ of pairwise
disjoint segments. In the real line $E^1$ any endset is countable. Intuition suggests
that the d-dimensional Lebesgue measure of any compact endset in $E^d$ is zero.
When $d \le 2$ this follows from an argument in [3, 3.3]. However, for d = 4 (and
hence also for all d > 4) Bruckner and Ceder [1] have produced a counterexample
by using Nikodym’s construction [5] of a Cantor set X of positive measure in
$E^2$ such that for each point x of X there is a line in $E^2$ intersecting X only at
x. Hence the answer to the title question is affirmative for $d \le 2$, unknown for
d = 3, and negative for $d \ge 4$.


Origin of the problem. The problem originated in a study [3] of the facial
structure of convex bodies. For any convex set C in $E^d$ let I(C) denote the
interior of C relative to the smallest flat containing C. Now suppose that B is a
convex body in $E^d$, X is the boundary of B, and $X_1$ is the union of the sets I(C)
as C ranges over all maximal convex subsets of X. For example, if B is a polytope,
$X_1$ consists of the entire boundary except for points on (d-2)-dimensional faces
of B. It seems probable that, as surely happens when B is a polytope, $X_1$ is
almost all of X in the sense that the (d-1)-dimensional measure of the set $X \sim X_1$
is equal to zero. For $d \le 3$ this is proved in [3, 3.4] by using the fact that in $E^1$
and $E^2$, compact endsets have measure zero. It may be that the conjecture
concerning convex bodies is correct for all d even though (in view of the
Bruckner-Ceder example) this method of proof is inapplicable for $d \ge 5$.


An equivalent question. There is an endset $M_f$ in $E^d$ associated in a natural
way with any real-valued function f whose domain is a subset $D_f$ of $E^{d-1}$. The
endset $M_f$ is the union of the graph of f and the graph of the function f+1; that
is,

\[ M_f = \{(x, f(x)) : x \in D_f\} \cup \{(x, f(x) + 1) : x \in D_f\} \subset E^{d-1} \times E^1 = E^d. \]

As the various sets $M_{f+\tau}$, for $0 < \tau < 1$, are pairwise disjoint and all
translation-equivalent to $M_f$, it follows that either $M_f$ is nonmeasurable (a possibility that
can be realized in various ways; see, for example, [2, 21.27]) or the
d-dimensional Lebesgue measure of $M_f$ is zero. A similar argument shows the measure
of $M = P(\mathcal{S})$ is zero whenever M is measurable and $\mathcal{S}$ is a collection of pairwise
disjoint parallel segments. As is explained in the next paragraph, the title
question amounts to asking whether almost parallel is as good as parallel in this
context.

For each number $\eta \in\, ]0, 1/3[$ define a $(d, \eta)$-endset as a compact set of the
form $P(\mathcal{S})$, where $\mathcal{S}$ is a collection of pairwise disjoint segments
having one endpoint within $\eta$ of the origin $(0, \dots, 0, 0)$ in $E^d$ and the other
endpoint within $\eta$ of the point $(1, 0, \dots, 0)$. The segments in such a collection
need not be parallel, but they are very nearly so, especially when $\eta$ is close to zero
(see Figure 1).


FIG. 1

Using standard and elementary techniques of measure theory, it is easy to derive
(a) any measurable endset in $E^d$ is of measure zero
from
(b) there exists $\eta_d > 0$ such that any $(d, \eta_d)$-endset is of measure zero.
For d = 2 the truth of (b) follows from an argument in [3, 3.3]. The proof
depends on the fact that if $P(\mathcal{S})$ is a $(2, \eta)$-endset for small enough $\eta$,
and x and y are the left endpoints of two members of $\mathcal{S}$, then


$$\|x_\epsilon - y_\epsilon\| \ge \|x - y\|/2$$


for all $\epsilon \in\, ]0, 1[$, where $x_\epsilon$ and $y_\epsilon$ are obtained from x and y
by moving these points a distance $\epsilon$ toward the corresponding right endpoints (see
Figure 1). It follows that the measure of the set of all left endpoints is not much reduced by
the $\epsilon$-motion, and an argument similar to the one above (involving an uncountable
collection of pairwise disjoint sets) can be applied to show that $P(\mathcal{S})$ has
measure zero. However, this does not work when $d \ge 3$, for then the two segments
may nearly cross and $\|x_\epsilon - y_\epsilon\|$ may be much smaller than $\|x - y\|$ (Figure 2).


FIG. 2


An example and a problem in $E^2$. Let X be a Cantor set in $E^1$ such that
the 1-dimensional measure of X is positive. Then of course the 2-dimensional
measure of $X \times X$ is positive. Further, $X \times X$ is contained in a simple closed
curve J [4, p. 540] and there is a homeomorphism of $E^2$ onto itself which carries
J onto an ordinary circle K [4, p. 173]. As K is plainly the set of all endpoints of
a collection of pairwise disjoint topological arcs, the same is true of J, even
though the 2-dimensional measure of J is positive. It would be of interest, for a
collection $\mathcal{S}$ of pairwise disjoint topological arcs, to find geometric conditions
that are weaker than straightness and yet guarantee that the set of all endpoints
has measure zero if it is measurable.

SOME PROBLEMS IN ADDITIVE NUMBER THEORY
P. ERDŐS, Israel Institute of Technology, Haifa
1. Let $1 \le a_1 < \cdots < a_k \le x$ be a sequence of integers for which the sums

$$\sum_{i=1}^{k} \epsilon_i a_i, \qquad \epsilon_i = 0 \text{ or } 1, \tag{1}$$

are all distinct. Put max k = f(x) where the maximum is taken over all sequences
satisfying (1). It is easy to see that

$$f(x) < \frac{\log x}{\log 2} + \frac{\log\log x}{\log 2} + O(1),$$

and Moser and I proved

$$f(x) < \frac{\log x}{\log 2} + \frac{\log\log x}{2\log 2} + O(1). \tag{2}$$

This is the best-known upper bound for f(x).
Is it true that

$$f(x) = \frac{\log x}{\log 2} + O(1)? \tag{3}$$

Moser and I asked: Is it true that $f(2^k) \ge k+2$ for sufficiently large k? Conway
and Guy showed that the answer is affirmative (unpublished).
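
The defining condition of Problem 1 is easy to test by brute force for small sets. The following is only an illustrative sketch: the function name `has_distinct_subset_sums` and the sample sets are assumptions chosen for the example, not taken from the text. It checks that the five integers 6, 9, 11, 12, 13 (all below $2^4$) have distinct subset sums, so that $f(2^4) \ge 5$.

```python
from itertools import combinations

def has_distinct_subset_sums(a):
    """Return True if all 2^k sums of e_i * a_i (e_i = 0 or 1) are distinct."""
    seen = set()
    for size in range(len(a) + 1):
        for subset in combinations(a, size):
            s = sum(subset)
            if s in seen:          # two different subsets share a sum
                return False
            seen.add(s)
    return True

# Hypothetical sample sets, chosen only for illustration.
print(has_distinct_subset_sums([6, 9, 11, 12, 13]))  # five terms, all <= 13
print(has_distinct_subset_sums([1, 2, 3]))           # 1 + 2 = 3 collides
```

The search is exponential in the number of terms, so it is suitable only for experimenting with short sequences.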

P. Erdős, Problems and results in additive number theory, Colloque, Théorie
des Nombres, Bruxelles 1955, p. 137.


2. Let $1 \le a_1 < \cdots < a_k \le x$ be a sequence of integers so that all the sums

$$a_{i_1} + \cdots + a_{i_s}, \qquad i_1 \le \cdots \le i_s, \quad 1 \le s \le r,$$

are distinct. Put max k = $g_r(x)$. Turán and I proved

$$g_2(x) < x^{1/2} + O(x^{1/4}). \tag{4}$$

This was recently improved by Lindström to $g_2(x) \le x^{1/2} + x^{1/4} + 1$. The lower
bound $g_2(x) \ge (1+o(1))x^{1/2}$ easily follows from a classical result of Singer on
difference sets. Turán and I conjectured

$$g_2(x) = x^{1/2} + O(1). \tag{5}$$

Bose and Chowla proved that $g_r(x) \ge (1+o(1))x^{1/r}$ for each $r \ge 2$, and they
conjectured

$$g_r(x) = (1 + o(1))x^{1/r}. \tag{6}$$

This is known only for r = 2.
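
For experimentation, the distinct-sums condition of Problem 2 can be checked mechanically. The sketch below makes a simplifying assumption that should be stated loudly: it tests the usual $B_r$ property (all sums of exactly r terms with nondecreasing indices are distinct), which is the form relevant to the bounds on $g_2(x)$ quoted above. The names `is_B_r` and `greedy_B2` are hypothetical, and the greedy construction is only a convenient way to produce examples; it is not the construction of Singer or of Bose and Chowla.

```python
from itertools import combinations_with_replacement

def is_B_r(a, r):
    """True if all sums a_{i1} + ... + a_{ir} with i1 <= ... <= ir are distinct."""
    sums = [sum(c) for c in combinations_with_replacement(sorted(a), r)]
    return len(sums) == len(set(sums))

def greedy_B2(x):
    """Greedily pick integers 1..x, keeping each one that preserves the B_2 property."""
    seq = []
    for n in range(1, x + 1):
        if is_B_r(seq + [n], 2):
            seq.append(n)
    return seq

seq = greedy_B2(50)
print(seq, len(seq))
# Compare the length with Lindstrom's upper bound x^(1/2) + x^(1/4) + 1.
print(50 ** 0.5 + 50 ** 0.25 + 1)
```

The greedy sequence is much shorter than the conjectured maximum for large x; it only shows how the property can be verified.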

R. C. Bose and S. Chowla, Theorems in the additive theory of numbers, Comment.
Math. Helv., 37 (1962-63) 141-147.

H. Halberstam and K. Roth, Sequences, Oxford Univ. Press, Oxford 1966.

P. Erdős and P. Turán, On a problem of Sidon in additive number theory
and on some related problems, Journal London Math. Soc., 16 (1941) 212-215
and 19 (1944) 208.

J. Singer, A theorem in finite projective geometry and some applications to
number theory, Trans. Amer. Math. Soc., 43 (1938) 377-385.

B. Lindström, An inequality for $B_2$-sequences, Journal Combinatorial Theory,
6 (1969) 211-212.


3. Let $1 \le a_1 < \cdots$ be an infinite sequence of integers. Denote by h(n) the
number of solutions of $n = a_i + a_j$. Turán and I conjectured that if h(n) > 0 for
$n > n_0$ then

$$\limsup h(n) = \infty. \tag{7}$$

Another stronger conjecture states that if $a_k < ck^2$ for every k then (7) holds.

These conjectures seem very difficult. It is a curious fact that the multiplicative
analogues of these conjectures are not intractable.

Let $1 < b_1 < \cdots$ be an infinite sequence of integers. Denote by H(n) the
number of solutions of $n = b_i b_j$. Assume H(n) > 0 for $n > n_0$. I proved

$$\limsup H(n) = \infty.$$

Turán and I further conjectured that the number of solutions of $a_i + a_j \le x$
cannot be of the form cx + O(1), where $c < \infty$. In other words

$$\sum_{n=1}^{x} h(n) = cx + O(1) \tag{8}$$

can hold only if c = 0 and the sequence $a_1 < \cdots$ is finite. Fuchs and I proved
a very much stronger result than (8). We in fact showed that if c > 0 then

$$\sum_{n=1}^{x} h(n) = cx + o\left(\frac{x^{1/4}}{(\log x)^{1/2}}\right) \tag{9}$$

is impossible. Jurkat showed (unpublished) that (9) is impossible even with
$o(x^{1/4})$. Perhaps Jurkat's result is best possible and

$$\sum_{n=1}^{x} h(n) = cx + O(x^{1/4})$$

can hold for a suitable sequence $a_1 < \cdots$.
Is it true that the number of solutions of $a_i + a_j + a_k \le x$ cannot be of the form
cx + O(1)? Our method used with Fuchs does not apply here.
I have several times offered $250 for the proof or disproof of any of the 
conjectures (3), (5), or (7). 
Paul Erdős, On extremal problems of graphs and generalised graphs, Israel
Journal of Math., 2 (1965) 183-190, and On the multiplicative representation of
integers, ibid., 2 (1965) 251-261.

CLASSROOM NOTES


EDITED BY DAVID DRASIN 


Manuscripts for this Department should be sent to David Drasin, Division of Mathematical 
Sciences, Purdue University, Lafayette, IN 47907. 


AN EXISTENCE THEOREM FOR NON-NOETHERIAN RINGS 
ROBERT GILMER, Florida State University


In developing the theory of Noetherian rings, it is desirable to have at hand 
some examples of non-Noetherian rings. One such example is the ring of polynomials
in infinitely many indeterminates over any nonzero ring. A second example
is R[X], where R is any nonzero commutative ring without identity [1],
but in this case, R[X] is also a ring without identity. We present here a theorem
which provides a method for constructing, as subrings of a polynomial ring in 
finitely many indeterminates, a wide class of commutative rings with identity 
which are not Noetherian. It should be noted that the proof of the theorem given 
uses only one result (Lemma 1) outside the basic theory of Noetherian rings, 
namely, that a finitely generated idempotent ideal of a commutative ring is 
principal and is generated by an idempotent element. An examination of the proof 
of this result reveals that even it is obtained as a direct application of Cramer’s 
Rule for determinants over a commutative ring with identity. 


THEOREM 1. Suppose that R is a nonzero commutative ring, that A is a nonzero
ideal of R distinct from R, and that $\{X_\lambda\}_{\lambda \in \Lambda}$ is a set of
indeterminates over R. The subring $S = R + A[\{X_\lambda\}]$ of $R[\{X_\lambda\}]$,
consisting of those polynomials over R having each of their nonconstant coefficients in A,
is Noetherian if and only if these three conditions hold: (1) $\Lambda$ is finite, (2) R is
Noetherian, and (3) the ideal A is idempotent.


Proof. Suppose that S is Noetherian. It is clear that $\Lambda$ must be finite, for if
not, the ideal $A[\{X_\lambda\}]$ of S would not be finitely generated. The mapping on S
which sends each polynomial $f \in S$ onto its constant term is a homomorphism
from S onto R, so that R is Noetherian. If $\sigma$ is a fixed element of $\Lambda$ and if
$a \in A$, the ideal $(aX_\sigma, aX_\sigma^2, \dots, aX_\sigma^n, \dots)$ of S is finitely
generated, so that $aX_\sigma^{m+1} \in (aX_\sigma, \dots, aX_\sigma^m)$ for some positive
integer m:

$$aX_\sigma^{m+1} = \sum_{i=1}^{m} (s_i aX_\sigma^i + n_i aX_\sigma^i)$$

for some $s_i \in S$, $n_i \in Z$. Equating coefficients of $X_\sigma^{m+1}$ from each side of
this equality, we conclude that a = ba for some element b in A. Thus $a \in A^2$,
$A \subseteq A^2$, and therefore $A = A^2$.

To prove the other half of Theorem 1, we need to prove a lemma. This result 
is known (for example, a proof appears in [2], p. 58); we include its proof here 
for the sake of completeness. 


LEMMA 1. If B is a finitely generated idempotent ideal of a commutative ring T,
then B is principal and is generated by an idempotent element.


Proof. We first assume that T has an identity and we let $\{b_i\}_{i=1}^{n}$ be a finite set
of generators for B. Then $B = B^2 = \sum_{i=1}^{n} Bb_i$ so that we obtain a system of
equations

$$b_k = \sum_{i=1}^{n} s_{ki} b_i,$$

where $s_{ki} \in B$ and where $1 \le k \le n$. This gives rise to the system of equations

$$\sum_{i=1}^{n} (\delta_{ki} - s_{ki}) b_i = 0, \qquad 1 \le k \le n,$$

where $\delta_{ki}$ is the Kronecker delta.

By Cramer's rule, $db_i = 0$ for $1 \le i \le n$, where d is the determinant of the matrix
$[\delta_{ki} - s_{ki}]$. It is easy to see, however, that d is of the form 1 - b for some
element b in B. Since $0 = db_i = b_i - bb_i$ for each i, B is the principal ideal generated
by b. And since 1 - b annihilates B, (1 - b)b = 0, or $b = b^2$.

If T contains no identity element, we consider a commutative ring $T^*$ obtained
by adjoining an identity element to T. Then B is a finitely generated
idempotent ideal of $T^*$, and hence is principal as an ideal of $T^*$, generated by
an idempotent element v. Since $T^*$ is obtained by adjoining an identity element
to T, it follows that v also generates B as an ideal of T.
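
The conclusion of Lemma 1 can be observed in a small finite ring. The sketch below works in $Z/12Z$, a ring chosen here purely for illustration; the helper names `ideal`, `product_ideal`, and `idempotent_generator` are assumptions of this example. It finds the idempotent generator of an idempotent ideal by exhaustive search (not by the Cramer's rule argument of the proof) and shows that a non-idempotent ideal has no such generator.

```python
def ideal(n, gens):
    """Ideal of Z/nZ generated by gens, returned as a set of residues."""
    elems = {0}
    for g in gens:
        elems = {(e + r * g) % n for e in elems for r in range(n)}
    return elems

def product_ideal(n, I, J):
    """Ideal generated by all products ij with i in I and j in J."""
    return ideal(n, [(i * j) % n for i in I for j in J])

def idempotent_generator(n, B):
    """Return an idempotent b with (b) = B, or None if there is none."""
    for b in sorted(B):
        if (b * b) % n == b and ideal(n, [b]) == B:
            return b
    return None

n = 12
B = ideal(n, [4])                      # {0, 4, 8}
print(product_ideal(n, B, B) == B)     # B is idempotent
b = idempotent_generator(n, B)
print(b, all(((1 - b) * x) % n == 0 for x in B))  # 1 - b annihilates B

C = ideal(n, [2])                      # C^2 = (4) is smaller than C
print(product_ideal(n, C, C) == C, idempotent_generator(n, C))
```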

To complete the proof of Theorem 1, we assume that $\Lambda$ is finite, say
$\Lambda = \{1, 2, \dots, k\}$, that R is Noetherian, and that A is idempotent. Then A is
principal and is generated by an idempotent element e. Moreover, R is the direct
sum of A and the ideal $B = \{x - ex \mid x \in R\}$. Since R is Noetherian, $A \cong R/B$ and
$B \cong R/A$ are Noetherian rings, and A is a ring with identity. Hence, $A[X_1, \dots, X_k]$
and B are ideals of S which are Noetherian rings. It then follows that

$$S = R + A[X_1, \dots, X_k] = B \oplus A[X_1, \dots, X_k]$$

is a Noetherian ring.


COROLLARY. Suppose that R is a commutative ring with identity containing no
idempotent elements other than 0 and 1. If A is a nonzero ideal of R distinct from R,
and if $\{X_\lambda\}_{\lambda \in \Lambda}$ is any set of indeterminates over R, then
$S = R + A[\{X_\lambda\}]$ is a commutative ring with identity which is not Noetherian.

THE COSRIMS REPORTS


R. P. Boas, Northwestern University 


What is—or was—Cosrims? It was a 12-man committee of the National 
Academy of Sciences under the chairmanship of Lipman Bers, and the name 
stands for Committee on the Support of Research in the Mathematical Sciences. 
Before it was through it had 50 or more collaborators doing things for it, and 
another 50 or so worked on the CBMS Survey, with John Jewett as executive 
director, that collected the needed data. The principal product of Cosrims was 
simply entitled “The Mathematical Sciences—a Report”; it was completed, 
after more than a year of work, at the end of 1967 and issued late in 1968. 

You might well wonder what Cosrims is doing on this program, when it is 
AMS rather than MAA that is primarily concerned with research in math- 
ematics. The answer is that one of Cosrims’ main principles, from the very 
beginning, was that one cannot separate research and education in mathematics. 
This is because the increasing mathematization of society, which we cannot stop 
even if we want to, demands that more and more people should be trained to 
some degree in mathematics (and when I say “mathematics” I mean the math- 
ematical sciences generally); the people who train them must themselves be 
trained at a higher level, and this training must eventually be done by the rela- 
tively small body of mathematical scientists who do research—an activity that 
itself calls for continued learning. Even the purest mathematician I know, one 
who says flatly that he is interested only in research in pure mathematics, at 
least adds, “and in training other people to do research in pure mathematics.” 

Let me quote from the Cosrims report: 

We need people who can teach mathematics in grade school in a way that will not create a
permanent psychological block against mathematics in so many of our fellow citizens. We need
people who can understand a simple formula, read a graph, interpret a statement about prob-

E 2242. Proposed by M. L. Fredman, California Institute of Technology.


Show that $\sum_{i=1}^{n} k^{(i,n)} \equiv 0 \pmod{n}$, $n \ge 1$, for all integers k (positive or
negative), where (a, b) denotes the greatest common divisor.
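
Before attempting a proof of E 2242 it can be reassuring to test the congruence numerically. The following sketch reads the sum as $\sum_{i=1}^{n} k^{\gcd(i,n)}$, which is an interpretation of the garbled printed formula; the function name `gcd_power_sum` is hypothetical.

```python
from math import gcd

def gcd_power_sum(k, n):
    """Sum of k^gcd(i, n) for i = 1..n (exact integer arithmetic)."""
    return sum(k ** gcd(i, n) for i in range(1, n + 1))

# Check divisibility by n for a range of n and for positive and negative k.
ok = all(gcd_power_sum(k, n) % n == 0
         for n in range(1, 40)
         for k in range(-12, 13))
print(ok)
```

A numerical check of this kind is of course not a proof; it only guards against misreading the statement.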


E 2243. Proposed by Richard Sinkhorn, University of Houston. 


Show that every normal stochastic matrix is necessarily doubly stochastic. 


E 2244. Proposed by T. K. Leong, T. A. Peng and K. C. Yeo, University of Singapore


Show that, for any fixed $m \ge 2$, the series
1 1 x 1 1 1 x 
PD a et mo To 1 Oe 
1 1 1 x 
Sm bi! lm+2) m1 Sm 


is convergent for exactly one value of x and find the sum of the series for this x. 


E 2245. Proposed by A. W. Walker, Toronto, Canada


If A, B, C; a, b, c; s are the angles, side lengths, and semiperimeter of any
plane triangle, then

$$(a + b + c)^3 (s - a)(s - b)(s - c) \ge (a^2 + b^2 + c^2)^3 \cos A \cos B \cos C.$$

E 2246. Proposed by Andrew Rochman, Saint Louis University


Let f be a non-constant, real valued, continuous function such that, for all
$x, y \in R$, $f(x+y) = \phi(f(x), y)$. Prove that f is monotonic.


SOLUTIONS OF ELEMENTARY PROBLEMS 
Fixed Point for a Composite Function 


E 2187 [1969, 826]. Proposed by R. E. Chandler, North Carolina State Uni- 
versity 


Let D be the closed unit disk in the complex plane and let $f, g : D \to D$.
Suppose g is analytic on some open set containing D and suppose that f and its
iterates $f^2, f^3, \dots$ each have exactly one fixed point on D. If f and g commute
($f(g(z)) = g(f(z))$ for all $z \in D$) then either g has exactly one fixed point in D or
$g(z) \equiv z$.


I. Solution by J. R. Kuttler, The Johns Hopkins University Applied Physics
Laboratory. The result is not correct. As a counterexample, let g be analytic as
above, not the identity, and have more than one fixed point on D. Let $z_0$ be a
fixed point of g and let $f(z) = z_0$ for all $z \in D$, i.e., f takes all of D onto the point
$z_0$. Then $g(f(z)) = g(z_0) = z_0 = f(g(z))$ for all $z \in D$, and f and its iterates have the
unique fixed point $z_0$.


II. Solution by O. P. Lossers, Technological University, Eindhoven, Netherlands.
With the added hypothesis that f be one-one, the result is valid. Let
$z_0$ be the unique fixed point of f. Then $z_0$ is also the unique fixed point of $f^k$
(k = 2, 3, ...). Since $f(g(z_0)) = g(f(z_0)) = g(z_0)$, the point $g(z_0)$ is a fixed point
of f, so $g(z_0) = z_0$; that is, $z_0$ is also a fixed point of g. Now let $z_1$ be a fixed point
of g. Then we have $g(f(z_1)) = f(g(z_1)) = f(z_1)$. Hence $f(z_1)$ is also a fixed point of g.
It follows by induction that $\{z_n\}_{n=1}^{\infty}$ defined by $z_{n+1} = f^n(z_1)$ is a sequence
of fixed points of g.

If all the $z_n$ are different, then the function g(z) - z has infinitely many zeros
on D. Since g is analytic it follows that $g(z) \equiv z$ in this case. If $g(z) \not\equiv z$ there
are positive integers j, k with j < k such that $z_{j+1} = z_{k+1}$. Hence
$f^j(z_1) = f^k(z_1) = f^j(f^{k-j}(z_1))$. It follows from the fact that f (and hence $f^j$) is
one-to-one that $z_1 = f^{k-j}(z_1)$, and since $z_0$ is the only fixed point of $f^{k-j}$ we see
that $z_1 = z_0$. Hence $z_0$ is the only fixed point of g.


Also solved by M. G. Greening (Australia), M. S. Klamkin, Renate McLaughlin, Norman 
Miller, Simeon Reich (Israel), Steven Rohde, E. F. Schmeichel, and the proposer. 


Series Converging for all Subsets of a Family 


E 2188 [1969, 826]. Proposed by P. R. Chernoff, University of California at 
Berkeley, and W. C. Waterhouse, Cornell University 


Let $\sum_S a_n$, where S is a subset of the positive integers, mean the sum with
elements of S taken in increasing order. Let F be an arbitrary countable family
of subsets; construct a conditionally convergent series $\sum a_n$ with $\sum_S a_n$
converging for all S in F. (Several special cases, notably F = all arithmetic progressions
and F = all geometric progressions, are problems in Pólya & Szegő,
Aufgaben und Lehrsätze, I, 3, 3.)


Solution by the proposers. Let $S_1, S_2, \dots$ be the sets in F. Let $T_1$ be $S_1$ if $S_1$
is infinite; otherwise let $T_1$ be its complement $S_1'$. Let $T_2$ be $T_1 \cap S_2$ if this is
infinite; otherwise let $T_2 = T_1 \cap S_2'$. Proceeding inductively we thus construct a
sequence of infinite sets $T_1 \supseteq T_2 \supseteq T_3 \supseteq \cdots$ with $T_m$ contained in
$S_m$ or $S_m'$. Choose $t_1 \in T_1$, and inductively let $t_{m+1}$ be the least element of
$T_{m+1}$ greater than $t_m$. Then the $t_k$ for $k \ge m$ lie all in $S_m$, or all in its
complement. Now just set $a_{t_k} = (-1)^k/k$, with all other $a_n = 0$.


Also solved by G. A. Heuer, and Joel Spencer. 


An Identity on Restricted Partitions 


E 2189 [1969, 826]. Proposed by S. W. Golomb, University of Southern 
California 

Let b(n, k) be the number of partitions of the positive integer n into k integer
parts all of which are powers of 2. (We allow parts to be repeated; we allow
$2^0 = 1$ as a part; and we disregard the order in which the k parts are listed.)
Show that

$$1 + \sum_{n=1}^{\infty} \sum_{k=1}^{\infty} b(n, k)\, x^n y^k = 1 \Big/ \sum_{n=0}^{\infty} (-y)^{w(n)} x^n,$$

where w(n) is the number of 1's in the binary representation of n. Also, reach a
conclusion about these partitions based on the substitution y = -1.


Solution by Michael Goldberg, Washington, D. C. It was noted by Euler that
the coefficient of $x^n y^k$ in the expansion of

$$F(x, y) = (1 + yx)(1 + yx^2)(1 + yx^4) \cdots \tag{1}$$

is the number of different ways that n is the sum of k different powers of 2.
Hence, $F(x, y) = \sum_{n=0}^{\infty} y^{w(n)} x^n$. Also he noted that the coefficient of $x^n y^k$
in the expansion of

$$P(x, y) = 1/\big((1 - yx)(1 - yx^2)(1 - yx^4) \cdots\big) \tag{2}$$

is the number of ways that n is the sum of k powers of 2, repetitions allowed.
Therefore

$$P(x, y) = 1 + \sum_{n=1}^{\infty} \sum_{k=1}^{\infty} b(n, k)\, x^n y^k.$$

Also we obtain $1/P(x, y) = F(x, -y) = \sum_{n=0}^{\infty} (-y)^{w(n)} x^n$. Hence, the indicated
relation is verified.
When y = -1, we have $P(x, -1) = 1/F(x, 1)$, which gives a relation between partitions
into an odd number of parts and partitions into an even number of parts.


Also solved by L. Carlitz, N. G. Fine, M. G. Greening (Australia), Douglas Lind, E. F.
Schmeichel, and the proposer.
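
The identity of E 2189 can be checked coefficient by coefficient up to a fixed degree. The sketch below is an illustration only; the names `b_table` and `check_identity` and the truncation degree are assumptions of this example. It computes b(n, k) by a standard dynamic program over the parts 1, 2, 4, ..., multiplies the truncated series $1 + \sum b(n,k)x^n y^k$ by $\sum (-y)^{w(n)} x^n$, and confirms that every coefficient of $x^n$ with $n \ge 1$ vanishes. It also checks the consequence at y = -1: for $n \ge 2$ there are as many partitions into an even number of powers of 2 as into an odd number.

```python
def b_table(N):
    """b[n][k] = number of partitions of n into exactly k powers of 2, for n, k <= N."""
    b = [[0] * (N + 1) for _ in range(N + 1)]
    b[0][0] = 1
    p = 1
    while p <= N:
        # Ascending n lets the part p be reused any number of times.
        for n in range(p, N + 1):
            for k in range(1, N + 1):
                b[n][k] += b[n - p][k - 1]
        p *= 2
    return b

def check_identity(N):
    """Verify that P(x, y) * sum((-y)^w(n) x^n) has no x^n terms for 1 <= n <= N."""
    b = b_table(N)
    for n in range(1, N + 1):
        coeff = {}                     # exponent of y -> coefficient
        for m in range(n + 1):
            w = bin(n - m).count("1")  # number of 1's in binary n - m
            for k in range(m + 1):
                if b[m][k]:
                    coeff[k + w] = coeff.get(k + w, 0) + (-1) ** w * b[m][k]
        if any(coeff.values()):
            return False
    return True

N = 32
b = b_table(N)
print(check_identity(N))
print(all(sum((-1) ** k * b[n][k] for k in range(N + 1)) == 0 for n in range(2, N + 1)))
```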


ADVANCED PROBLEMS 


All solutions of Advanced Problems should be sent to J. Barlaz, Rutgers, The State
University, New Brunswick, NJ 08903. Solutions of Advanced Problems in this issue should be
typed (with double spacing) on separate, signed sheets and should be mailed before September
30, 1970. Contributors (in the United States) who desire acknowledgement of receipt of their
solutions are asked to enclose self-addressed, stamped postcards.


An asterisk (*) means neither the proposer nor the editors supplied a solution.


5740. Proposed by L. R. Etzweiler and W. C. Waterhouse, Cornell University


Let R be a commutative ring, S a multiplicative subset of R. Let M and N
be R-modules. There is a canonical map

$$S^{-1}\,\mathrm{Hom}_R(M, N) \to \mathrm{Hom}_{S^{-1}R}(S^{-1}M, S^{-1}N).$$

This map is known to be injective if M is finitely generated, and bijective if M
is finitely presented (N. Bourbaki, Algèbre Commutative, Ch. II, Section 2,
Prop. 19). Find an example in which M is finitely generated and the map is
not surjective.


5741. Proposed by Simeon Reich, Israel Institute of Technology


Let $p(z) = a_0 + a_1 z + \cdots + a_n z^n$, $a_n \ne 0$, be a polynomial with complex
coefficients. It is known (Morris Marden, The Geometry of the Zeros, AMS, New
York, 1949, p. 98) that if $z_1$ is the zero of largest modulus, then

$$|z_1| < \Big\{1 + \sum_{j=0}^{n-1} |a_j/a_n|^2\Big\}^{1/2}.$$

Prove that if $z_n$ is the zero of smallest modulus, then

$$|z_n| < \Big\{1 + \frac{1}{n}\Big(\sum_{j=0}^{n-1} |a_j/a_n|^2\Big)\Big\}^{1/2}.$$


5742. Proposed by Anne Penfold Street, University of Alberta 


Let G be a group and A a subgroup of G. Let $x \in G$, $x \notin A$. We say x augments
A if $A_x = A \cup \{x, x^{-1}\}$ is also a subgroup of G. Suppose A is a subgroup of G such
that any element of G augments A. Characterize G.


5743. Proposed by Bjarni Jónsson and J. B. Nation, Vanderbilt University
Let f(x) be a real valued function with at least k derivatives. Given that for
some real number r,

$$\lim_{x \to \infty} x^r f(x) = 0 \quad \text{and} \quad \lim_{x \to \infty} x^r f^{(k)}(x) = 0,$$

prove that $\lim_{x \to \infty} x^r f^{(j)}(x) = 0$, $0 \le j \le k$.

5744. Proposed by Jan Mycielski, University of Colorado


A cube $20 \times 20 \times 20$ is built out of bricks of the form $2 \times 2 \times 1$. The faces of
the bricks are parallel to the faces of the cube but they need not all lie flat.
Prove that the cube can be pierced by a straight line perpendicular to one of the
faces which does not pierce any of the bricks.


5745.* Proposed by Daniel Pedoe, University of Minnesota


Q is a non-specialized quadric hypersurface in Euclidean space of n dimensions,
L a hyperplane which intersects Q in a non-specialized quadric of one less
dimension. We consider the pencil of quadric hypersurfaces $Q + kL^2 = 0$, and
suppose that $Q_1, Q_2, Q_3, Q_4$ are any four of them; $T_1, T_2, T_3, T_4$ the total Gaussian
curvatures at any point on the quadric of contact Q = L = 0; $M_1, M_2, M_3, M_4$
the mean Gaussian curvature of the respective quadrics at the same point.
Prove that the two cross-ratios

$$\{T_1, T_2, T_3, T_4\}, \qquad \{M_1, M_2, M_3, M_4\}$$

are equal. Deduce that if two conics touch at the two points $P_1$, $P_2$, then the
ratio of their curvatures is the same at both points.


SOLUTIONS OF ADVANCED PROBLEMS 


Homomorphism in a Semigroup 


5522 [1967, 1014; 1968, 801]. Proposed by T. C. Brown, Kiev State University, 
U.S. S. R. 


Let S be a periodic semigroup whose idempotents lie in the center. Then the
mapping $x \to x^{n(x)}$, where $x^{n(x)}$ is idempotent, is a homomorphism.


II. Solution by the proposer. Let $x, y \in S$; choose m so that $x^m$, $y^m$, $(xy)^m$ are
all idempotent. Then


(xy)™ = (xy)™(ay)™ = xPa(ay)™. 


By induction, (xy)™=x"™w'(xy)™. Similarly (xy)™= (xy)™w’/y™. Hence (xy)™ 
= (xy)™xmy™, Now 


(xy)™ = (xy)™y™ryamamy™ = (ary)™ (ym tam) (ary) amy”, 
By induction, 
(xy)™ = (wy)™(yrtamt) (sey) may = (ary) tam—2) marmy, 
Then 
(xy)™ = (xy) Tay) (yr tat) (ym tat) m— Tama 
= (xy) (ym tat) haemy, 


By induction, $(xy)^m = x^m y^m$.

EDITORIAL NOTE. The first solution printed for this problem [1968, 801] assumed a
cancellation law. The proposer's solution above avoids the assumption.


Evaluation of $\int_{-\infty}^{\infty} \prod_{j=1}^{n} \frac{\sin k_j(x - a_j)}{x - a_j}\,dx$


5529 [1967, 1015; 1968, 914]. Proposed by D. S. Mitrinović, University of
Belgrade, Yugoslavia


Evaluate

$$\int_{-\infty}^{\infty} \prod_{j=1}^{n} \frac{\sin k_j(x - a_j)}{x - a_j}\,dx,$$

with $k_j$, $a_j$, j = 1, 2, ..., n, real numbers.


Note. The published solution for this problem is in error. Murray S. Klamkin
remarks that it is to be expected that the given integral depend on all the
k's and be symmetric in $k_j$, $a_j$. The formula obtained in the solution

$$I = \pi \prod_{j=2}^{n} \frac{\sin k_j(a_{j-1} - a_j)}{a_{j-1} - a_j}$$

does not involve $k_1$ and is not symmetric as required. ($k_j = 0$ must imply I = 0.)
Accordingly the solution is withdrawn and we urge our readers to reconsider
the problem.


A Closed Subset in the Gelfand Space of a Banach Algebra 


5654 [1969, 200]. Proposed by L. J. Lardy and J. A. Lindberg, Jr., Syracuse 
University 


Let A be a commutative Banach algebra with identity, and let $\Phi_A$ denote
the Gelfand space of A. Call a closed subset J of $\Phi_A$ an inverting set for A if the
invertible elements of A are precisely those elements whose Gelfand transform
does not vanish on J. Is the intersection of two inverting sets an inverting set?


Solution by Larry Eifler, Louisiana State University. Not necessarily. Let
$\Delta$ denote the closed unit disk in C. Let A be the uniform closure in
$C(\Delta \times \Delta)$ of the polynomials in two variables. Then the Gelfand space of A can be
identified with $\Delta \times \Delta$. Consider $K = \{(z, w) \in \Delta \times \Delta : |z| = |w|\}$ and
$L = \{(z, w) \in \Delta \times \Delta : |z| = 1 \text{ or } |w| = 1\}$. Both K and L are inverting sets
for A but $K \cap L$ is not an inverting set for A.


Also solved by the proposer. 


Tangents to Rectifiable Curves 


5659 [1969, 309]. Proposed by K. L. Singh, Memorial University of New- 
foundland 


Prove that a continuous rectifiable curve in a uniformly convex space
possesses a tangent at almost all of its points.

Solution by H. Pétard, III, Princeton University. We prove the more general
result: A continuous rectifiable curve in a Banach space E with separable dual
space $E'$ has a tangent vector at almost all of its points.

Let $\gamma(t)$ be a continuous rectifiable curve in a Banach space E, with dual
$E'$ and second dual $E''$. Define a measure $\mu$ on [0, 1] by setting $\mu([a, b])$ = variation
of $\gamma(t)$ in [a, b]; $\mu$ is finite and positive. We are going to construct a function
f, from [0, 1] into $E''$, such that $\gamma(b) - \gamma(a) = \int_a^b f(t)\,\mu(dt)$ for $0 \le a < b \le 1$.

For each vector $x' \in E'$, set $\lambda_{x'}(A) = \int_A x'(d\gamma(t))$, where A denotes an arbitrary
Borel subset of [0, 1]; $\lambda_{x'}$ is a measure satisfying the inequality
$|\lambda_{x'}(A)| \le \|x'\| \cdot \mu(A)$ for any A. Hence, the Radon-Nikodym theorem tells us that
$\lambda_{x'}(A) = \int_A f_{x'}(t)\,\mu(dt)$, where $|f_{x'}(t)| \le \|x'\|$ almost everywhere. If
$x', y' \in E'$, then clearly $f_{x'+y'}(t) = f_{x'}(t) + f_{y'}(t)$ for almost all t. Therefore, every t
outside of an appropriate union of exceptional sets gives rise to an element $f(t) \in E''$
satisfying $x'(f(t)) = f_{x'}(t)$. If $E'$ is separable, then our bad set of t's will be a countable
union of exceptional sets, and therefore has measure zero. Clearly then, $\|f(t)\| \le 1$
almost everywhere, and

$$x'(\gamma(b) - \gamma(a)) = \int_a^b x'(f(t))\,\mu(dt),$$

which implies the desired identity $\gamma(b) - \gamma(a) = \int_a^b f(t)\,\mu(dt)$.
Now we can write $\mu(A) = \mu_s(A) + \int_A g(t)\,dt$, where $g \in L^1(0, 1)$ and $\mu_s$ is singular
with respect to Lebesgue measure. Therefore,

$$\gamma(b) - \gamma(a) = \int_a^b f(t)g(t)\,dt + \int_a^b f(t)\,\mu_s(dt) = \int_a^b f(t)g(t)\,dt + \lambda_s((a, b]).$$

(Here $\lambda_s$ is a vector-valued measure satisfying $\|\lambda_s(A)\| \le \mu_s(A)$.) So

$$\lim_{\epsilon \to 0} \epsilon^{-1}(\gamma(a + \epsilon) - \gamma(a)) = \lim_{\epsilon \to 0} \epsilon^{-1} \int_a^{a+\epsilon} f(t)g(t)\,dt + \lim_{\epsilon \to 0} \epsilon^{-1}\lambda_s((a, a + \epsilon]). \tag{*}$$

The first limit on the right-hand side of (*) exists for almost every $t \in [0, 1]$,
since Lebesgue's theorem on differentiation of the integral holds for vector-valued
functions (see Dunford-Schwartz p. 217 for the proof, which just copies
the proof for the scalar case). The final term in (*) is zero for almost all t, since
$\|\epsilon^{-1}\lambda_s((a, a+\epsilon])\| \le \epsilon^{-1}\mu_s((a, a+\epsilon]) \to 0$ as $\epsilon \to 0$.

Thus, equation (*) shows that $\gamma$ has a tangent vector for almost every t,
provided only that the dual space $E'$ is separable.

Finally, suppose that $\gamma$ takes its values in any reflexive Banach space E.
To show that $\gamma$ has tangents, we may restrict our attention to the subspace
$E_0 \subseteq E$, the closed linear span of the vectors $\{\gamma(t) : 0 \le t \le 1\}$. $E_0$ is a separable
reflexive Banach space, which implies that $E_0'$ is separable.


A Combinatorial Identity


5663 [1969, 309]. Proposed by D. Z. Djokovic, University of Waterloo, Ontario


Show that

$$\sum \frac{1}{p_1!\,p_2! \cdots p_n!} = \frac{1}{k!}\binom{n-1}{k-1},$$

where the summation is over all nonnegative integers $p_1, p_2, \dots, p_n$ such that
$p_1 + p_2 + \cdots + p_n = k$ and $p_1 + 2p_2 + \cdots + np_n = n$.


Solution by Henry Ricardo, Yeshiva University and Manhattan College. This
result is part of the more general problem E 700 [1946, 340]. A solution, essentially
that given for E 700, follows: In the formal identity

$$(x + x^2 + \cdots + x^n)^k = x^k(1 - x^n)^k(1 - x)^{-k} = x^k(1 - x^n)^k \sum_{j=0}^{\infty} \binom{k+j-1}{j} x^j$$

we equate the coefficients of $x^n$ to get

$$\sum \frac{k!}{p_1!\,p_2! \cdots p_n!} = \binom{n-1}{k-1},$$

which is the desired result upon dividing by k!. Here we have used the multinomial
theorem where the summation and the $p_i$ have the same meaning as in
our problem.
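
The identity of 5663 lends itself to an exact brute-force check. The sketch below rests on an assumption that should be made explicit: the summation conditions are taken to be $p_1 + \cdots + p_n = k$ and $p_1 + 2p_2 + \cdots + np_n = n$, which is what the multinomial-theorem argument in the solution suggests. The names `lhs` and `rhs` are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def lhs(n, k):
    """Sum of 1/(p_1! ... p_n!) over p_i >= 0 with sum p_i = k and sum i*p_i = n."""
    def rec(i, parts_left, weight_left):
        if i > n:
            done = parts_left == 0 and weight_left == 0
            return Fraction(1) if done else Fraction(0)
        total = Fraction(0)
        p = 0
        while p <= parts_left and i * p <= weight_left:
            total += Fraction(1, factorial(p)) * rec(i + 1, parts_left - p,
                                                     weight_left - i * p)
            p += 1
        return total
    return rec(1, k, n)

def rhs(n, k):
    """(1/k!) * C(n-1, k-1) as an exact fraction."""
    return Fraction(comb(n - 1, k - 1), factorial(k))

print(lhs(4, 2), rhs(4, 2))
print(all(lhs(n, k) == rhs(n, k) for n in range(1, 9) for k in range(1, n + 1)))
```

Exact rational arithmetic is used so that the comparison does not depend on floating-point rounding.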


Also solved by H. D. Abramson (Scotland), Anders Bager (Denmark), M. T. L. Bizley (En- 
gland), Michel Bousquet, Robert Breusch, Paul Brock, L. Carlitz, C. A. Church, Jr., S. C. Currier, 
Jr., A. E. Fekete, N. J. Fine, Roberto Frucht (Chile), H. W. Gould, M. G. Greening (Australia), 
D. A. Hejhal, M. Hirsello, J. E. Kilpatrick, M.S. Klamkin, Harry Lass, C. P. Lawes, Douglas 
Lind, O. P. Lossers (Netherlands), Andrzej Makowski (Poland), Marvin Marcus, S. E. Payne, 
Henri Pétard, Jernej Polajnar (Yugoslavia), D. P. Roselle, Barry Simon, James Singer, T. Tamura 
(Japan), E. W. Trost (Switzerland), D. A. Zave, David Zeitlin, and P. J. Zwier. 

Note. This problem has appeared earlier. For other solutions and earlier references see SIAM
Review, Vol. 9 (1967) p. 252.


The Inverse Cube Law for a Planet 
5664 [1969, 310]. Proposed by Harry Pollard, Purdue University 


Suppose a particle is attracted to a fixed center O by a force proportional to 
the inverse cube of its distance 7 from O. Without determining the possible 
orbits, show that one of these three things must occur: 

(a) the particle moves in a circle;
(b) the particle collides with O in a finite time;
(c) $r \to \infty$ as $t \to \infty$.
Conclude that the inverse cube law is a poor substitute for the inverse
square law in designing a solar system.


I. Solution by M. L. Laplaza, University of Puerto Rico. It is well known that
we have motion in a plane when the particle is subject to a central force. The
hypothesis becomes

$$\frac{d^2x}{dt^2} = -\frac{kx}{(x^2 + y^2)^2}, \qquad \frac{d^2y}{dt^2} = -\frac{ky}{(x^2 + y^2)^2}.$$

Suppose that $x^2 + y^2 \ne 0$; then

$$\frac{d}{dt}\left[\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2\right] = \frac{d}{dt}\left(\frac{k}{x^2 + y^2}\right),$$

$$\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2 = k_1 + \frac{k}{x^2 + y^2}, \qquad \frac{d}{dt}\left(x\frac{dx}{dt} + y\frac{dy}{dt}\right) = k_1.$$

From this, there follow

$$x\frac{dx}{dt} + y\frac{dy}{dt} = k_1 t + k_2, \qquad \frac{d}{dt}(x^2 + y^2) = 2k_1 t + 2k_2,$$

and so,

$$x^2 + y^2 = k_1 t^2 + 2k_2 t + k_3.$$

If $\lim_{t \to \infty}(x^2 + y^2) \ne \infty$, then $k_1 = k_2 = 0$, and $x^2 + y^2 = k_3$. If $k_1$ or $k_2 \ne 0$,
$x^2 + y^2$ may become zero for some value of t, but eventually goes to infinity.
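
The key fact in Solution I, that $r^2 = x^2 + y^2$ is a quadratic polynomial in t, can be observed numerically. The following sketch makes several assumptions for illustration: unit mass, the equations of motion $\ddot{x} = -kx/r^4$, $\ddot{y} = -ky/r^4$, a classical fourth-order Runge-Kutta integrator, and arbitrarily chosen initial data. The names `simulate_r2` and `predicted_r2` are hypothetical. The constants are read off from the initial state as $k_1 = v_0^2 - k/r_0^2$, $k_2 = x\dot{x} + y\dot{y}$ at $t = 0$, and $k_3 = r_0^2$.

```python
def simulate_r2(x, y, vx, vy, k, dt, steps):
    """Integrate the planar inverse-cube problem (unit mass) with RK4; return r^2 at the end."""
    def deriv(s):
        px, py, ux, uy = s
        r2 = px * px + py * py
        # Attraction with magnitude k / r^3, directed toward the origin.
        return (ux, uy, -k * px / r2 ** 2, -k * py / r2 ** 2)

    s = (x, y, vx, vy)
    for _ in range(steps):
        d1 = deriv(s)
        d2 = deriv(tuple(a + 0.5 * dt * b for a, b in zip(s, d1)))
        d3 = deriv(tuple(a + 0.5 * dt * b for a, b in zip(s, d2)))
        d4 = deriv(tuple(a + dt * b for a, b in zip(s, d3)))
        s = tuple(a + dt / 6.0 * (p + 2 * q + 2 * r + w)
                  for a, p, q, r, w in zip(s, d1, d2, d3, d4))
    return s[0] ** 2 + s[1] ** 2

def predicted_r2(x, y, vx, vy, k, t):
    """Quadratic k1*t^2 + 2*k2*t + k3 with constants fixed by the initial state."""
    k1 = vx * vx + vy * vy - k / (x * x + y * y)
    k2 = x * vx + y * vy
    k3 = x * x + y * y
    return k1 * t * t + 2 * k2 * t + k3

# Illustrative initial data: start at (1, 0) moving tangentially with speed 1.5.
state = (1.0, 0.0, 0.0, 1.5)
numeric = simulate_r2(*state, k=1.0, dt=0.001, steps=2000)
exact = predicted_r2(*state, k=1.0, t=2.0)
print(numeric, exact)
```

With tangential speed 1 instead of 1.5 (and k = 1) the quadratic reduces to the constant $k_3$, which is the circular orbit of case (a).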


II. Solution by Robert Israel, University of Chicago. It is well known that for
central-force problems the particle motion is in a plane, that the angular momentum
$A = mv_\theta r$ is constant, and that

$$\frac{d^2r}{dt^2} = \frac{v_\theta^2}{r} + \frac{F}{m}.$$

Substitution of $F = -k/r^3$ leads to the equation

$$\frac{d^2r}{dt^2} = \frac{A^2 - km}{m^2r^3}.$$

Thus if $A^2 = km$, the radial component of velocity is constant, so the particle
either reaches the origin in a finite time, stays a constant distance from the
center (travelling in a circular orbit since A is constant and nonzero), or recedes
from the center at a constant rate. If $A^2 > km$, then $r \to \infty$ as $t \to \infty$ because
conservation of energy prevents the particle from colliding with the origin and
because the assumption that r is bounded by some R gives a lower limit of
$(A^2 - km)/m^2R^3$ for $d^2r/dt^2$, which implies that the particle must attain r > R
in a finite time. If $A^2 < km$, the particle may encounter the origin in a finite
time or go infinitely far away as $t \to \infty$, because if the motion is bounded by some
R we have a lower limit of $(km - A^2)/m^2R^3$ for $(-d^2r/dt^2)$, so that in a finite time
the radial velocity must be negative, after which in a finite time the particle
must encounter the origin.

The inverse-cube law is a poor choice for a solar system because the circular
orbits are not stable; almost any slight perturbation will send a planet on a path
leading either into the sun or out of the system.


Also solved by L. E. Clarke (England), D. H. Gaeddert, D. A. Hejhal, O. P. Lossers
(Netherlands), Henrik Meyer (Denmark), H. Pétard, Jr., and J. S. Shipman.

Clarke furnishes the reference to L. A. Pars, Introduction to Dynamics (Cambridge, 1953), p.
254 for the result: A circular orbit described under an inverse nth power law of attraction is stable
if and only if n < 3.


Fixed Points in a Limited Contraction 
5672 [1969, 565]. Proposed by Stephen Weingram, Purdue University 


A continuous map f:R*—R” is a contraction outside the set K if there is a 
constant c(0<c<1) such that for any two points x, y outside K, | f(x) —f(y)| 
<c|x—y|. Prove that if f:R"—R* is a contraction outside a compact set K, 
it has a fixed point. 


Solution by Michael Menn, Boston College. Assume (without loss of generality)
that $0 \notin K$. Let $M = \max\{\operatorname{lub}|f|_K,\ |f(0)|/(1 - c)\}$, where $|f|_K$ is $|f|$ restricted
to $K$, and let $B$ be the closed ball of radius $M$ about the origin. Note that
$cM + |f(0)| \le M$. The result will follow from the Brouwer fixed point theorem if it
can be shown that $f$ maps $B$ into $B$. But if $x \in K$ then $f(x) \in B$, and if $x \in B - K$ then
$|f(x)| \le |f(x) - f(0)| + |f(0)| \le c|x| + |f(0)| \le cM + |f(0)| \le M$.
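
The construction of the ball $B$ can be tried out on a concrete map. The sketch below is a one-dimensional illustration under stated assumptions: the map `f`, the constant `C`, and the set `K = [2, 3]` are hypothetical choices made for this example only, the supremum over $K$ is merely estimated on a grid, and bisection stands in for the Brouwer theorem (which in one dimension reduces to the intermediate value theorem).

```python
import math

C = 0.5          # contraction constant c outside K (assumed for this example)
K = (2.0, 3.0)   # compact set K = [2, 3]; note that 0 is not in K


def f(x):
    """Hypothetical map: an affine contraction plus a bump supported in K."""
    bump = 5.0 * math.sin(math.pi * (x - K[0])) if K[0] <= x <= K[1] else 0.0
    return C * x + 1.0 + bump


def radius_M(samples=10001):
    """M = max(sup_K |f|, |f(0)| / (1 - c)), with sup_K estimated on a grid."""
    lo, hi = K
    sup_K = max(abs(f(lo + (hi - lo) * i / (samples - 1))) for i in range(samples))
    return max(sup_K, abs(f(0.0)) / (1.0 - C))


def maps_ball_into_itself(M, samples=20001, tol=1e-6):
    """Check |f(x)| <= M (up to tol) on a grid of the ball B = [-M, M]."""
    return all(abs(f(-M + 2 * M * i / (samples - 1))) <= M + tol
               for i in range(samples))


def fixed_point_by_bisection(M, iterations=80):
    """Locate a zero of f(x) - x on [-M, M]; valid once f maps B into B."""
    lo, hi = -M, M
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if f(mid) - mid >= 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

The tolerance in `maps_ball_into_itself` only absorbs the grid error in the estimated supremum; it is not part of the argument itself.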

Also solved by D. T. Adams, D. R. Anderson & V. M. Sehgal, Einar Andresen (Norway), 
David Boyd, Theodore Chang, E. P. Del Norte, D. A. Herrero, R. J. Loy, Joel Spencer & Emmett 
Keeler, Jean Spitzer, R. A. Struble, J. H. van Lint (Netherlands), D. A. Zave, Martin Ziegler 
(Germany), and the proposer. 

Boyd proves the result using the condition on $f$ that $|f(x)| \le A + c|x|$, $A \ge 0$, $0 < c < 1$ outside
a bounded set $K$. Loy notes the necessity of the boundedness condition on $K$ and that $K$ may or
may not contain any of the fixed points. However, if $f(K) \subset K$, then all of the fixed points are in $K$.


Product of Algebraic Numbers 


5674 [1969, 565]. Proposed by L. Carlitz, Duke University. 


It is proved in problem 5542 [1968, 1021] that the following statement is incorrect:
If $a$, $b$ are algebraic over $F$ of degree $m$ and $n$ respectively and if $m$ and
$n$ are relatively prime, then $ab$ is algebraic over $F$ of degree $mn$.

Show that the statement is correct when $F = GF(q)$, $q = p^n$, $p$ prime, $n \ge 1$.


Solution by H. F. Mattson, Jr., Sylvania Applied Research Laboratory,
Waltham, Mass. We use the fact that greatest common divisors satisfy

$$(q^m - 1,\ q^n - 1) = q^{(m,n)} - 1,$$

which one proves easily by looking at the fields of the indicated sizes. To say
that $a$ has degree $m$ means that $a$ is in $GF(q^m)$ but not in any proper subfield.
This means that $a$ is a root of unity of order $\alpha$, where $\alpha$ divides $q^m - 1$; but $\alpha$
does not divide $q^d - 1$ for any proper divisor $d$ of $m$. Similarly for $b$ of order $\beta$
and degree $n$.

Now $ab$ is a root of 1 of order l.c.m.$(\alpha, \beta)$. It has degree $D$ for some $D$ dividing
$mn$. Thus, both $\alpha$ and $\beta$ divide $q^D - 1$. Thus, $\alpha$ divides $(q^D - 1, q^m - 1)
= q^{(D,m)} - 1$, which implies $m$ divides $D$. Similarly, $n$ divides $D$. Since $(m, n) = 1$,
$D = mn$ as required.
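
The gcd identity and the link between order and degree are easy to spot-check. This is a minimal sketch, not part of the published solution: `gcd_identity_holds` tests the identity for given integers, and `degree_from_order` is a hypothetical helper name for the smallest $d$ with $\alpha \mid q^d - 1$, which is the degree over $GF(q)$ of a root of unity of order $\alpha$.

```python
from math import gcd


def gcd_identity_holds(q, m, n):
    """Check (q^m - 1, q^n - 1) == q^(m, n) - 1 for integers q >= 2 and m, n >= 1."""
    return gcd(q ** m - 1, q ** n - 1) == q ** gcd(m, n) - 1


def degree_from_order(q, alpha):
    """Smallest d >= 1 with alpha dividing q^d - 1 (requires gcd(q, alpha) == 1)."""
    if gcd(q, alpha) != 1:
        raise ValueError("alpha must be prime to q")
    d, power = 1, q % alpha
    while power != 1 % alpha:
        power = (power * q) % alpha
        d += 1
    return d
```

For instance, with $q = 2$ an element of order 7 has degree 3 and an element of order 3 has degree 2, while an element of order 21 has degree 6, in line with the statement being proved.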

Also solved by Anders Bager (Denmark), Douglas Lind, R. A. Moore, J. H. Smith, T. Tamura
(Japan), and the proposer.

Bager notes that the two n's appearing in the problem are not to be taken as equal.

Basic Statistics. By David Blackwell. McGraw-Hill, New York, 1969. v+143 pp.
$5.50. (Telegraphic Review, October 1969.)


This book is truly a new approach for a course in statistics to satisfy a gen- 
eral requirement in mathematics for undergraduates. It is one text in which the 
author accomplishes what he says he will in the preface. 

Used in the classroom during the summer of 1969, it was delightful for both 
instructor and students. One is first attracted by the cover, the small size of the 
book, the largeness and clearness of the print, and the eye-catching diagrams. 
One is next impressed with the substance and depth of the subject matter, 
covering the usual topics, but covering them in a logical, well-organized, unique 
and intriguing manner. The material of one chapter is subtly structured to lead 
into that of the next chapter. The instructor finds himself unable to change the 
order of presentation, but not wanting to make a change. Problems at the end 
of each chapter also contribute to the smooth movement to the next chapter 
and include ones that challenge the more capable student to investigate the

WHAT IS A SHEAF?


J. ARTHUR SEEBACH, Jr., LINDA A. SEEBACH, and LYNN A. STEEN,
St. Olaf College


Ever since Jean Leray and Henri Cartan in 1950 formally introduced the 
concept of a sheaf, the various examples and applications of sheaves have come 
to play a major role in such diverse fields as several complex variables, algebraic 
geometry, and differential and algebraic topology. Yet nearly all monographs 
which use or introduce sheaves assume the sophistication of graduate level 
algebraic topology. So it is very difficult for an undergraduate to acquire from 
the available literature a real understanding of sheaves and their applications. 
It is the purpose of this article to introduce the theory of sheaves at an ele- 
mentary level with the hope that the interested reader will then be able to 
approach any of the standard treatises (e.g., [2], [3], [6], [10], or [11]) with 
significant insight. 

Our avenue of approach to the theory of sheaves will be through examples 
drawn from three major areas of mathematics: from analysis, the sheaf of germs 
of holomorphic functions; from algebra, the sheaf of local rings; and from geome- 
try, the sheaf of differential forms. We will develop each of these particular 
sheaves in considerable detail, for the different perspectives thus revealed will 
more readily make transparent the subsequent discussion of the general theory 
of sheaves. 


1. The sheaf of germs of holomorphic functions. A holomorphic (or analytic)
function on an open subset $D$ of complex $n$-space $\mathbf{C}^n$ is defined to be a complex
valued function on $D$ which has a local power series representation at each
point of $D$. Osgood's lemma [6, p. 2] asserts that a continuous function on
$D \subset \mathbf{C}^n$ is holomorphic if and only if it is holomorphic in each variable separately.

A very important property of holomorphic functions is that they are 
uniquely determined by their behavior on open sets: if f and g are holomorphic 
on a domain D (a domain is a connected open set), and if f equals g on a non- 
empty open subset of D, then f equals g on all of D. To see this, we need only 
observe that the largest open subset of D on which f=g is also closed (relative 
to D), since the partial derivatives which determine the power series expansion 
are continuous. Since D is connected, this set must be D. 

Now if $z \in \mathbf{C}^n$, we say that $f$ is holomorphic at $z$ if it is holomorphic on some
neighborhood of $z$. The collection $A_z$ of functions holomorphic at $z$ forms an
algebra over the field of complex numbers in which the operations of sum and
product involve intersection of domain: if $f: U \to \mathbf{C}$ and $g: V \to \mathbf{C}$, then $f + g:
U \cap V \to \mathbf{C}$ and $fg: U \cap V \to \mathbf{C}$. We let $I_z$ be the ideal in $A_z$ consisting of those
functions in $A_z$ which vanish identically on some neighborhood of $z$.


Professor Steen received his Ph.D. in 1965 under Kenneth Hoffman at M.I.T.; Professor Seebach
received his Ph.D. in 1968 under A. I. Weinzweig at Northwestern; his wife Linda Seebach
is a Ph.D. candidate under Daniel Zelinsky, also at Northwestern. Professors Steen and Seebach
are the authors of the recently published Counterexamples in Topology.

The paper printed here was derived from a series of lectures presented by the authors at a
weekly joint colloquium of the mathematics staffs of St. Olaf and Carleton Colleges, both located
in Northfield, Minnesota. Editor.

The algebra of germs of holomorphic functions at $z$ is then defined to be the
quotient ring (algebra) $A_z/I_z$, and is denoted by $\mathcal{O}_z$. So a germ of a holomorphic
function is an element $f + I_z$ of $\mathcal{O}_z$, where $f$ is holomorphic at $z$. We will usually
denote this germ by $[f]_z$. Following the usual practice, we shall often identify, or
fail to distinguish between, two functions which belong to the same germ. This
sloppiness is somewhat justified by the uniqueness property stated above, for
two functions which belong to the same germ and are defined on the same domain
$D$ must differ by a function in $I_z$, which means that they agree on some
neighborhood of $z$, and thus must agree on $D$.

We may now define the stalk space (espace étalé) of germs of holomorphic
functions to be the set $S = \{(z, [f]_z) \mid f \text{ is holomorphic at } z \in \mathbf{C}^n\}$ together with
the natural mapping $p$ from $S$ to $\mathbf{C}^n$ defined by $p((z, [f]_z)) = z$. We call $p^{-1}(z)$ the
stalk at the point $z \in \mathbf{C}^n$; it is simply a copy of $\mathcal{O}_z$, the algebra of germs of
holomorphic functions at $z$. The stalk space $S$ is thus the disjoint union of the stalks.
Intuitively, we shall picture $S$ as a space of interpenetrating sheets lying over
$\mathbf{C}^n$, with $p$ projecting $S$ onto $\mathbf{C}^n$ (Fig. 1).


FIG. 1


To make this intuitive picture more precise, we lift the topology of $\mathbf{C}^n$ back
to $S$, to make $S$ into a topological space. For each open set $U$ in $\mathbf{C}^n$ and each
function $f$ which is holomorphic on $U$, we define $V(f, U) = \{(z, [f]_z) \mid z \in U\}$. Each
such $V(f, U)$ is contained in $S$, and the collection of all such sets covers $S$, for
if $(z, [f_0]_z) \in S$, $f_0$ must be holomorphic on some neighborhood $U_0$ of $z$ and
$(z, [f_0]_z) \in V(f_0, U_0)$. Furthermore, $V(f_1, U_1) \cap V(f_2, U_2) = V(f, U)$ where
$U = \{z \in U_1 \cap U_2 \mid [f_1]_z = [f_2]_z\}$ and $f = f_1|_U = f_2|_U$. Thus the sets $V(f, U)$ form a basis for a
topology on $S$, and relative to this topology, the projection $p$ is a local
homeomorphism. That is, for each basis neighborhood $V(f, U)$ in $S$, the one-to-one map
$p|_{V(f,U)}$ is a homeomorphism onto $U$. For if we let $p_{f,U}$ denote $p|_{V(f,U)}$, and if $N$
is an open subset of $U$, then $p_{f,U}^{-1}(N) = V(f, N)$ which is open in $S$, while if
$V(f', U') \subset V(f, U)$, then $p_{f,U}(V(f', U')) = U'$. The topology on $S$ is uniquely
determined by the requirement that the projection $p$ be a local homeomorphism.

This topological space $S$, together with the local homeomorphism $p$ which
projects $S$ onto $\mathbf{C}^n$, is called the sheaf of germs of holomorphic functions over the
base space $\mathbf{C}^n$. As the agricultural terminology implies, we think of the sheaf as
a bundle of stalks (Fig. 2), each with a full head of germs (or, if you wish, seeds,
or grain).


FIG. 2


We can show that the stalk space $S$ is Hausdorff as follows: points of $S$ may
differ either because they are on different stalks, or because they are on different
levels of the same stalk. In the first case, the projections of two points $p, q \in S$
differ in $\mathbf{C}^n$; so, since $\mathbf{C}^n$ is Hausdorff, there are disjoint neighborhoods of $p(p)$
and $p(q)$ which may be lifted back to $S$. To be specific, if $p = (z, [f]_z)$ and $q = (w,
[g]_w)$ where $z \ne w$, then there exist disjoint open neighborhoods $U_z$, $U_w$ of $z$ and
$w$ respectively and on them holomorphic functions $f \in [f]_z$ and $g \in [g]_w$, respectively,
so that $V(f, U_z)$ and $V(g, U_w)$ are disjoint neighborhoods of $p$ and $q$.

The second case is a bit more complex, since it depends on the uniqueness
property of holomorphic functions. If $p = (z, [f]_z)$ and $q = (z, [g]_z)$ are different
points on the same stalk, then $[f]_z \ne [g]_z$; so there must exist different holomorphic
functions $f \in [f]_z$ and $g \in [g]_z$ which are both defined on some connected
neighborhood $U$ of $z$. We claim that $V(f, U)$ and $V(g, U)$ are then disjoint
neighborhoods of $p$ and $q$, for if $(w, [h]_w) \in V(f, U) \cap V(g, U)$, then $w \in U$ and $[f]_w = [h]_w
= [g]_w$. But as we observed above, the uniqueness property of holomorphic
functions implies that the two functions $f$ and $g$ with the same domain $U$ which
belong to the same germ $[h]_w$ must be identical on $U$. But they are not identical
on $U$, since $[f]_z \ne [g]_z$. So $V(f, U) \cap V(g, U) = \emptyset$, and thus $S$ is Hausdorff.

There is still another consequence of the uniqueness property of holomorphic
functions that can be used to further illuminate the sheaf of germs of holo- 
morphic functions. The uniqueness property may be roughly interpreted as 
saying that the global behavior of a holomorphic function is uniquely deter- 
mined by its behavior on any open set. This makes meaningful the vague ques- 
tion of identifying the largest domain to which a given holomorphic function 
can be extended. In the classical study of analytic functions this question led 
to the concept of a Riemann surface, or more generally to complex analytic 
manifolds. 


FIG. 3


A manifold is, essentially, a topological space which is locally homeomorphic
to complex Euclidean $n$-space $\mathbf{C}^n$. To be more precise, we will call a topological
space $X$ locally Euclidean (of dimension $n$) if every $x \in X$ is contained in an open
set $U_x$ which is homeomorphic under a mapping $\tau_x$ to some subset of $\mathbf{C}^n$, where,
furthermore, the coordinate patches $U_x$ are coherent in the sense that for each
$x, y \in X$, $\tau_x \circ \tau_y^{-1}$ is a homeomorphism between $\tau_y(U_x \cap U_y)$ and $\tau_x(U_x \cap U_y)$. The
first half of this definition guarantees that $X$ is locally like $\mathbf{C}^n$, while the second
condition requires that the locally Euclidean patches overlap so as to form a
coherent Euclidean structure on all of $X$. We shall call each pair $(U_x, \tau_x)$ a local
coordinate system, since $\tau_x^{-1}$ lifts the coordinate system of $\mathbf{C}^n$ back to $U_x$ (Fig. 3).

Since the same topological space may be covered by several different collections
of coordinate systems $(U_x, \tau_x)$, and since we do not wish to distinguish
between two covers which provide essentially the same coordinate structure on
$X$, we define a manifold to be a locally Euclidean topological space in which the
collection of coordinate systems $(U_x, \tau_x)$ is maximal with respect to the defining
properties for a locally Euclidean space. Since each locally Euclidean space
generates a unique manifold, we shall often refer to locally Euclidean spaces as
manifolds even if the collection of local coordinate systems is not maximal. Other
types of manifolds may be produced by projecting to real Euclidean space $\mathbf{R}^n$
instead of to $\mathbf{C}^n$ or by requiring that the homeomorphisms $\tau_x \circ \tau_y^{-1}$ be analytic
or $C^\infty$ (infinitely differentiable); such manifolds are naturally called analytic
manifolds or $C^\infty$ manifolds.

A function $f: X \to Y$ from one analytic manifold to another is called holomorphic
if for each $x$ and $y$, $\tau_y \circ f \circ \tau_x^{-1}$ is holomorphic on its domain, which is
$\tau_x(U_x \cap f^{-1}(U_y))$. In the special case where $Y = \mathbf{C}^1$, the identity map $i: Y \to \mathbf{C}^1$
is used to define the local coordinate systems. So a holomorphic function $f$ from
the analytic manifold $X$ to $\mathbf{C}^1$ is characterized by the property that $f \circ \tau_x^{-1}$ is
holomorphic on $\tau_x(U_x)$.

It should be clear from this description that the sheaf of germs of holomorphic
functions can be regarded as an analytic manifold, using the projection
$p$ to define the local coordinate systems. It is a particularly important manifold,
since on it we can define what is known as the universal holomorphic function.
This is the mapping $F: S \to \mathbf{C}$ defined by $F((z, [f]_z)) = f(z)$. $F$ is clearly holomorphic
since for each local coordinate system $(U, p|_U)$, we have $F \circ (p|_U)^{-1}
= f|_{p(U)}$ where $f: p(U) \to \mathbf{C}$ is holomorphic.


FIG. 4


$F$ is universal in the sense that the behavior of all holomorphic functions on
$\mathbf{C}^n$ is subsumed in that of $F$. In particular, whenever $f: D \to \mathbf{C}$ is holomorphic
(where $D$ is a domain in $\mathbf{C}^n$), we can factor $f$ through the sheaf $S$ as follows:
there exists a unique function $\tilde{f}: D \to S$ such that $F \circ \tilde{f} = f$. Clearly $\tilde{f}$ is defined by
$\tilde{f}(z) = (z, [f]_z)$, so the associated diagram (Fig. 4) is commutative; $\tilde{f}$ is continuous
since $\tilde{f}^{-1}(V(f, U)) = U$.

With this structure, we can now describe the domain of holomorphy of a
given holomorphic function $f$, that is, the largest domain to which $f$ can be
uniquely extended. It is the connected component $E$ of $S$ which contains $\tilde{f}(D)$.
(Of course $\tilde{f}(D)$ is connected since it is the continuous image of a connected set.)
Although $D$ is not literally a subset of $E$, it is imbedded by $\tilde{f}$ in $E$, and thus the
universal holomorphic function $F$ is the extension to $E$ of the function $f$.


2. The sheaf of local rings. Let $A$ be a commutative ring with 1, and $S$ a
multiplicatively closed nonempty subset of $A$ with $0 \notin S$. We construct from $A$
and $S$ a ring $A_S$ called a ring of quotients of $A$, in which the elements of $S$ have
multiplicative inverses. On the set

$$A \times S = \{(a, s) \mid a \in A,\ s \in S\}$$

we define an equivalence relation $(a, s) \sim (b, t)$ if and only if there exists $r \in S$
such that $(at - bs)r = 0$. We also define two operations,

$$(a, s) + (b, t) = (at + bs, st) \quad \text{and} \quad (a, s)(b, t) = (ab, st),$$

which are compatible with the relation. We denote by $A_S$ the ring of equivalence
classes with the induced operations. As in the ring of integers with the set of
nonzero elements as $S$, the equivalence class of $(a, s)$ is denoted by $a/s$; thus we
call $S$ the set of denominators.
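
For a finite ring the construction can be carried out by brute force. The sketch below is an illustration only: it takes $A = \mathbf{Z}/n\mathbf{Z}$ with elements represented by `0, ..., n-1`, and the function names are hypothetical. It groups the pairs $(a, s)$ into equivalence classes under the relation above and lists the elements $a$ with $sa = 0$ for some $s \in S$.

```python
from itertools import product


def ring_of_quotients(n, S):
    """Equivalence classes of pairs (a, s), a in Z/nZ, s in S, under
    (a, s) ~ (b, t)  iff  (a*t - b*s)*r == 0 (mod n) for some r in S.

    S is assumed multiplicatively closed with 0 not in S, so that the
    relation is transitive and comparing with one representative suffices.
    """
    def equivalent(pair1, pair2):
        (a, s), (b, t) = pair1, pair2
        return any(((a * t - b * s) * r) % n == 0 for r in S)

    classes = []
    for pair in product(range(n), S):
        for cls in classes:
            if equivalent(pair, cls[0]):
                cls.append(pair)
                break
        else:
            classes.append([pair])
    return classes


def annihilated_by_S(n, S):
    """Elements a of Z/nZ with s*a == 0 for some s in S."""
    return sorted(a for a in range(n) if any((s * a) % n == 0 for s in S))
```

With `n = 6` and `S = [1, 2, 4]` (the powers of 2 modulo 6), the 18 pairs fall into three classes, and `annihilated_by_S` returns `[0, 3]`: inverting 2 forces 3 to become zero.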

The 0 of $A_S$ is $0/s$ (any $s$ in $S$), the identity is $s/s$; and if $s \in S$, $s^{-1} = 1/s$.
There is a homomorphism $\alpha: A \to A_S$ defined by $\alpha(a) = as/s$, which is independent
of the choice of $s$. Of course if $A$ is an integral domain, $\alpha$ is one-to-one because
$\operatorname{Ker} \alpha = \{a \mid sa = 0 \text{ for some } s \in S\}$.

If $I$ is an ideal of $A$, the ideal $\alpha(I)$ in $A_S$ can be represented by

$$\alpha(I) = \{a/s \mid a \in I,\ s \in S\},$$

and we shall write $IA_S$ for $\alpha(I)$. This function $\alpha$ on the set of ideals of $A$ defines
a one-to-one correspondence between the set of prime ideals in $A_S$ and the set of
prime ideals in $A$ whose intersection with $S$ is empty [12, p. 223].

Since we may describe a prime ideal $P$ in $A$ as one whose complement is
multiplicatively closed, we may form the ring of quotients of $A$ whose set of
denominators is the complement of $P$. We shall denote this ring of quotients by
$A_P$, and call it the local ring of $A$ at $P$. The ring $A_P$ has only one maximal ideal,
$PA_P$, since clearly $P$ is the largest prime ideal of $A$ with the property that its
intersection with the complement of $P$ is empty.

These local rings will be the stalks for the sheaf of local rings and the set of
prime ideals of $A$ will form the base space. This space is called the spectrum of
$A$, denoted by Spec $A$, and is topologized by taking as a basis for the topology
all sets $V_x = \{P \in \operatorname{Spec} A \mid x \notin P\}$ where $x \in A$. Then $V_1 = \operatorname{Spec} A$, $V_0 = \emptyset$, and
$V_x \cap V_y = V_{xy}$; thus $\{V_x\}_{x \in A}$ is a basis. Since $\bigcup_{x \in M} V_x = \{P \mid (x)_{x \in M} \not\subset P\}$ where
$(x)_{x \in M}$ is the ideal generated by the subset $M$ of $A$, any ideal $I$ of $A$ defines an
open set $V_I = \{P \mid I \not\subset P\}$, and every open set $U$ is of this form although $I$ is not
uniquely determined by $U$. A closed set, then, is a set of primes containing some
fixed ideal, so that a point $P$ in Spec $A$ is closed if and only if it is a maximal
ideal. For most rings, therefore, the base space is not even $T_1$; however, Spec $A$
is always $T_0$.
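
The basic open sets can be listed explicitly for a small ring. The following sketch assumes $A = \mathbf{Z}/n\mathbf{Z}$ with $n \ge 2$, whose prime ideals are the ideals $(p)$ for the primes $p$ dividing $n$; each prime ideal is represented simply by the prime $p$, and the function names are illustrative.

```python
def prime_divisors(n):
    """Distinct prime divisors of n >= 2, by trial division."""
    primes, d = [], 2
    while d * d <= n:
        if n % d == 0:
            primes.append(d)
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        primes.append(n)
    return primes


def spec(n):
    """Spec(Z/nZ): one point (p) for each prime p dividing n."""
    return frozenset(prime_divisors(n))


def basic_open(n, x):
    """V_x = { P in Spec A : x not in P }, i.e. the primes p | n that do not divide x."""
    return frozenset(p for p in spec(n) if x % p != 0)
```

For `n = 30` one can confirm directly that `basic_open(30, 1)` is all of `spec(30)`, that `basic_open(30, 0)` is empty, and that `basic_open(30, x) & basic_open(30, y)` equals `basic_open(30, x * y % 30)` for every pair `x, y`. In this example every prime ideal is maximal, so every point is closed; it therefore does not exhibit the failure of $T_1$ that the text describes for most rings.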

It is possible to define a ring of quotients with respect to the complement of
a prime $P$ because it is multiplicatively closed, but the complement of a union
of primes is also multiplicatively closed. Hence, with each nonempty open set
$U$ in Spec $A$ we may associate the ring of quotients $A_U = \{a/s \mid P \in U \Rightarrow s \notin P\}$
whose set of denominators is the complement of the union of all the primes in
$U$. If $U$ and $V$ are open sets in Spec $A$ such that $U \subset V$, we may define restriction
homomorphisms $\rho_{U,V}: A_V \to A_U$ as follows: if $a/s \in A_V$, $s$ is not an element of any
prime ideal $P$ in $V$, and so a fortiori not an element of any prime ideal $P$ in $U$.
Hence $a/s$ is also an element of $A_U$. We define $\rho_{U,V}(a/s) = a/s$, but this map is
not the identity, or even one-to-one, since the equivalence classes which are
used to define the ring $A_U$ are larger than those used to define $A_V$. The kernel of
$\rho_{U,V}$ consists of those elements $a/s$ such that $a$ is a zero divisor with respect to an
element in one of the prime ideals of $V$ which is not in any element of $U$.

An important property of $\rho_{U,V}$ is the commutativity of the diagram in Fig.
5, where $\alpha_U: A \to A_U$ takes $a$ to $a/1$. Now $\rho_{U,V}$ is uniquely determined by this
property and from this it follows that $\rho_{U,U}$ is the identity map and that if
$U \subset V \subset W$, then $\rho_{U,W} = \rho_{U,V} \circ \rho_{V,W}$. This system, consisting of Spec $A$, the rings
$A_U$, and maps $\rho_{U,V}: A_V \to A_U$ when $U \subset V$, is called a presheaf over Spec $A$.

FIG. 5    FIG. 6

Besides the maps $\rho_{U,V}$ corresponding to pairs of open sets for which $U \subset V$, we
can define maps $\rho_{P,V}: A_V \to A_P$ when $P \in V$. If $V$ is open and $P \in V$, we define
$\rho_{P,V}: A_V \to A_P$ by $\rho_{P,V}(a/s) = a/s$, which is possible since $a/s \in A_V$ implies $s \notin P$.
The map $\rho_{P,V}$ is the unique map which makes the associated diagram commute
(Fig. 6). As before, it follows from the uniqueness of $\rho_{P,V}$ that

$$\rho_{P,W} \circ \rho_{W,U} = \rho_{P,U} \quad \text{if } P \in W \subset U.$$


If $U$ is open in Spec $A$, and $u$ is any element of $A_U$, we may treat $u$ as a
function from $U$ to the stalk space $S = \bigcup\{A_P \mid P \in \operatorname{Spec} A\}$ by defining $u(P)
= \rho_{P,U}(u) \in A_P$ for $P \in U$. If $V \subset U$, we call $\rho_{V,U}(u)$ the restriction of $u$ to $V$. If
$u \in A_U$ and $v \in A_V$, and if there is an open set $W \subset U \cap V$ such that $\rho_{W,U}(u)
= \rho_{W,V}(v)$, we say $u$ and $v$ agree on the open set $W$, for if $P \in W$, then

$$u(P) = \rho_{P,U}(u) = \rho_{P,W} \circ \rho_{W,U}(u) = \rho_{P,W} \circ \rho_{W,V}(v) = \rho_{P,V}(v) = v(P).$$

If $u \in A_U$, either the function $u$ or its image $u(U)$ in $S$ is called a section of the
presheaf. The set of sections $u(U)$ covers $S$, for if $a/s \in A_P$ then $s$ is a denominator
in $A_V$, where $V = \{Q \mid s \notin Q\}$, and $a/s \in A_V$ is a section over $V$ whose image
at $P$ is $a/s$.

The collection of all these sections $u(U)$ is a basis for a topology on $S$, since
the intersection of two sections is a union of sections. For, suppose $u \in A_U$
and $v \in A_V$ are sections over the open sets $U$ and $V$. If $u(U) \cap v(V) = \emptyset$ there is
nothing to show. If $x \in u(U) \cap v(V)$, $x = \rho_{P,U}(u) = \rho_{P,V}(v) \in A_P$ for some $P \in U \cap V$.
Then $u = a/s$, where $s$ is not an element of any element of $U$, and $v = a'/s'$, where
$s'$ is not an element of any element of $V$. Since $a/s = a'/s'$ in $A_P$, there exists
$t \notin P$ such that $t(as' - a's) = 0$. Let $W = \{Q \mid t \notin Q\}$. Then $a/s = a'/s'$ in $A_{W \cap U \cap V}$
and, since the diagram in Fig. 7 commutes, the section over $W \cap U \cap V$ defined
by $a/s = a'/s'$ is a subset of $u(U)$ and of $v(V)$ and it is a neighborhood of $x$.


FIG. 7


In this topology the projection $p$ is a local homeomorphism, for if $x \in A_P$
there is a section $v$ over $V$ through $x$ for some $V$, and the restriction of $p$ to the
section $v(V)$ is one-to-one and onto. If $U \subset V$ is open and $P \in U$, the restriction
of $v$ to $U$ is also a section through $x$, and thus $u(U)$, where $u = \rho_{U,V}(v)$, is open in
$v(V)$; so $p$ is continuous. If $N$ is an open subset of $v(V)$ it is a union of sections
over open subsets of $V$ and $p(N)$ is the union of these open subsets.

The topological space $S$, together with the projection $p: S \to \operatorname{Spec} A$, is called
the sheaf of local rings over Spec $A$. If $U$ is an open subset of Spec $A$, any
continuous function $f: U \to S$ such that $p \circ f$ is the identity on $U$ is called a section
of the sheaf $S$; the set of all sections over the open set $U$ is denoted by $\Gamma(U, S)$.
The relation between the sections of the sheaf $S$ and the sections of the presheaf
(that is, the elements $u \in A_U$) is rather subtle, for even though each presheaf
section may be thought of as a (continuous) function on $U$ which is a local
inverse for $p$, two anomalies may occur. It may be that two different elements
$u = a/s$ and $v = b/t$ of the ring $A_U$ yield the same function under the interpretation
outlined above, or there may be functions in $\Gamma(U, S)$ which cannot be derived
from any section $u \in A_U$. Thus the interpretation map from $A_U$ to $\Gamma(U, S)$
need not be either one-to-one or onto, though if $U$ is a basis set (that is, one of
the form $V_x$ for some $x \in A$) this map is an isomorphism [5, No. 4, p. 86]. For
this reason, the open sets of the form $V_x$ are often called distinguished open sets.

If the base space Spec $A$ is not Hausdorff, certainly the sheaf $S$ cannot be
Hausdorff. But even if Spec $A$ is $T_2$ it may well happen (although it is difficult
to visualize it on Hausdorff paper) that two distinct sections over $U$, $u$ and $u'$,
may agree on a proper open subset $V$ of $U$ (Fig. 8). If $P$ is in the closure of $V$
but not in $V$, $u(P)$ and $u'(P)$ are distinct points of the sheaf but cannot be
separated because every neighborhood of $P$ contains points of $V$ on which $u$
and $u'$ agree.


FIG. 8


A special case. In order to provide a more geometric interpretation of the
sheaf of local rings as well as a glimpse of the origin of the subject, we shall look
at some special rings. Let $\mathbf{Q}$ be the field of rational numbers, $\mathbf{Q}[x_1, \ldots, x_n]$
the ring of polynomials in $n$ variables with rational coefficients, and $I$ an ideal
of $\mathbf{Q}[x_1, \ldots, x_n]$. Let $A$ be the quotient ring $\mathbf{Q}[x_1, \ldots, x_n]/I$, which we may
write $\mathbf{Q}[\bar{x}_1, \ldots, \bar{x}_n]$, where $\bar{x}_i = x_i + I$, $i = 1, \ldots, n$. If $E = \{(x_1, \ldots, x_n)
\in \mathbf{C}^n \mid p(x_1, \ldots, x_n) = 0 \text{ for all } p \in I\}$, i.e., $E$ is the intersection of the zero sets
of all polynomials in the ideal $I$, we may interpret $A$ as a ring of complex valued
functions on $E$. For since $p_1(x) - p_2(x) \in I$ whenever $p_1(\bar{x}) = p_2(\bar{x})$, $p_1(\bar{x}) = p_2(\bar{x})$
implies $p_1(x) = p_2(x)$ if $x = (x_1, \ldots, x_n) \in E$.

We may also construct, from the pairing which takes $p \in A$ and $x \in E$ to
$p(x) \in \mathbf{C}$, a one-to-one correspondence between the points in $E$ and homomorphisms
from $A$ to $\mathbf{C}$. If $x = (x_1, \ldots, x_n) \in E$ the function which takes $p$ to
$p(x)$ is a homomorphism from $A$ to $\mathbf{C}$, and conversely, if $\phi: A \to \mathbf{C}$ is a homomorphism,
$x = (\phi(\bar{x}_1), \ldots, \phi(\bar{x}_n)) \in E$, since if $p \in I$,

$$p(x) = p(\phi(\bar{x}_1), \ldots, \phi(\bar{x}_n)) = \phi(p(\bar{x}_1, \ldots, \bar{x}_n)) = \phi(0) = 0.$$

Since $\mathbf{C}$ is an integral domain, the kernel of the homomorphism defined by
$x \in E$ is a prime ideal in $A$. Conversely, it may be shown that if $P$ is an element
of Spec $A$ it is the kernel of some homomorphism from $A$ to $\mathbf{C}$ [13, p. 164]. So
we may regard Spec $A$ as a set of equivalence classes of $E$ under the relation
$x \sim x'$ if and only if $x$ and $x'$ are zeros of the same polynomials $p \in A$. Thus if
$x \sim x'$, $p(x) = p(x')$ for all $p$ in $A$, so $A$ may also be interpreted as a ring of functions
on Spec $A$.

That an element $p$ of $A$ not be in a prime $P$ means that $p$ is not in the kernel
of the map $A \to A/P$, or that $p$ is not zero on any point $x$ in $E$ for which the
kernel of the corresponding homomorphism is contained in $P$. Thus, the local
ring $A_P$ at $P$, which contains the inverses for all $p \notin P$, is the ring of all rational
functions defined locally, i.e., in some neighborhood of $P$ in Spec $A$. The set of
such functions which vanish at $P$ is the only maximal ideal in this ring. The
ring $A_U$ of presheaf sections over an open set $U \subset \operatorname{Spec} A$ is a ring of functions
defined at every point of $U$. (If $U$ is a distinguished open set, $A_U$ is the ring of
all such functions.) The kernel of a restriction map $\rho_{U,V}: A_V \to A_U$ (where $U \subset V$)
is the set of functions which vanish at every point of $U$, though they may not
vanish on all of $V$. The elements of $A_U$ which are not restrictions of functions
in $A_V$ are inverses of functions which have zeros in $V - U$.

The topology of Spec $A$ induces a topology in $E$, the weakest one in which
the identification map from $E$ to Spec $A$ is continuous. In this topology, the
closure of a point $x \in E$ is the set of all points in $E$ which satisfy the same
polynomials as $x$.

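We choose a particular ring $A$ to study in more detail. In $\mathbf{Q}[x, y]$, let $I
= (xy)$, and

$$A = \mathbf{Q}[x, y]/(xy) = \mathbf{Q}[\bar{x}, \bar{y}],$$

where $\bar{x}\bar{y} = xy + (xy) = (xy) = 0$. Then

$$E = \{(z, w) \in \mathbf{C}^2 \mid zw = 0\} = \{(z, 0) \mid z \in \mathbf{C}\} \cup \{(0, w) \mid w \in \mathbf{C}\},$$

the union of the complex coordinate axes (planes) in $\mathbf{C}^2$.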

The ideals $(\bar{x})$ and $(\bar{y})$ in $A$ are prime but not maximal, since $\mathbf{Q}[\bar{x}, \bar{y}]/(\bar{x})
\cong \mathbf{Q}[\bar{y}] \cong \mathbf{Q}[y]$, which is not a field. The points $(z, w)$ in $E$ whose corresponding
ideal is $(\bar{x})$ are those of the form $(0, w)$ where $w$ is transcendental over $\mathbf{Q}$, and
similarly, the points corresponding to the ideal $(\bar{y})$ are of the form $(z, 0)$ for
transcendental $z$, since the homomorphism of $\mathbf{Q}[\bar{x}, \bar{y}] \to \mathbf{C}$ given by $\bar{x} \to 0$, $\bar{y} \to w$
has kernel $(\bar{x})$ if and only if $w$ is transcendental. Further, $(\bar{x}) \cap (\bar{y}) = (0)$ in $A$,
any prime ideal contains either $(\bar{x})$ or $(\bar{y})$, and the set of zero divisors in $A$ is
$(\bar{x}) \cup (\bar{y})$.

The maximal prime ideals in $A$ are kernels of homomorphisms $\phi: \mathbf{Q}[\bar{x}, \bar{y}]
\to \mathbf{C}$ for which the image is a field, so that $\phi(\bar{x})$ and $\phi(\bar{y})$ must be algebraic,
and since $\phi(\bar{x})\phi(\bar{y}) = \phi(\bar{x}\bar{y}) = 0$, one of $\phi(\bar{x})$ and $\phi(\bar{y})$ must be zero, and the other
algebraic. Such points $(\phi(\bar{x}), \phi(\bar{y}))$ in $E$ are either of the form $(z, 0)$ with $z$
algebraic, or $(0, w)$ with $w$ algebraic. The kernels of the homomorphisms to
$(z, 0)$ and $(0, w)$ are the ideals $(p(\bar{x}), \bar{y})$ and $(\bar{x}, q(\bar{y}))$ where $p$ is the minimal
polynomial of $z$ (and $q$ of $w$). These ideals together with $(\bar{x})$ and $(\bar{y})$ are
precisely the points of Spec $A$.

The points of $E \subset \mathbf{C}^2$ are divided into equivalence classes corresponding to
the ideals in Spec $A$: to each of $(\bar{x})$ and $(\bar{y})$ there corresponds a class of
transcendental points, while to each maximal ideal $(p(\bar{x}), \bar{y})$ (where $p$ is irreducible)
there corresponds the set of algebraic points $\{(z, 0)\}$ where $z$ is a root of $p$.

In the topology induced on $E$ by Spec $A$, the closure of a transcendental
point $(z, 0)$ is the $z$-axis and the closure of an algebraic point $(z, 0)$ is all $(z', 0)$
where $z'$ is also a root of the minimal polynomial of $z$. A basis for the open sets
in Spec $A$ is the collection of all sets of the form $V_p = \{P \in \operatorname{Spec} A \mid p \notin P\}$. In
the special case $p = \bar{x}$,

$$V_{\bar{x}} = \{P \in \operatorname{Spec} A \mid \bar{x} \notin P\} = \{(\bar{y}, p(\bar{x})) \mid p \in A \text{ is irreducible or zero},\ p \ne \bar{x}\}.$$


The open set $V_{\bar{x}} \subset \operatorname{Spec} A$ corresponds in $E$ to the complement of the $w$-axis.
Similarly, $V_{\bar{y}}$ may be regarded as the complement of the $z$-axis. The complement
of a finite set of algebraic points corresponding to the maximal ideals

$$(\bar{x}, p_1(\bar{y})), \ldots, (\bar{x}, p_k(\bar{y})), (p_{k+1}(\bar{x}), \bar{y}), \ldots, (p_n(\bar{x}), \bar{y})$$

is $V_{p_1(\bar{y})} \cap \cdots \cap V_{p_n(\bar{x})}$. Since every polynomial $p \in A = \mathbf{Q}[\bar{x}, \bar{y}]$ can be written as
$p = p_1(\bar{x}) + p_2(\bar{y}) - c$, where $p_1(0) = p_2(0) = p(0, 0) = c$, the points in $V_p$ are $(z, 0)$
where $p_1(z) \ne 0$, and $(0, w)$ where $p_2(w) \ne 0$.

Geometrically, the ring $A = \mathbf{Q}[\bar{x}, \bar{y}]$ is the ring of polynomials defined everywhere
on $E$, the union of the complex axes. At a point $(z, w)$ in $E$ (where either
$z$ or $w$ is 0) corresponding to a prime ideal $P$, the local ring $A_P$ contains all
rational functions whose denominators do not vanish at $(z, w)$. The local ring
$A_{(\bar{x})}$ is the local ring at any $(0, w)$ where $w$ is transcendental. Algebraically, we
have the set of denominators $S = A - (\bar{x}) = \{p \mid p_2(\bar{y}) \ne 0\}$ where, as above,
$p = p_1 + p_2 - c$. The kernel $I$ of the homomorphism $\alpha: A \to A_{(\bar{x})}$ is $\{q \in A \mid \exists\, p \in S
\text{ such that } pq = 0\}$. If $p_1 + p_2 - c$ is a zero divisor, $c = 0$; if such an element is also in $S$,
we have $p_2(\bar{y}) \ne 0$ and $p_2(0) = 0$, so it is an element of $(\bar{y}) - \{0\}$. Therefore
$I = (\bar{x})$. Thus $A/I = \mathbf{Q}[\bar{x}, \bar{y}]/(\bar{x}) = \mathbf{Q}[\bar{y}]$ and the image of $S$ in $\mathbf{Q}[\bar{y}]$ is $\mathbf{Q}[\bar{y}]
- \{0\}$; so $A_{(\bar{x})}$ is isomorphic to the field of rational functions in $y$. Similarly
$A_{(\bar{y})}$ is isomorphic to $\mathbf{Q}(x)$, the field of rational functions in $x$.

For the local ring at a maximal prime other than $(\bar{x}, \bar{y})$, for instance $P = (\bar{x},
\bar{y}^2 - 2)$, a similar argument will lead to the ring $A_P = \{p/q \in \mathbf{Q}(y) \mid (y^2 - 2) \text{ does
not divide } q\}$. That is, $A_P$ contains inverses for all functions that do not vanish
at $(0, \sqrt{2})$, while two polynomials in $\bar{x}$ and $\bar{y}$ which agree on an open set containing
$(0, \sqrt{2})$ are identified in $A_P$. Thus the map $\alpha: A \to A_P$ is neither one-to-one
nor onto. Now if $P = (\bar{x}, \bar{y})$, the set of denominators for $A_P$ is exactly the
set of polynomials $p_1 + p_2 - c$ for $c \ne 0$. So in this case the map $\alpha: A \to A_P$ is an
inclusion which is not onto.

Finally, we describe the rings of presheaf sections $A_U$, for open sets $U$. If $U$
is the complement of a finite closed set of maximal primes $(\bar{x}, p_1(\bar{y})), \ldots,
(\bar{x}, p_k(\bar{y})), (p_{k+1}(\bar{x}), \bar{y}), \ldots, (p_n(\bar{x}), \bar{y})$ the set of denominators for $A_U$ is
$\{p \in A \mid p \notin \bigcup_{P \in U} P\}$, which is the set of all products $p_1^{r_1} \cdots p_n^{r_n}$ with nonzero
constant term. The map from $A_U \to A_P$ when $P \in U$ will have kernel $(\bar{x})$ if
$P = (\bar{x}, p(\bar{y}))$ (where $p$ is any irreducible polynomial except $\bar{y}$); $(\bar{y})$ if $P = (\bar{y})$ or
$(\bar{y}, p(\bar{x}))$ (where $p(x) \ne x$); and $(0)$ if $P = (\bar{x}, \bar{y})$.

If $U = V_{\bar{x}}$ is the complement of the $w$-axis, the set of denominators for $A_U$
is the set of polynomials $\bar{y}p(\bar{y}) + c\bar{x}^n$, for $p \in \mathbf{Q}[\bar{y}]$ and $n \ge 0$. Thus

$$A_U = \{p(x)/x^n \mid p \in \mathbf{Q}[x]\}.$$


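Since $A_{(\bar{y})} \cong \mathbf{Q}(x)$, the map $\rho_{(\bar{y}),U}: A_U \to A_{(\bar{y})}$ from the ring of sections to the stalk
at $(\bar{y})$ is one-to-one but not onto. For a maximal prime $(\bar{y}, p) \in V_{\bar{x}}$, where $p(x) \ne x$,
$\rho_{(\bar{y},p),U}: A_U \to A_{(\bar{y},p)}$ is an inclusion since $x$ does not divide $p$. If $U$ is the complement
of the closed set $(\bar{x}, p_1(\bar{y})), \ldots, (\bar{x}, p_k(\bar{y}))$, then $V_{\bar{x}} \subset U$, and the
restriction map $\rho_{V_{\bar{x}},U}: A_U \to A_{V_{\bar{x}}}$ has kernel $(\bar{y})$.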

The ring of sections over Spec $A$ is $A$, for every nonconstant element of $A$ vanishes at
some point in Spec $A$. Thus for every kind of open set $U$ in Spec $A$, the elements
of the rings $A_U$ are the rational functions defined at every point of $U$.

In particular the functions # and #-++¥ are sections on the open set Spec A. 
They agree on the proper open subset Vz, for (¥) is the kernel of the restriction 
MAD Pvz, Spec A, but do not agree on (#, ¥), which is in the closure of Vz. Thus the 
points and +4 in Ags, though distinct, cannot be separated by sections 
since any open set containing (#, 7) must intersect Vz. Thus this sheaf fails to 
be Hausdorff both vertically as well as horizontally. 


3. The sheaf of differential forms. Let $p \in X$ where $X$ is a $C^\infty$ manifold over
$\mathbf{R}^n$. We denote by $C_p^\infty$ the set of all functions from $X$ to the real line $\mathbf{R}^1$ which are
$C^\infty$ in some neighborhood of $p$. Clearly $C_p^\infty$ is a vector space over $\mathbf{R}$ in which the
sum $f + g$ is defined on the intersection of the domains of $f$ and $g$.

A tangent to $X$ at $p$ is a linear function $t: C_p^\infty \to \mathbf{R}^1$ such that


$$t(fg) = t(f) \cdot g(p) + f(p) \cdot t(g) \quad \text{whenever } f, g \in C_p^\infty.$$


The set of all tangents to $X$ at $p$ forms a vector space over $\mathbf{R}$ which we call $X_p$,
the tangent space to $X$ at $p$. If $\gamma: [0, 1] \to X$ is a $C^\infty$ function such that $\gamma(t_0) = p$,
and if $f \in C_p^\infty$, then $f \circ \gamma: [0, 1] \to X \to \mathbf{R}^1$. If $(f \circ \gamma)'$ is the derivative of $f \circ \gamma$,
the function $\gamma_*: C_p^\infty \to \mathbf{R}^1$ defined by $\gamma_*(f) = (f \circ \gamma)'(t_0)$ is a tangent. To interpret
this geometrically, we consider a particular local coordinate system $(U_p, \tau_p)$ where


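$\tau_p: U_p \to \mathbf{R}^n$. If $\pi_i: \mathbf{R}^n \to \mathbf{R}^1$ is the projection function defined by $\pi_i(t_1, \dots, t_n) = t_i$,
we write $x_i = \pi_i \circ \tau_p$; then $\tau_p(p) = (x_1(p), \dots, x_n(p)) \in \mathbf{R}^n$. If
$e_i = (0, 0, \dots, 1, \dots, 0) \in \mathbf{R}^n$, we think of the curve $\gamma(t) = \tau_p^{-1}(\tau_p(p) + t e_i)$ as
the $i$th coordinate axis in $U_p$ since $(d/dt)(\tau_p \circ \gamma)|_0 = e_i$. Then the tangent $\gamma_*$
satisfies


$$\gamma_*(f) = (f \circ \gamma)'(0) = \frac{d}{dt}\bigl(f \circ \tau_p^{-1} \circ \tau_p \circ \gamma\bigr)\Big|_{0} = \frac{\partial (f \circ \tau_p^{-1})}{\partial t_i}\Big|_{\tau_p(p)}.$$


Thus $\gamma_*(f)$ is usually denoted by $(\partial/\partial x_i)f$; the tangents $(\partial/\partial x_1), \dots, (\partial/\partial x_n)$
form a basis for the vector space $X_p$ [8, p. 7]. We think of $X_p$ as an $n$-dimensional
hyperplane tangent to $X$ at $p$ (Fig. 9). If


$$\gamma_* = \sum_i a_i \frac{\partial}{\partial x_i},$$


then $\gamma_*(f)$ is $a \cdot \operatorname{grad} f$, that is, the derivative of $f$ in the direction $a = (a_1, \dots, a_n) \in \mathbf{R}^n$.


FIG. 9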


If $\phi: X \to Y$ is a $C^\infty$ mapping of manifolds we define the differential of $\phi$ to be
the linear map $d\phi: X_p \to Y_{\phi(p)}$ defined as follows: if $t \in X_p$ and $f \in C_{\phi(p)}^\infty$ then
$[d\phi(t)](f) = t(f \circ \phi)$. If $(U_p, \tau_p)$ is a local coordinate system at $p \in X$ then
$x_i = \pi_i \circ \tau_p$ defines a $C^\infty$-map from the submanifold $U_p$ to the manifold $\mathbf{R}^1$. Hence
the differential $dx_i$ of $x_i$ is a linear transformation from $X_p$ to $\mathbf{R}^1_{x_i(p)}$. Since $\mathbf{R}^1_t$ is
isomorphic to $\mathbf{R}^1$ for any $t \in \mathbf{R}^1$, we may consider $dx_i$ as an element of the dual
vector space $X_p^*$ of $X_p$. Since $dx_i(\partial/\partial x_j) = \delta_{ij}$ (where $\delta_{ij} = 0$ if $i \neq j$, and 1 if $i = j$),
the differentials $dx_1, \dots, dx_n$ form a basis of $X_p^*$ dual to the basis
$\partial/\partial x_1, \dots, \partial/\partial x_n$ of $X_p$.

Whereas each differential $dx_i$ is a function of just one variable, a differential
form in general is a function of several variables. To be precise, a differential
$k$-form $\theta$ on $X_p$ is an alternating $k$-linear function from $k$-tuples of elements of $X_p$
to $\mathbf{R}^1$. ($\theta$ is $k$-linear if it is linear in each variable separately, and alternating if
$\theta(t_1, \dots, t_k) = \operatorname{sgn}\sigma \cdot \theta(t_{\sigma(1)}, \dots, t_{\sigma(k)})$ where $\sigma: \{1, \dots, k\} \to \{1, \dots, k\}$ is a
permutation and $\operatorname{sgn}\sigma$ is $+1$ if $\sigma$ is even, and $-1$ if $\sigma$ is odd.) We denote by the wedge
product $dx_{i_1} \wedge \cdots \wedge dx_{i_k}$ the unique $k$-linear form on $X_p$ defined on a basis for
the set of $k$-tuples by


$$(dx_{i_1} \wedge \cdots \wedge dx_{i_k})\Bigl(\frac{\partial}{\partial x_{j_1}}, \dots, \frac{\partial}{\partial x_{j_k}}\Bigr) = \delta,$$


where $\delta = 1$ if $(i_1, \dots, i_k) = (j_1, \dots, j_k)$ and 0 otherwise. The set of all
$k$-linear forms on $X_p$ is an $n^k$-dimensional vector space over $\mathbf{R}$ and the $n^k$ forms
$dx_{i_1} \wedge \cdots \wedge dx_{i_k}$, where $i_j \in \{1, \dots, n\}$, form a basis for this vector space.
The set of differential $k$-forms is a subspace of the set of all $k$-linear forms and it
has as a basis the $k$-linear forms $dx_{i_1} \wedge \cdots \wedge dx_{i_k}$ where $i_1 < i_2 < \cdots < i_k$. We
call such a sequence of indices an increasing $k$-tuple. Thus the dimension of the
space of differential $k$-forms is $\binom{n}{k}$ if $k \leq n$ and 0 if $k > n$.
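
A small worked case may help fix the count. The choice $n = 3$ below is only an illustration, and the symbol $\Lambda^k$ for the space of differential $k$-forms at $p$ is a label assumed here, not notation from the text. Listing the increasing $k$-tuples gives the bases directly:

```latex
% Bases of differential k-forms at p for n = 3, indexed by increasing k-tuples.
% \Lambda^k is an assumed shorthand for the space of differential k-forms on X_p.
\begin{align*}
k=0:&\quad \{1\}                                                          &\dim \Lambda^0 &= \tbinom{3}{0} = 1\\
k=1:&\quad \{dx_1,\; dx_2,\; dx_3\}                                       &\dim \Lambda^1 &= \tbinom{3}{1} = 3\\
k=2:&\quad \{dx_1\wedge dx_2,\; dx_1\wedge dx_3,\; dx_2\wedge dx_3\}      &\dim \Lambda^2 &= \tbinom{3}{2} = 3\\
k=3:&\quad \{dx_1\wedge dx_2\wedge dx_3\}                                 &\dim \Lambda^3 &= \tbinom{3}{3} = 1\\
k\ge 4:&\quad \varnothing                                                 &\dim \Lambda^k &= 0
\end{align*}
% By contrast, the space of all 2-linear forms on X_p has dimension 3^2 = 9.
```

The total $1 + 3 + 3 + 1 = 2^3$ reflects the fact that an increasing tuple is the same thing as a subset of $\{1, 2, 3\}$.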

Let $S_k$ denote the set of all increasing $k$-tuples of positive integers less than
or equal to $n$. For $s \in S_k$ we denote by $dx_s$ the $k$-form $dx_{i_1} \wedge \cdots \wedge dx_{i_k}$, where
$s = (i_1, \dots, i_k)$. Now let $U \subset X$ be an open set and $\theta$ a function which assigns to
each $p \in U$ a differential $k$-form at $p$, $\theta(p)$. Thus, once having chosen a local
coordinate system at $p$, so the forms $dx_1, dx_2, \dots, dx_n$ are defined, we may
write


$$\theta(p) = \sum_{s \in S_k} a_s(p)\,dx_s,$$


where each $a_s(p)$ is the coefficient in the expansion of $\theta(p)$ in terms of the basis
$\{dx_s\}_{s \in S_k}$. Thus $a_s$ may be considered as a function from $U$ to $\mathbf{R}^1$. Since each
$\theta(p)$ is a differential $k$-form at $p$, we call $\theta$ a $C^\infty$ $k$-form on $U$ if each function $a_s$ is
$C^\infty$; we denote by $\Omega^k(U)$ the real vector space of all $C^\infty$ $k$-forms on $U$. If $U \subset V$
then there is a linear transformation $\rho_{U,V}: \Omega^k(V) \to \Omega^k(U)$ defined by restriction
of the domains of $\theta \in \Omega^k(V)$. The collection $\{\Omega^k(U)\}$ for $U$ open in $X$ together
with the linear transformations $\rho_{U,V}$ is called the presheaf of differential $k$-forms.

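Since every paracompact manifold has a $C^\infty$ partition of unity subordinate
to any open covering $\{U_\alpha\}$ [8, p. 85], presheaves over such manifolds satisfy the
following special property: If $U = \bigcup U_\alpha$ where $U_\alpha$ is open in $X$, and if $\theta_\alpha \in \Omega^k(U_\alpha)$
are coherent in the sense that the restrictions to $U_\alpha \cap U_\beta$ of $\theta_\alpha$ and $\theta_\beta$ agree
(whenever $U_\alpha \cap U_\beta \neq \emptyset$) then there exists a unique $\theta \in \Omega^k(U)$ whose restriction
to each $U_\alpha$ is $\theta_\alpha$. Indeed, let $\{f_\alpha\}$ be a $C^\infty$ partition of unity for $\{U_\alpha\}$, so that
$f_\alpha: U \to \mathbf{R}^1$, $f_\alpha$ vanishes off $U_\alpha$, and $\sum f_\alpha = 1$; then we may define $\theta$ to be $\sum f_\alpha \theta_\alpha$.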
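
The claim that $\sum f_\alpha \theta_\alpha$ restricts to $\theta_\beta$ on each $U_\beta$ can be checked pointwise. The following is a sketch under two assumptions that the text leaves implicit: each product $f_\alpha \theta_\alpha$ is extended by zero outside $U_\alpha$, and the partition of unity is locally finite, so that the sum is finite near each point.

```latex
% Pointwise check that the glued form agrees with \theta_\beta on U_\beta.
% Assumptions: f_\alpha\theta_\alpha is extended by 0 off U_\alpha; the sum is locally finite.
\begin{align*}
\theta(p) &= \sum_{\alpha} f_\alpha(p)\,\theta_\alpha(p)
             && p \in U_\beta \\
          &= \sum_{\alpha:\, f_\alpha(p) \neq 0} f_\alpha(p)\,\theta_\alpha(p)
             && \text{terms with } f_\alpha(p) = 0 \text{ drop out} \\
          &= \sum_{\alpha:\, f_\alpha(p) \neq 0} f_\alpha(p)\,\theta_\beta(p)
             && f_\alpha(p) \neq 0 \Rightarrow p \in U_\alpha \cap U_\beta,\ \text{so } \theta_\alpha(p) = \theta_\beta(p) \\
          &= \Bigl(\sum_{\alpha} f_\alpha(p)\Bigr)\theta_\beta(p) = \theta_\beta(p)
             && \textstyle\sum_\alpha f_\alpha = 1 .
\end{align*}
```

Uniqueness needs no partition of unity: two forms on $U$ with the same restriction to every $U_\alpha$ take the same value at every point of $U$.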

Any such presheaf is called a sheaf, so when $X$ is paracompact, we will call
the system $\{\Omega^k(U), \rho_{U,V}\}$ the sheaf of differential $k$-forms. If $\Omega^k(p)$ is the set of
differential $k$-forms at $p$, we may think of $\Omega^k(p)$ as the stalks of the sheaf $S$:
$S = \bigcup_{p \in X} \Omega^k(p)$. The projection $\rho: S \to X$ assigns to each differential form at $p$
the point $p$. The $C^\infty$ $k$-form $\theta \in \Omega^k(U)$ is a section of $S$, and the collection
$\{\theta(U) \subset S \mid U \text{ is open in } X\}$ forms a basis for the topology on $S$.


4. Sheaves: General definition. Each of the three previous examples reflects
a different facet of the general concept of a sheaf. To emphasize this
variety and to provide a coherent framework for the subsequent general definition,
we summarize in Fig. 10 the three sheaves already discussed.

                          | Sheaf of Germs of Holomorphic Functions | Sheaf of Local Rings                                | Sheaf of Differential Forms
Germ                      | $[f]_z$                                 | $a/s \in A_P$                                       |
Stalk Space               | $S = \{(z, [f]_z)\}$                    | $S = \bigcup \{A_P \mid P \in \mathrm{Spec}\,A\}$   |
Projection                | $\rho: S \to \mathbf{C}^1$              | $\rho: S \to \mathrm{Spec}\,A$                      |
Stalk                     | $\{(z, [f]_z) \mid f \in A_z\}$         | $A_P$ (local ring)                                  |
Base Space                | $\mathbf{C}^1$                          | $\mathrm{Spec}\,A = \{P \mid P$ is prime ideal in $A\}$ | $X = C^\infty$ manifold
Presheaf Section          |                                         | $a/s \in A_U$                                       | $\theta \in \Omega^k(U)$
Restriction Homomorphism  |                                         | $\rho_{U,V}: A_V \to A_U$                           | $\rho_{U,V}: \Omega^k(V) \to \Omega^k(U)$

FIG. 10

The sheaf of germs of holomorphic functions was defined to be the stalk
space $S$ together with a topology and a local homeomorphism $\rho$ onto $\mathbf{C}^1$. The
sheaf of local rings was defined similarly, though in this case we also identified
the system consisting of the rings $A_U$ and the restriction homomorphisms
$\rho_{U,V}: A_V \to A_U$ as a presheaf of rings. In the third case, the sheaf of differential
$k$-forms was simply the presheaf of differential $k$-forms whenever the base space
$X$ was paracompact. The recognition of the equivalence of these two descriptions
constitutes the beginning of sheaf theory. We now introduce definitions
to formalize these two approaches and prove the definitions equivalent.


DEFINITION I. Let $(X, \tau)$ be a topological space, and let $\mathcal{C}$ be a class of similar
mathematical objects (e.g., abelian groups, modules, rings). Let $F$ be a
function from $\tau$ to $\mathcal{C}$ and suppose for each pair $U, V \in \tau$ for which $U \subset V$ there
is a map (e.g., a homomorphism, module homomorphism, or isomorphism)
$\rho_{U,V}: F(V) \to F(U)$ which preserves the structure of the objects of $\mathcal{C}$. If $\rho_{U,V} \circ \rho_{V,W} = \rho_{U,W}$
whenever $U \subset V \subset W$, and if $\rho_{U,U}$ is the identity function on $F(U)$, we define
the function $F$ together with the restriction maps $\rho_{U,V}$ to be a presheaf over $X$.
The elements of $F(U)$ are called sections of $F$ over $U$. (In the language of category
theory, a presheaf is a contravariant functor from the category of open
sets and inclusion maps of $X$ to some category of objects and morphisms.) A
sheaf is a presheaf which satisfies the following two coherence axioms:


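1. If $\{U_\alpha\}$ is a family of open sets in $X$, if $U = \bigcup U_\alpha$, and if the sections $s$,
$t \in F(U)$ agree on each $U_\alpha$ (i.e., if $\rho_{U_\alpha,U}(s) = \rho_{U_\alpha,U}(t)$ for each $\alpha$), then $s = t$.


2. If $\{U_\alpha\}$ is a family of open sets in $X$, if $U = \bigcup U_\alpha$, and if the sections
$s_\alpha \in F(U_\alpha)$ are coherent in the sense that the restrictions of $s_\alpha$ and $s_\beta$ to $U_\alpha \cap U_\beta$
agree (i.e., if $\rho_{U_\alpha \cap U_\beta,U_\alpha}(s_\alpha) = \rho_{U_\alpha \cap U_\beta,U_\beta}(s_\beta)$ whenever $U_\alpha \cap U_\beta \neq \emptyset$) then there exists
a section $s \in F(U)$ such that for each $\alpha$, $\rho_{U_\alpha,U}(s) = s_\alpha$.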
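
To see that the second axiom is a genuine restriction, it may help to look at one presheaf that fails it. The example below is an illustration chosen here, not one of the three examples of the text; the names $B$, $U_n$ and $s_n$ are assumptions of the sketch.

```latex
% A presheaf satisfying axiom 1 but not axiom 2 (illustrative example).
% B(U) := bounded continuous real-valued functions on an open set U of the real line,
% with the usual restriction of functions as restriction maps.
\begin{align*}
&U_n = (-n,\,n), \qquad s_n \in B(U_n), \quad s_n(x) = x \quad (|s_n| < n \text{ on } U_n), \\
&\rho_{U_n \cap U_m,\,U_n}(s_n) = \rho_{U_n \cap U_m,\,U_m}(s_m)
   \quad\text{(both are } x \mapsto x \text{ on the overlap)}, \\
&U = \bigcup_n U_n = \mathbf{R}, \qquad
   \rho_{U_n,U}(s) = s_n \ \text{for all } n \;\Longrightarrow\; s(x) = x \ \text{on } \mathbf{R}, \\
&\text{but } x \mapsto x \text{ is unbounded on } \mathbf{R}, \text{ so no such } s \in B(\mathbf{R}) \text{ exists.}
\end{align*}
```

Axiom 1 holds for $B$ because its sections are honest functions; axiom 2 fails because boundedness is not a local condition.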
DEFINITION II. Let $(X, \tau)$ be a topological space, and let $\mathcal{C}$ be a class of
similar mathematical objects (e.g., abelian groups, modules, rings). A sheaf over
$X$ is a triple $\{S, \pi, X\}$, where $S$ is a topological space and $\pi: S \to X$ is a local
homeomorphism (i.e., a map such that each point $s \in S$ has a neighborhood $U$
on which $\pi|_U$ is a homeomorphism) such that each stalk $\pi^{-1}(x) \in \mathcal{C}$, and each
operation is continuous as a function from $\bigcup_{x \in X}(\pi^{-1}(x) \times \pi^{-1}(x))$ (with the
topology induced from $S \times S$) to $\bigcup_{x \in X}\pi^{-1}(x) = S$.


We shall call a sheaf of type I a sheaf of sections, and a sheaf of type II a
sheaf of germs. The relationship between these two types of sheaves is precisely
as illustrated by the preceding examples.

To be specific, suppose $F$ is a presheaf (of sections) over $X$; we construct
the corresponding sheaf of germs by defining a germ at $x \in X$ to be an equivalence
class of $A_x = \bigcup_{x \in U} F(U)$ under the relation $s \in F(U) \sim t \in F(V)$ if $\rho_{W,U}(s) = \rho_{W,V}(t)$
for some open $W \subset U \cap V$ containing $x$. Thus the germ of a section $s \in F(U)$ at a point
$x \in U$ is the collection of all sections $t \in F(V)$ which agree with $s$ on some
neighborhood of $x$. We denote, as usual, the germ of $s$ at $x$ by $[s]_x$, and let the stalk
space $S$ be $\{(x, [s]_x) \mid s \in F(U), \text{ where } x \in U\}$. The topology on $S$ is generated
by neighborhoods of the form $V(s, U) = \{(x, [s]_x) \mid x \in U\}$, so the projection
$\pi: S \to X$ becomes a local homeomorphism. By interpreting the sections as
functions, the so-called restriction maps $\rho_{U,V}$ really are restrictions and the
topology on $S$ is the strongest relative to which the sections are continuous.
(The topology on $S$ can also be characterized as the quotient of the topology
on $\bigcup_{U \in \tau}(U \times F(U))$ under the equivalence relation induced by $\sim$, where each
$U$ carries the subspace topology and $F(U)$ is discrete.)

Each stalk $\pi^{-1}(x)$ clearly inherits the operations of the $F(U)$ and each such
operation is continuous. For example, if each $F(U)$ is an abelian group under
addition and if $[s]_x$ and $[t]_x \in \pi^{-1}(x)$, then $[s]_x + [t]_x$ is defined to be $[s + t]_x$. If
$V(s + t, U)$ is a neighborhood of $[s + t]_x$, then the inverse image of $V(s + t, U)$
under $+$ contains $\{(r, q) \mid r \in V(s, U),\ q \in V(t, U),\ \pi(r) = \pi(q)\}$, an open set in
$\bigcup_{x \in X}(\pi^{-1}(x) \times \pi^{-1}(x))$. Thus $\{S, \pi, X\}$ is indeed a sheaf of germs.

Conversely, suppose $\{S, \pi, X\}$ is a sheaf of germs (perhaps one constructed
as above from some presheaf). If $U$ is open in $X$, let $F(U)$ be the collection of
continuous functions $s: U \to S$ such that $\pi \circ s$ is the identity on $U$. $F(U)$ inherits
the algebraic structure from the stalks by pointwise definitions, and the restriction
maps $\rho_{U,V}$ are just that: the restriction of $s$ from $V$ to the subset $U$.
The first coherence axiom for sheaves is satisfied trivially since the sections are
functions, and the second is satisfied since the $F(U)$ contain all continuous functions
from $U$ to $S$ which are inverses of $\pi$.

Now if $\{F, \pi, X\}$ is a sheaf of germs, the sheaf derived from its (pre)sheaf
of sections is canonically isomorphic to $F$. However, if $S$ is a presheaf of sections,
the presheaf of sections $S'$ associated with the sheaf of germs derived from $S$ is
generally different from $S$, for since $S'$ is a sheaf, it may have more sections than
$S$ (in order to satisfy the second coherence axiom), while some sections which
were distinct in $S$ may be identified in $S'$ (because of the first axiom). Of course,
if $S$ is a sheaf, then $S'$ is naturally isomorphic to $S$, so in this sense the two definitions
of a sheaf are essentially equivalent.

A common and convenient alternative to the construction of equivalence
classes is the use of direct limits. If $F$ is a presheaf of sections over $X$, and if
$x \in X$, the restriction of $F$ to the neighborhoods of $x$ forms a directed system
$(\{F(U)\}_{x \in U}, \{\rho_{U,V}\}_{x \in U \subset V})$ since $\rho_{U,V} \circ \rho_{V,W} = \rho_{U,W}$ whenever $x \in U \subset V \subset W$.
A few elements of such a system may be represented by the commutative diagram
of Fig. 11.


[Diagram: several groups $F(W_i)$, $F(V_i)$, $F(U)$ joined by the restriction maps between them.]


FIG. 11


The direct limit of this system is, roughly speaking, the first object which
can appear to the right of the diagram. Specifically, an object $F_x$ together
with maps $\rho_U: F(U) \to F_x$ for each $F(U)$ is called a direct limit of the system
$(\{F(U)\}_{x \in U}, \{\rho_{U,V}\}_{x \in U \subset V})$ provided that


(i) whenever $U \subset V$ the diagram in Fig. 12 commutes,

(ii) $F_x$ is universal with respect to property (i); that is, if $(G_x, \{\sigma_U\})$ also
satisfies property (i), then there exists a unique map $\eta: F_x \to G_x$ such that for
each $\sigma_U: F(U) \to G_x$, $\sigma_U = \eta \circ \rho_U$.


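[Diagram: the triangle formed by $\rho_V: F(V) \to F_x$, $\rho_{U,V}: F(V) \to F(U)$, and $\rho_U: F(U) \to F_x$, so that $\rho_V = \rho_U \circ \rho_{U,V}$.]

FIG. 12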


Condition (i) makes explicit the idea of “appearing on the right of the diagram”
while condition (ii) asserts that $F_x$ is the first such object. It follows
trivially from these conditions that the direct limit is unique (up to isomorphism),
so we denote it by $\varinjlim_{x \in U} F(U)$.

Consider now the stalk of germs at $x$ derived from the presheaf $F$. If $\rho_U$
denotes the map from $F(U)$ to the stalk $\pi^{-1}(x)$ defined by $\rho_U(s) = (x, [s]_x)$
(where $x \in U$), then $\pi^{-1}(x) = \varinjlim_{x \in U} F(U)$ since whenever $x \in U \subset V$, $\rho_V = \rho_U \circ \rho_{U,V}$
and $\pi^{-1}(x)$ is universal with respect to that property. To prove this last assertion
we assume that $(G, \{\sigma_U\})$ is another direct limit and observe that if both
$t_1 \in F(U_1)$ and $t_2 \in F(U_2)$ are in the same germ $[s]_x$, then $\sigma_{U_1}(t_1) = \sigma_{U_2}(t_2)$, for by
the definition of $[s]_x$, there exists some $V \subset U_1 \cap U_2$ such that $\rho_{V,U_1}(t_1) = \rho_{V,U_2}(t_2)$.
Then by the commutativity of the diagram in Fig. 13, we have


$$\sigma_{U_1}(t_1) = \sigma_V \circ \rho_{V,U_1}(t_1) = \sigma_V \circ \rho_{V,U_2}(t_2) = \sigma_{U_2}(t_2).$$


[FIG. 13: commutative diagram relating $\sigma_{U_1}$, $\sigma_{U_2}$, $\sigma_V$ and the restriction maps $\rho_{V,U_1}$, $\rho_{V,U_2}$.]


Thus each element $[s]_x$ of $\pi^{-1}(x)$ is mapped to some point of $G$ by the appropriate
$\sigma_U \circ \rho_U^{-1}$; we call this point $\eta(x, [s]_x)$ and thereby define the required unique
map from $\pi^{-1}(x)$ to $G$.

So we may summarize all three approaches in one sweeping generalization:
for each $x \in X$, the stalk over $x$ of the sheaf of germs is the direct limit of the
restriction of the presheaf to the neighborhoods of $x$.

We close this section with a fourth characterization of sheaves, this one also
based on a universal property, and illustrated to some extent in the previous
examples. Suppose $\mathfrak{F}$ is a class of functions defined on open subsets of a topological
space $X$. If $F(U)$ is the collection of all $f \in \mathfrak{F}$ whose domain is $U$, and if
$\rho_{U,V}$ is the restriction map (i.e., $\rho_{U,V}(f) = f|_U$ whenever $U \subset V$ and $f \in F(V)$), the
collection $\{F(U), \rho_{U,V}\}$ is a presheaf. If this presheaf is a sheaf $S$ (as it will be if
$\mathfrak{F}$ is the set of holomorphic functions on open subsets of $\mathbf{C}^1$, or the set of differential
forms on open subsets of a paracompact manifold $X$) then we can define a
universal continuous function $\Phi: S \to R$ of type $\mathfrak{F}$ so that each $f \in F(U)$ factors
uniquely through the sheaf $S$: that is, there exists a unique $\bar f: U \to S$ such that
$f = \Phi \circ \bar f$.

In general, any object $S$ and map $\Phi: S \to R$ with the property that each function
$f: U \to R$ factors uniquely through $S$ via $\Phi$ is called universal with respect
to the characterizing properties of the functions $f$. The pair $(S, \Phi)$ is uniquely
determined (up to isomorphism) by this property. Thus, for instance, the sheaf
of holomorphic functions is characterized by being the unique universal object
for the family of holomorphic functions.

Since $\bar f^{-1}(V(f, U)) = U$, $\bar f$ is continuous. Thus sheaves transform a complicated
property of functions, such as analyticity, into the simpler one of continuity,
for the topology on the sheaf is chosen precisely so that a continuous section
on $U$ (i.e., an element of $F(U)$) corresponds to one of the specialized (e.g.,
analytic, differentiable) functions of $F(U)$.
5. History and applications. Sheaf theory is a particularly effective tool in
those areas which ask for global solutions to problems whose hypotheses are 
local. Among the early papers which introduced the ideas, though not the lan- 
guage, of sheaf theory, many were concerned with the Cousin problems from 
the theory of functions of several complex variables; the first (or additive) and 
second Cousin problems ask respectively about the existence of a meromorphic 
function with specified poles and the existence of a holomorphic function with 
specified zeros. Henri Cartan and Kiyoshi Oka independently solved these 
problems, working in the ring of germs of holomorphic functions introduced in 
our first example, where the operations take into account the domains of the 
functions. Oka [1950] cites Cartan [1940] as the source of the notion of “idéal
holomorphe de domaines indéterminés” in this ring, and both Oka [1951] and
Cartan [1944] refer to the article of W. Rückert [1933] which took the concept
of ideal from polynomial rings and interpreted it in the ring of functions on a 
fixed domain. Cartan [1944] carried on the investigation of the sheaf of germs, 
still in the earlier terminology, clarifying the relations among the problems with- 
out achieving solutions. 

Independently, Oka in 1948 wrote a paper [1950] (seventh in a series pub- 
lished from 1936 to 1953 and collected in a single volume [1961]) which de- 
veloped the same material in a more complete form, and carried it through to a 
solution of the first Cousin problem. Building on Oka’s paper, Cartan was able 
to solve the second problem as well as to simplify Oka’s solution to the first, 
and his paper [1950] and Oka’s were published together. A footnote acknowl- 
edges Oka’s solution of the second problem in the meantime [1951]. 

The 1950 Cartan paper for the first time phrases the questions in the sheaf 
theoretic terms which had been developed in the Séminaire Cartan in 1948-49. 
An analytic sheaf, that is a sheaf of modules over the sheaf of germs of holomorphic
functions, is called coherent over an open set $U$ if for every $x \in U$ there
is an open set $U_x$ such that the sections over $U_x$ generate the stalk at $y$ for all $y$
in a sufficiently small neighborhood of $x$. If $f_1, \dots, f_k$ are functions holomorphic
on a domain $D$, we may define the sheaf $R$ of relations among the $f_i$ by taking
the sections $R_U$ of $R$ over $U$ open in $D$ to be the set of $k$-tuples of holomorphic
functions $(g_1, \dots, g_k)$ for which $\sum_{i=1}^{k} f_i g_i = 0$ on $U$. In this vocabulary the first
Cousin problem is to show that R is coherent, while the second problem simi- 
larly asks whether the sheaf over an analytic variety is coherent, where a vari- 
ety is the set of common zeros of a set of holomorphic functions, and the ideal 
of sections over an open set is the ideal of functions on the variety which vanish 
on that open set. 

Cartan borrowed the term “faisceau” (sheaf) from Leray [1945; 1946]. 
Leray’s concept was closer to that of a “presheaf.” Cartan [1953] attributes the 
topological definition to an exposition by Lazard in the Séminaire Cartan 
[1950]. In each case, the key concept was that of a system of local coefficients. 
Studying sets of invariants for an object (base space) by investigating what 
functions can be defined from it to some convenient object called a set of coefficients,
as is done in cohomology, leads very naturally to a sheaf of coefficients
since the presheaf structure allows coefficients to be assigned locally, that is,
to each open subset of the base space. Formally, the principal construction of
cohomology with coefficients in a (pre)sheaf follows the Čech construction of
cohomology with fixed coefficients.

Let $X$ be a topological space and $S$ a sheaf over $X$, say of abelian groups.
For any open cover $\mathfrak{U}$ of $X$, a $q$-cochain, $q$ being a nonnegative integer, is an
alternating function which assigns to every $(q+1)$-tuple of sets in the cover $\mathfrak{U}$
a section over the intersection of these sets (the zero section if the intersection
is empty). $C^q(\mathfrak{U}, S)$ denotes the group of $q$-cochains. For each $q$ a coboundary
operator $\delta^q: C^q(\mathfrak{U}, S) \to C^{q+1}(\mathfrak{U}, S)$ is defined by


$$\delta^q f(U_{i_0}, \dots, U_{i_{q+1}}) = \sum_{j=0}^{q+1} (-1)^j f(U_{i_0}, \dots, \widehat{U}_{i_j}, \dots, U_{i_{q+1}}),$$


where the caret over $U_{i_j}$ means that $U_{i_j}$ is to be omitted from the arguments of
$f$, and each of the sections on the right is to be interpreted as restricted to the
intersection of all the $U_{i_j}$. By convention we write $\delta$ for all $\delta^q$. Since $f$ is alternating,
$\delta\delta = 0$, so the image $\delta(C^{q-1}(\mathfrak{U}, S))$, whose elements are called coboundaries,
is contained not merely in $C^q(\mathfrak{U}, S)$, but in the set of cocycles
$Z^q(\mathfrak{U}, S)$, the kernel of


$$\delta: C^q(\mathfrak{U}, S) \to C^{q+1}(\mathfrak{U}, S).$$


The $q$th cohomology group $H^q(\mathfrak{U}, S)$ of the cover $\mathfrak{U}$ with coefficients in $S$ is the
quotient group $Z^q(\mathfrak{U}, S)/\delta(C^{q-1}(\mathfrak{U}, S))$. Although the construction of the
cohomology group uses only the presheaf of sections of $S$, the sheaf property
allows us to interpret $H^0(\mathfrak{U}, S)$. In order for a 0-cochain $f$ to be a cocycle,


$$\delta f(U_0, U_1) = \rho_{U_0 \cap U_1,\,U_1}(f(U_1)) - \rho_{U_0 \cap U_1,\,U_0}(f(U_0))$$


must be zero, and in any sheaf, a collection of sections so related defines a
unique global section. Thus $H^0(\mathfrak{U}, S)$ is $S_X$, independent of the cover $\mathfrak{U}$. For
all $q$, if $\mathfrak{V}$ is a covering which refines a covering $\mathfrak{U}$, the restriction maps can be
used to define a canonical map $H^q(\mathfrak{U}, S) \to H^q(\mathfrak{V}, S)$. The direct limit, over all
coverings $\mathfrak{U}$ of $X$, of the groups $H^q(\mathfrak{U}, S)$ with these maps, is the $q$th cohomology
group of $X$ with coefficients in the sheaf $S$ and is denoted by $H^q(X, S)$.
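
The identity $\delta\delta = 0$ can be checked by hand in the lowest degree. The sketch below assumes the alternating-sum formula for $\delta$ with omitted arguments as described above, writes $f$ for an arbitrary 0-cochain, and suppresses the restriction maps (every section is understood as restricted to $U_0 \cap U_1 \cap U_2$).

```latex
% Lowest-degree check of \delta\delta = 0; restrictions to U_0 \cap U_1 \cap U_2 are suppressed.
\begin{align*}
(\delta f)(U_0, U_1) &= f(U_1) - f(U_0), \\
(\delta g)(U_0, U_1, U_2) &= g(U_1, U_2) - g(U_0, U_2) + g(U_0, U_1)
      \qquad\text{for a 1-cochain } g, \\
(\delta\delta f)(U_0, U_1, U_2)
   &= \bigl(f(U_2) - f(U_1)\bigr) - \bigl(f(U_2) - f(U_0)\bigr) + \bigl(f(U_1) - f(U_0)\bigr) = 0 .
\end{align*}
```

The cancellation is pairwise: each $f(U_i)$ appears once with each sign, which is the pattern that persists in every degree.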

One property frequently taken as axiomatic for cohomology theories holds
also for this one. If $0 \to F \xrightarrow{f} G \xrightarrow{g} H \to 0$ is an exact sequence of sheaves over $X$,
there is a long exact sequence of cohomology


$$0 \to H^0(X, F) \to H^0(X, G) \to H^0(X, H) \to H^1(X, F) \to \cdots$$
$$\to H^{q-1}(X, H) \to H^q(X, F) \to H^q(X, G) \to H^q(X, H) \to H^{q+1}(X, F) \to \cdots$$

[2, p. 28]. As usual, a pair of homomorphisms $A \xrightarrow{f} B \xrightarrow{g} C$ is exact at $B$ if $\operatorname{Im} f = \operatorname{Ker} g$,
so the exactness of $0 \to F \xrightarrow{f} G \xrightarrow{g} H \to 0$ means that $F$ is a subsheaf of $G$
(this requires $f$ to be an open mapping), $g$ is onto, and $H$ is isomorphic to the
quotient sheaf $G/F$. The existence of the long exact sequence is the major reason
for the usefulness of cohomology, for the 0-dimensional groups which begin
the sequence are the groups of global sections, while the higher groups by their
construction reflect the local properties of $X$.

For instance, if we take F to be the sheaf of germs of holomorphic functions 
on a complex manifold X, and G to be the sheaf of germs of meromorphic 
functions, then F is a subsheaf of G, and the global sections of the quotient 
sheaf G/F can be interpreted as the data of the first Cousin problem, since each 
section describes the behavior of a function near its poles [9, p. 161]. Thus, 
since this problem asks whether there exists a function meromorphic on X with 
such poles, the first Cousin problem may be interpreted as asking whether the 
last map in the sequence $0 \to F_X \to G_X \to (G/F)_X$ is onto. This sequence is the
beginning of the long exact cohomology sequence, and the next group in that
sequence is $H^1(X, F)$. The Cartan-Oka result is that $H^q(X, F) = 0$ for all $q \geq 1$
if $X$ is a Stein manifold, a class of manifolds with “sufficiently many” holomorphic
functions, which includes all Riemann surfaces which are connected
and non-compact. In addition to proving this result, Cartan [1953] and Serre
[1953] give other applications of the fundamental theorems for a Stein manifold
$X$:

THEOREM A. For every coherent analytic sheaf $S$ over $X$, $H^0(X, S)$, which is
the module of global cross sections $S_X$, generates the stalk $S_x$ for every $x \in X$.


THEOREM B. For every coherent analytic sheaf $S$ over $X$ and $q \geq 1$,
$H^q(X, S) = 0$.


Properties A and B characterize Stein manifolds. 
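
The step from the vanishing of $H^1(X, F)$ to the solvability of the first Cousin problem is a one-line diagram chase. In the sketch below the names $\beta$ (the map on global sections induced by $G \to G/F$) and $\partial$ (the connecting homomorphism of the long exact sequence) are labels assumed here for convenience.

```latex
% Why H^1(X,F) = 0 makes G_X -> (G/F)_X onto.
% \beta and \partial are assumed names for the maps in the long exact sequence.
\begin{align*}
&0 \to F_X \to G_X \xrightarrow{\ \beta\ } (G/F)_X \xrightarrow{\ \partial\ } H^1(X, F) \to \cdots
   && \text{(exact)} \\
&\operatorname{Im}\beta = \operatorname{Ker}\partial
   && \text{(exactness at } (G/F)_X\text{)} \\
&H^1(X, F) = 0 \;\Longrightarrow\; \partial = 0 \;\Longrightarrow\; \operatorname{Ker}\partial = (G/F)_X
   \;\Longrightarrow\; \beta \text{ is onto.}
\end{align*}
```

Read in the terms of the text: on a Stein manifold every prescription of principal parts, that is every global section of $G/F$, is the image of a global meromorphic function.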

The proceedings of the 1954 AMS summer institute [1956] illustrate that 
by then the basic concepts of sheaf theory had been clarified apart from the 
original example, and the bibliographies indicate that applications had begun 
to diversify, particularly into algebraic geometry. For example, Kodaira and 
Spencer showed the equivalence of several different definitions of the arithmetic 
genus of an algebraic variety and provided a classification of complex line 
bundles [1953]. Hirzebruch gave a sheaf-theoretic statement and proof of the 
Riemann-Roch theorem [1953; 1956], and Weil of the de Rham theorem [1952].

One seminar at the 1954 institute was based on an early version of Serre’s 
major article FAC [1955], the first entirely algebraic development of sheaf 
theory. The applications to complex variables had frequently made use of 
complex integration, and this tool was not available in abstract algebraic 
geometry. “Faisceaux algébriques cohérents” are coherent sheaves which are 
sheaves of modules over the sheaf of local rings on an algebraic variety. Serre 
showed that if the base field is the field of complex numbers, the theory of 
algebraic coherent sheaves is isomorphic to the theory of analytic coherent 
sheaves. Going further in the direction of an algebraic treatment, Grothendieck 
dealt with sheaves in the context of cohomology in an abelian category [1957]. 
The publication of Godement’s book [1958] signals the appearance of sheaf
theory as an independent discipline. The bibliography which follows lists both 
recent treatments of sheaf theory and books on other subjects which make use 
of sheaves. 
ON THE FOUNDATIONS OF SET THEORY
J. D. MONK, University of Colorado 


I want to discuss here the relevance to mathematicians, as teachers and 
researchers, of some of the recent discoveries about axiomatic set theory. Most 
readers have heard of these advances, which began just a few years ago with 
Cohen’s work. The results are certainly intellectually amazing to all of us. I 
think they may even give rise to certain changes in our teaching and research, 
and the purpose of this paper is to describe some possibilities along these lines. 
To set the stage and fix the ideas I shall first describe a few of these discoveries 
in a fairly precise way. Then, in the nonexact portion of the paper, I shall dis- 
cuss some possible changes in teaching and research, and also some philosophi- 
cal views which are affected by these discoveries. 


1. A survey of results. A much more comprehensive (and more technical) 
survey can be found in Mathias [7]. Here I state just a very few results, but I 
wish to emphasize that the nonmathematical arguments of the next section 
apply in some form to virtually all of the results described in [7]. I assume that 
the reader has a modest acquaintance with the idea of a language and a meta- 
language, and with the precise notions of a (first-order) sentence, a (formal) 
proof, and a theorem. In this section I work in a metalanguage and talk about 
the language of mathematics. I leave the metalanguage unspecified in detail; 
to begin with I assume that it is rather weak, with just enough machinery to 
formulate the above precise notions. If T is a list of sentences (thought of as
axioms for a certain theory), and $\varphi$ is a single sentence, I write $T \vdash \varphi$ to indicate
that there is a formal proof of $\varphi$ from T, i.e., that there is a list of sentences, each
of which is either a logical axiom, appears in the list T, or is obtained from earlier
sentences of the list by applying a rule of inference. I write $T \nvdash \varphi$ if there is no
proof of $\varphi$ from T. Before embarking on a description of set-theoretical matters
I want to mention two famous theorems of Gödel which form a background upon
which to view the latest results.


THEOREM 1. (Incompleteness theorem.) If T is a consistent, sufficiently
strong, effective list of sentences, then there is a sentence $\varphi$ such that $T \nvdash \varphi$ and
$T \nvdash \neg\varphi$.


Here “$\neg$” is an abbreviation for “not.” T is consistent if $T \nvdash \psi$ for some $\psi$.
To say that T is sufficiently strong means, roughly speaking, that T embodies
enough mathematics to develop elementary number theory; technically speaking,
Peano arithmetic, P, is relatively interpretable in T. To say that T is
effective means that the list T is presented in a reasonable manner, reasonable
enough for one to be able to recognize by some algorithm when a sentence is in
the list T. Certainly T is effective if T is finite: to check if a sentence $\varphi$ is in T
just look through the whole list, a process which in principle terminates with
the last member of T. But Theorem 1 applies to some infinite lists also. Precisely
speaking, one assigns numbers, called Gödel numbers, to all sentences,
and T is called effective if the set of Gödel numbers of members of T is a recursive
set. Clearly, to express this notion of effectiveness my original weak
metalanguage must be strengthened enough to work in an elementary way with 
recursive functions, integers, and sets of integers. The other theorems of this 
section are also formulated in this stronger metalanguage, which is still much 
weaker than the ordinary language of mathematics. 

Theorem 1 itself has a profound philosophical significance. According to it, 
one cannot hope to base all of conceivable mathematics on a single axiomatic 
basis; it points out the necessity of a continuous search for additional axioms. 
The importance of the theorem can be more appreciated in connection with the 
opposed philosophical views of formalism and platonism which will be discussed 
in section 2. I want to point out now, though, that the incompleteness theorem 
has not had much effect on the attitude of the working mathematician. In con- 
trast, the latest results concerning the independence of the Axiom of Choice 
and the Continuum Hypothesis are already having an effect on teaching and 
research. I think that one of the main reasons for the lesser practical significance 
of the incompleteness theorem lies in the nature of the known proofs of the 
theorem. The sentence $\varphi$ whose existence is asserted in the theorem is effectively
constructed, but its intuitive meaning is not to be found in ordinary mathematics.
It can be interpreted as asserting a relationship between formal proofs
and its own Gödel number, and its construction is another instance of the well-known
Cantor diagonal method. Theorem 1, essentially due to Gödel, can be
found in Tarski, Mostowski, Robinson [10] (it follows immediately from
Theorems I1, 17, 110, and II9 there). 

If T is an effective list of sentences, one can effectively construct a number-theoretic
sentence ConT which expresses the statement that T is consistent
(under a natural interpretation, via Gödel numbers).


THEOREM 2. (Gödel’s second underivability theorem.) If T is a consistent,
sufficiently strong, effective list of sentences, then $T \nvdash \mathrm{Con}T$.


See Feferman [2] for a proof of this theorem; it is also pointed out in [2]
that one must take some care in the construction of the sentence ConT. This
second theorem of Gödel again puts a limitation on what one can do in the
foundations of mathematics. If T is a list of sentences which provides axioms 
for a large portion of mathematics, then by Theorem 1 not all mathematics is 
encompassed by 7. One is thus forced to be somewhat modest. Theorem 2 
forces a more severe degree of modesty: it is impossible to prove the consistency 
of T (without using devices not available in T itself). One would like to know 
that working with T has some significance, which it does not if 7 is inconsistent; 
so no method is available for logically proving that mathematics is significant. 

I now turn to set theory. Students are taught these days that all mathemat- 
ics can be based on set theory, indeed, that ordinary mathematics logically 
speaking is just a branch of set theory. It is, in fact, well established that al- 
most all mathematics can be reduced to set theory. The only doubts that may 
arise concerning such a reduction have to do with the recently developed theory 
of categories. Anyway, in talking about set theory I think one is talking about 
essentially all of mathematics. I shall fix upon a particular list ZF of axioms for 
set theory, the Zermelo-Fraenkel axioms. These are essentially the axioms used 
in Bourbaki [1] and Halmos [5], but I assume that the axiom of regularity is 
also included (see Monk [8]), while the Axiom of Choice is excluded. By the 
Axiom of Choice, for short AC, I mean the statement that if A is a family of
nonempty sets, then there is a function f (called a choice function for A) such
that f(x) ∈ x for each x ∈ A. If A is a finite family (finite defined conventionally),
then such a function can be proved to exist within ZF, using only the simplest 
set-theoretical axioms (this was overlooked in Hall, Spencer [4], p. 282). In 
general, however, the existence of f requires infinitely many choices, and there is 
no principle within ZF which makes this legal. The axiom of choice can be 
proved equivalent, within ZF, to Zorn’s lemma, to the Well-ordering Principle, 
to Tukey’s lemma, etc. By the Continuum Hypothesis, for short CH, I mean the
statement that any infinite set of real numbers can be placed in one-to-one 
correspondence either with the integers or with the set of all real numbers. 
Finally, the Generalized Continuum Hypothesis, GCH for short, asserts that for 
any infinite set X and any family Y of subsets of X, Y can be placed in one-to- 
one correspondence either with a subset of X or with the set of all subsets of X. 
Since in ZF+AC one can show that the reals can be placed in one-to-one correspondence
with the set of all subsets of Z (the set of integers), it follows that
ZF + AC ⊢ GCH → CH. Now I shall list a few of the important recent results in
the foundations of set theory. 


THEOREM 3. (Gödel, 1938.) Con(ZF) ⇒ Con(ZF + AC + GCH).
I could just as well formulate Theorem 3 as follows:


If ZF is consistent, then so is ZF + AC + GCH.


Theorem 3 is written as a theorem of number theory so as to make more
precise the assumptions in the metalanguage and to establish a connection with 
Theorem 2. The question of consistency of ZF+AC+GCH reduces to the same 
question for ZF. By Theorem 2 this latter question cannot be given a rigorous 
affirmative answer. Since set theory, in particular ZF, has been used so much in 
the last century, mathematicians have grown confident that it is, in fact, consistent.
Thus one can assert with the same degree of confidence that
ZF + AC + GCH is consistent.

THEOREM 4. (Cohen, 1963.) Con(ZF) ⇒ Con(ZF + AC + ¬CH).


In Theorem 4, ¬CH can be taken in various very specific forms, such as
2&0 = &., or 28o= Nz, or 280 = N,.3. The most general results of this sort have been 
established by Easton and Solovay; see, e.g., [9].


THEOREM 5. (Fraenkel, Mostowski, Cohen, 1929-1963.) Con(ZF) ⇒ Con(ZF + ¬AC).


Again, ¬AC can be taken in many more definite forms, for example: there
is a countable collection of unordered pairs without a choice function.


THEOREM 6. (Vitali, 1905.) ZF + AC ⊢ ∃x (x is a set of real numbers, but x
is not Lebesgue measurable).


THEOREM 7. (Solovay, 1965.) Con(ZF’ + AC) ⇒ Con(ZF + ¬AC + ∀x (if x is
a set of real numbers, then x is Lebesgue measurable)).


Here ZF’ is obtained from ZF by replacing the usual axiom of infinity by a 
stronger one which asserts the existence of an uncountable strongly inaccessible 
cardinal. This stronger axiom is coming more and more to be an accepted part 
of set theory. For example, category theory appears to require this axiom, or 
even stronger axioms, to justify its methods. Intuitively, Con(ZF’+AC) seems 
as plausible as Con(ZF+AC); like ZF+AC itself, the consequences of ZF’+AC 
have been rather well worked-out and some confidence can be placed in its 
consistency. Since, however, ZF’ + AC ⊢ Con(ZF + AC), the new axiom of infinity
is much stronger than the old axiom (see Theorem 2).


THEOREM 8. (Solovay.) Con(ZF) ⇒ Con(ZF + ¬AC + ∃x (x is a set of real
numbers and x is not Lebesgue measurable)).

By Theorem 8, the Axiom of Choice is not equivalent to the existence of 
non-Lebesgue-measurable sets of real numbers. 




These theorems I have listed represent a small sampling of results known 
in this area; they are sufficient to form a basis for the non-mathematical argu- 
ments in the next section. The interested reader can think of many familiar 
theorems whose proofs involve the Axiom of Choice, for example, and ask 
whether results similar to the above for Lebesgue measure hold. Thus the exis- 
tence of Hamel bases, the Hahn-Banach extension theorem, the Boolean prime 
ideal theorem, the Banach-Tarski paradox, and the extendability of a partial 
order to a linear order are theorems which give rise to independence questions 
of this sort. Even assuming the Axiom of Choice, many statements of a set- 
theoretical nature have until recently been open. Examples are: the existence of 
a nontrivial measure on the set of all subsets of a set, and Souslin’s hypothesis. 
Some of these many natural hypotheses have now been settled (in the sense of 
being shown independent), while others are still under attack. The intuitive 
remarks in the next section apply to all of these questions. 


2. Meaning of the results. What do these results “do” for the ordinary 
mathematician? Before indicating some specific possibilities along these lines, 
it is worthwhile briefly to take a deeper view of the significance of the results. 
The results throw a great deal of light on a certain dichotomy in the philosophy 
of mathematics which now has a long history. Without trying to connect 
theories in the philosophy of mathematics with broader philosophical trends, I
will distinguish two extreme views, platonism and formalism. These are not the 
only possible philosophies of mathematics. For example, intuitionism has a 
great appeal and is close to the beliefs of many practicing mathematicians. But 
the philosophies of most mathematicians can be construed as somewhere in 
the range between extreme platonism and extreme formalism. Practicing math- 
ematicians, consciously or not, subscribe to some philosophy of mathematics 
(if unstudied, it is usually inconsistent). If you make a simple reference to the 
real numbers, you express a tendency toward platonism. And if you refer to a 
theorem as correct because it follows from the axioms of set theory, you tend 
toward formalism. 

According to extreme platonism, mathematical objects are real, as real as 
the world we live in. Thus infinite sets exist, not just as a mental construct but 
in a real sense, perhaps in a “hyperworld.” Similarly, nondenumerable sets, 
real numbers, choice functions, Lebesgue measure, etc., have a real existence. 
From the point of view of platonism, the purpose of a mathematician is to dis- 
cover some of the facts of nature. His job is thus quite similar to that of a 
physicist, chemist, or biologist. The various possible axioms of set theory are 
then either true or false, and one of the main aims in the foundations of math- 
ematics is to develop correct intuitions so as to determine which are the true 
axioms; these may then be taken as a rigorous basis for set theory. Actually, for 
a platonist, axiomatic development of set theory is not essential, but is perhaps 
useful to keep from making mistakes. (Even a platonist, however, will admit the 
importance of axiomatic treatments outside set theory, as in topology or group 
theory, because of the usefulness of axiomatizations for abstractions and
classifications.)

A strict formalist, on the other hand, does not believe that any mathematical 
objects have a real existence. For him, mathematics is just the business of de- 
riving sentences from axioms. It is a game, in that some definite rules must be 
followed in such derivations. Unsolved problems give rise to goals for the game; 
the winner is the one who solves the problem. The analogy with games like 
chess and go is very close. A formalist chooses which game to play, that is, which 
axioms to take and which problem to work on, using practical and artistic 
criteria. One set of axioms may be best suited to be a base for a physical theory 
like relativity, for example, and hence because of the predictive ability of the 
physical theory this set of axioms has a practical value. And, of course, some 
problems are more practical in nature than others. The criteria for choosing 
axioms and problems, when not practical in nature, are extremely varied. A 
certain axiom may enable one to resolve many questions that are difficult without
its aid; assuming GCH, for example, infinite cardinal arithmetic is
very much simplified. Many investigations are made in order to try to relate 
two seemingly distant areas in mathematics; one may cite the duality theory for 
Boolean algebras, relating algebraic to topological structures, or the investiga- 
tion of closure algebras—doing topology within algebra. These two criteria for 
choosing the right game are just examples, and are undoubtedly not the most 
important of those criteria which are not based on practical considerations. 

The results of section 1 have different meanings for platonists and formalists. 
The incompleteness theorem (Theorem 1) shows the platonist that he cannot 
hope to capture all of mathematics in a completely rigorous form for once and 
for all. There will remain beyond any fixed rigorous framework a statement 
whose truth must be determined by intuition. On the other hand, a strict 
formalist may even doubt that Theorem 1 says something relevant to his ac- 
tivity, since, as I indicated in section 1, the formulation and proof of Theorem 1 
require a metalanguage stronger than the minimal one needed to understand the 
notions of proof and theorem which are the basic “rules of the game” for the 
formalist. But if the formalist does admit the usual intuitive meaning of 
Theorem 1, the theorem will just be taken as evidence that one cannot be con- 
tent with just one axiom system if one wants to develop a comprehensive part 
of mathematics. To a platonist, Theorem 2 shows again the weakness of axiom 
systems; to him a system such as ZF, for example, is obviously consistent, since 
all the axioms of ZF are intuitively true. A formalist would also view Theorem 
2 as showing a weakness of formal systems, again, if he admitted the usual 
intuitive meaning of the theorem. The other results in Section 1, independence 
results in set theory, give several examples of important statements which 
cannot be decided on the basis of the usual axioms of set theory. Here the 
platonist will try to investigate the situation further, in hopes of finding an 
impelling intuitive principle with the aid of which these statements can be re- 
solved one way or the other. The formalist will simply view the results as giving 
rise to several alternative paths of development of set theory.

These two philosophical viewpoints do not make much difference in the 
communication of mathematical results to other mathematicians. A correct 
mathematical argument from given premises is recognized as such by a platonist 
and by a formalist. The philosophical questions inherent in the dichotomy can 
be, and are, ignored in the writing of most mathematicians. In the main this 
indifference to the philosophy of mathematics has had a good effect on the 
progress of mathematics; mathematicians have insisted on proving theorems 
rather than spending the bulk of their time on difficult and ultimately inconclusive
philosophical speculations. Nonetheless, the two views have an effect on the 
direction of mathematical research. For example, a platonist may convince 
himself that CH is false (cf. Gödel [3]). He is then less likely to try to derive
consequences from CH. But a formalist may very well like CH and even GCH 
because he can prove many nice theorems with their aid. 

My remarks so far in this section concern mathematics itself, or at least 
mathematical research. I now turn to some specific possibilities for change in 
teaching and research which might come about because of the recent inde- 
pendence results. Mainly I will discuss the definition of the real number system. 
In beginning analysis courses it is customary to give the main properties of the 
real numbers and perhaps to carry out one of the constructions of the reals 
from the rationals. Many constructions of the reals are known; the ones using 
Dedekind cuts and Cauchy sequences are the most popular. All of these con- 
structions turn out to be equivalent. This fact is rightly used, I think, as evi- 
dence for the naturalness of the notion of real number. Another basic, and 
satisfying, result here is uniqueness: any two Dedekind-complete ordered fields 
are isomorphic. But what are the real numbers? Under a platonistic point of 
view, the real numbers exist in nature; in teaching beginning mathematicians 
it would be nice to be able to point and say: here they are. A natural definition 
would be as an isomorphism equivalence class of Dedekind complete ordered 
fields. However, such classes are too big, and are not admitted as existing in 
ZF. There is a sophisticated way of chopping such classes down to a manageable 
size, but the method is not suitable for elementary classes (cf. Monk [8], p. 
114). The only natural way out seems to be to fix upon a definite construction of 
the reals in order to give a specific definition for them. Then different mathema- 
ticians will have different definitions, but at least they can be shown equivalent. 

Frequently the uniqueness theorem is used as a basis for asserting that all 
questions about the real numbers can be resolved, at least theoretically: any- 
thing true of one Dedekind-complete ordered field is true of another. But then 
Theorems 3 and 4 pose a puzzle. CH is a property of the real numbers (in a
broad sense), and it is consistent to assume CH, but also consistent to assume
¬CH (assuming ZF consistent). A closer analysis reveals the true state of
affairs. Call a Dedekind-complete ordered field a system of real numbers. It is 
then provable within ZF that CH holds with respect to one system of real 
numbers if and only if it holds with respect to any other system of real numbers.
But neither side of this biconditional is actually provable in ZF. One may say 
that the uniqueness theorem (theoretically) reduces all questions about the 
reals to purely foundational, set-theoretical questions. It seems to me to be 
appropriate to bring discussions such as this down to the teaching level. Stu- 
dents should be aware of the possibility of getting different conceptions of the 
real numbers by choosing one or another of various hypotheses such as CH. 
It also seems to me that it would be useful to make students aware of the al- 
ternative philosophies of platonism and formalism. 

Similarly, in the important applications of the Axiom of Choice I think it is 
appropriate to point out various alternatives that exist. Above all, I think a 
retreat from dogmatism is called for. In proving the existence of non-Lebesgue- 
measurable sets, it should be pointed out that, at a price, one can as well as- 
sume that every set of real numbers is Lebesgue-measurable. Similar remarks 
are appropriate whenever the mathematical or foundational results are of 
significant import to most mathematicians; certainly when one discusses such 
topics as the Hahn-Banach theorem, Souslin’s hypothesis, the existence of 
maximal ideals, Tychonoff’s theorem, etc. 

The possibilities for remarks in classroom teaching thus appear to be very 
great, even though one cannot expect of teachers more than a mention of the 
independence results in question, since the proofs seem as yet inaccessible to an 
audience not able to devote some months to a study of these matters. 

Possible uses of these results in research are rather obvious, but limited. 
Thus most mathematicians do not work in areas where a choice of CH or ¬CH
would make a big difference. And most modern mathematics depends funda- 
mentally on the Axiom of Choice, so that the independence results such as I 
have mentioned are not of great practical import. But there are some instances 
in research where a new foundational hypothesis might prove useful. An analyst 
might like to assume that every set of real numbers is Lebesgue measurable. 
This can be done even while retaining a weak form of the Axiom of Choice. 
Again, the assumption that 2^ℵ₀ = ℵ₁ might facilitate the construction of counterexamples
in certain contexts. Research based on Souslin’s hypothesis has not
been done very much. Many more possibilities could be stated, and research 
using these unusual hypotheses is needed. From a platonistic point of view such 
research might lead to a better insight into the nature of our underlying set 
theory. 

I hope that the discussion I have given will convince more mathematicians 
to become familiar with the results obtained in the foundations of set theory 
and keep these results in mind in their teaching and research. 


This article is the approximate text of a talk given at the Rocky Mountain Section meeting of 
the Mathematical Association in May, 1969. I wish to thank Gebhard Fuhrken and Watson Fulks 
for useful comments on a first draft of this article. 
THINKING GEOMETRICALLY*
DANIEL PEDOE, University of Minnesota 


To many people the word “geometry” inevitably suggests a figure, a draw- 
ing. We are aware of the fact, apparently overlooked by Euclid, that we 
have to be very careful in arguing from a figure, that we may unwittingly as- 
sume properties which are not deducible from the given hypotheses, and may 
therefore arrive at incorrect logical conclusions. This may be a partial explana- 
tion of the fact that the whole subject of geometry, especially elementary 
geometry, is under attack these days. The leader of the attack, and a very 
formidable person he is, seems to be my old friend Prof. Jean Dieudonné. 
He is, of course, a very fine geometer, and a well-known member of the Bourbaki 
school. 

Dieudonné has made his views known on a number of occasions, and most
explicitly perhaps in a long preface to a book Linear algebra and geometry,
published in 1964, and recently reissued in a third edition and in an English
translation [1]. This book was written as a teachers’ book for high schools in France,
and introduces geometric ideas in the plane and Euclidean space via linear
algebra. There are no diagrams, although their use is not forbidden in this book
as they are in a more advanced one, where Dieudonné talks of a “strict ad-
herence to axiomatic methods, with no appeal whatsoever to ‘geometric in-
tuition’, at least in the formal proofs: a necessity which we have emphasized by
deliberately abstaining from introducing diagrams in this book. My opinion
is that the graduate student of today must, as soon as possible, get a thorough
training in this abstract and axiomatic way of thinking,” [2, p. 5].

* This paper is based on lectures given at the Universities of South Carolina and Toronto in
the spring of 1968, and at Makerere College, Uganda and the University of the Witwatersrand
in the summer of 1969. The invited address to the North Central Section of the MAA in April,

The high school book begins with a set of axioms for the real numbers, then 
with a set for vector spaces, and proceeds to develop linear algebra and to 
introduce the geometry of the Euclidean plane and space by these methods. 
Dieudonné says that he knows nothing about the way children between the 
ages of 11 and 14 think, but he thinks that the Hilbert axioms by which Eucli- 
dean geometry can be developed rigorously are too involved, and that a develop- 
ment via a simpler set of axioms is desirable. His book is written, he adds, 
merely to put on record for the benefit of any future historian who might be 
interested how Dieudonné believes that elementary geometry can be taught in 
a rational manner. It should be said, parenthetically, that using this book to 
teach from is a most stimulating intellectual adventure. 

In case the reader might not notice certain omissions in his book, Dieudonné 
makes it plain in his preface that he thinks French schools spend far too much 
time on special properties of the triangle, on trigonometry, on circles and sys- 
tems of circles, on conics and systems of conics, and so on. He is very amusing 
when he says that books on trigonometry filled with formulas are written for 
astronomers, surveyors, and for writers of books on trigonometry, and that 
school children should never be deliberately trained for any of these professions! 
He gives, in his book, the essentials of trigonometry for use in modern math- 
ematics, and there are no special formulas for the triangle. In fact, at a meeting 
of the North Atlantic Treaty Organization held some years ago, Dieudonné 
went on record as saying “Away with the triangle!” He does not want triangles 
mentioned, let alone studied in elementary geometry at all. 

On another occasion Dieudonné has asked: “Who ever uses barycentric 
coordinates?”, and in the preface to the high school book he complains that he 
studied systems of circles as a student, but has never come across them again in 
mathematics. If, says Dieudonné, one ever comes across a conic, one can treat 
it by the methods of the differential and integral calculus, as one does other 
curves. 

In the course of this paper I shall discuss some problems—and mathematics, 
after all, is concerned with posing and solving problems—in which ideas about 
triangles, barycentric coordinates, systems of circles and conics help in the 
solution. I shall not use the differential calculus. 

Perhaps I should assure any anxious reader that there is no evidence that 
too much time is spent in American high schools on the study of triangles,
circles, and whatnot. I asked my sophomore students recently, working back- 
wards to try to discover something they did know, whether anyone knew the 
formula for the area of a triangle. Eventually one student said: “One half the 
base times the height,” at which another student remarked very wisely: “Oh, 
I only had that for a right-angled triangle!” 


1. The first problem I wish to discuss in which geometrical thinking is helpful
runs as follows: Perpendiculars are dropped from the vertices A, B, C of a
triangle ABC in a plane π onto a distinct plane π’, forming an equilateral
triangle A’B’C’. Show that the length x of a side of A’B’C’ satisfies the equation


3x⁴ − 2x²(a² + b² + c²) + 16Δ² = 0,


where a, b, c are the lengths of the sides of the given triangle ABC, and Δ is its
area.

This is a straightforward exercise in the use of Pythagoras. It occurred to 
me to wonder whether this equation has real roots. This was not mentioned in 
the problem, which turned up as an exercise in an English high school textbook. 
The condition is a² + b² + c² ≥ 4√3 Δ; with the help of a little algebra, this
condition is equivalent to the condition


(b² − c²)² + (c² − a²)² + (a² − b²)² ≥ 0.


This shows that the roots of the equation are always real, and that the inequal- 
ity for a triangle, which is therefore always valid, becomes an equality if and 
only if the triangle ABC is equilateral. 
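
The algebra behind the last two displays is easy to check numerically. The following is a minimal sketch in Python, not part of the original exercise; the function names are illustrative choices. It treats the equation as a quadratic in x² and confirms, for a sample triangle, that the smaller root gives a genuine projection: the three height differences √(a² − x²), √(b² − x²), √(c² − x²) are then such that one is the sum of the other two.

```python
import math


def area(a, b, c):
    """Area of the triangle with sides a, b, c (Heron's formula)."""
    s = (a + b + c) / 2
    return math.sqrt(s * (s - a) * (s - b) * (s - c))


def projected_side_squares(a, b, c):
    """Both roots, in x^2, of 3x^4 - 2x^2(a^2 + b^2 + c^2) + 16*Delta^2 = 0."""
    s2 = a * a + b * b + c * c
    delta2 = area(a, b, c) ** 2
    # Discriminant of the quadratic in x^2.  Algebraically it equals
    # 8 * ((b^2 - c^2)^2 + (c^2 - a^2)^2 + (a^2 - b^2)^2), so it is never
    # negative; max() only guards against floating-point rounding.
    disc = max(4 * s2 * s2 - 192 * delta2, 0.0)
    root = math.sqrt(disc)
    return (2 * s2 - root) / 6, (2 * s2 + root) / 6


def height_differences(a, b, c, x2):
    """sqrt(a^2 - x^2), sqrt(b^2 - x^2), sqrt(c^2 - x^2), sorted ascending."""
    return sorted(math.sqrt(side * side - x2) for side in (a, b, c))


if __name__ == "__main__":
    small, large = projected_side_squares(3, 4, 5)
    u, v, w = height_differences(3, 4, 5, small)
    print("roots in x^2:", small, large)
    print("u + v - w   :", u + v - w)
```

For the 3, 4, 5 triangle only the smaller root is usable, since a projected side cannot exceed the shortest original side; this is an observation about the example, not a general claim.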

This did not seem to be terribly exciting, until I discovered that in 1919 
Weitzenboeck [10] had published this inequality for a triangle, had proved it 
algebraically, and was obviously rather proud of his result. It had occurred to 
me in the meantime that a better way to show that the roots of the equation 
are always real is to inscribe an ellipse touching the sides of the triangle ABC 
at their respective midpoints, and to project this ellipse orthogonally into a 
circle. This is always possible, and the resulting triangle A’B’C’ around the 
circle must be an equilateral triangle. This projection is the basis of a film I 
have made called Orthogonal Projection, and the problem of inscribing an ellipse 
in a triangle to touch the sides at their midpoints is the basis of another film 
called Central Similarities [9]. 

Having arrived at an inequality for any triangle by showing that an orthogonal
projection into an equilateral triangle is always possible, I wondered
whether a given triangle ABC can always be projected orthogonally into a
triangle of given shape, with sides ka’, kb’, kc’. The method, using Pythagoras, is
as before and one ends with the equation


16Δ’²k⁴ − 2Θk² + 16Δ² = 0,


where Δ is the area of triangle ABC, where Δ’ is the area of a triangle with sides
a’, b’, and c’, and where

Θ = a²(−a’² + b’² + c’²) + b²(a’² − b’² + c’²) + c²(a’² + b’² − c’²).

The condition for real roots of this equation is

Θ² ≥ (16ΔΔ’)²,

and this suggested the possibility of a two-triangle inequality:

Θ ≥ 16ΔΔ’.


I wondered whether I could show geometrically that the orthogonal projection 
is always possible, and so derive the two-triangle inequality. This is where 
barycentric coordinates (areal coordinates) are useful! 
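
Before turning to the geometric argument, the two-triangle inequality and the equation for k can be tried on concrete numbers. The sketch below is an illustration only; the names `theta` and `scale_squares` are assumptions made here, and triangles are passed as triples of side lengths (a, b, c) and (a’, b’, c’).

```python
import math


def area(a, b, c):
    """Area of the triangle with sides a, b, c (Heron's formula)."""
    s = (a + b + c) / 2
    return math.sqrt(s * (s - a) * (s - b) * (s - c))


def theta(t, u):
    """a^2(-a'^2 + b'^2 + c'^2) + b^2(a'^2 - b'^2 + c'^2) + c^2(a'^2 + b'^2 - c'^2)."""
    a, b, c = t
    a1, b1, c1 = u
    return (a * a * (-a1 * a1 + b1 * b1 + c1 * c1)
            + b * b * (a1 * a1 - b1 * b1 + c1 * c1)
            + c * c * (a1 * a1 + b1 * b1 - c1 * c1))


def scale_squares(t, u):
    """Both roots, in k^2, of 16*D'^2*k^4 - 2*Theta*k^2 + 16*D^2 = 0."""
    th = theta(t, u)
    d, d1 = area(*t), area(*u)
    # Real roots exist exactly when Theta^2 >= (16*D*D')^2.
    disc = max(th * th - (16 * d * d1) ** 2, 0.0)
    root = math.sqrt(disc)
    return (th - root) / (16 * d1 * d1), (th + root) / (16 * d1 * d1)


if __name__ == "__main__":
    t, u = (3, 4, 5), (5, 6, 7)
    print("Theta      :", theta(t, u))
    print("16*D*D'    :", 16 * area(*t) * area(*u))
    print("k^2 roots  :", scale_squares(t, u))
```

With similar triangles, for instance (3, 4, 5) and (6, 8, 10), the two printed quantities coincide, which matches the equality case stated later in the text.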

Taking ABC as triangle of reference, consider the inscribed conic with 
equation 


p²X² + q²Y² + r²Z² − 2pqXY − 2qrYZ − 2rpZX = 0.


If we consider the intersections of this conic with X + Y + Z = 0, the line at
infinity, the discriminant of the resulting quadratic equation is 


— pqr(p + q + r), 


and so we are assured that if p, q, r are all positive, that is, if the points of contact
of the conic with the sides of ABC are all internal points, then the inscribed
conic is an ellipse. Project this ellipse into a circle by orthogonal pro-
jection, and suppose that the triangle ABC projects into triangle A’B’C’. The 
center of the ellipse projects into the center of the incircle for triangle A’B’C’. 
The center of the ellipse has coordinates 


q + r : r + p : p + q,


and the center of the circle has coordinates a’:b’:c’, where these are the lengths 
of the sides of triangle A’B’C’. Since the ratio of areas is unchanged by orthog- 
onal projection, and the center of the ellipse projects into the center of the 
incircle, we have 


q + r : r + p : p + q = a’ : b’ : c’,
so that
p : q : r = −a’ + b’ + c’ : a’ − b’ + c’ : a’ + b’ − c’.


If we choose the ratios p:q:r to satisfy these equations and project the resulting 
ellipse orthogonally into a circle, then the projection of triangle ABC will be a 
triangle similar to triangle A’B’C’. 
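
The barycentric computations in this argument can be checked with exact integer arithmetic. The following sketch is illustrative (the function names are not from the paper). It evaluates the conic p²X² + q²Y² + r²Z² − 2pqXY − 2qrYZ − 2rpZX, restricts it to the line at infinity by setting Z = −X − Y, and compares the discriminant of the resulting binary quadratic with −pqr(p + q + r); the discriminant B² − 4AC computed here carries an extra positive factor 16, which does not affect its sign.

```python
def conic(p, q, r, X, Y, Z):
    """Left-hand side of the inscribed conic in barycentric coordinates."""
    return (p * p * X * X + q * q * Y * Y + r * r * Z * Z
            - 2 * p * q * X * Y - 2 * q * r * Y * Z - 2 * r * p * Z * X)


def infinity_discriminant(p, q, r):
    """B^2 - 4AC of the conic restricted to X + Y + Z = 0 (Z = -X - Y)."""
    A = conic(p, q, r, 1, 0, -1)            # coefficient of X^2
    C = conic(p, q, r, 0, 1, -1)            # coefficient of Y^2
    B = conic(p, q, r, 1, 1, -2) - A - C    # coefficient of XY
    return B * B - 4 * A * C


def tangency_ratios(a1, b1, c1):
    """p : q : r chosen so that q + r : r + p : p + q = a' : b' : c'."""
    return (-a1 + b1 + c1, a1 - b1 + c1, a1 + b1 - c1)


def center(p, q, r):
    """Barycentric coordinates q + r : r + p : p + q of the conic's center."""
    return (q + r, r + p, p + q)


if __name__ == "__main__":
    p, q, r = tangency_ratios(5, 6, 7)
    print("p : q : r          =", (p, q, r))
    print("center             =", center(p, q, r))
    print("discriminant       =", infinity_discriminant(p, q, r))
    print("touches BC at 0:r:q:", conic(p, q, r, 0, r, q) == 0)
```

A negative discriminant means the conic has no real points on the line at infinity, which is the ellipse case used in the text.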

We have not yet finished with the geometrical ideas associated with this two-
triangle inequality. In 1937 Finsler and Hadwiger pursued the ideas of
Weitzenboeck, and derived his triangle inequality as follows [3]. They erected
equilateral triangles A’’BC, B’’CA and C’’AB inwards on the sides BC, CA
and AB respectively of triangle ABC. Let V be the centroid of B’’CA and W
the centroid of C’’AB. Then by trigonometry,


(VW)² = k(a² + b² + c² − 4√3 Δ),


where k is a positive constant for the triangle ABC. Since (VW)² ≥ 0, we have
a² + b² + c² ≥ 4√3 Δ, with equality if and only if the triangle ABC is equilateral.
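
The Finsler and Hadwiger construction can be reproduced with complex coordinates. The sketch below is only an illustration: the helper names are invented here, and the value k = 1/6 is an assumption of this sketch (it is what the coordinate computation yields), since the text states only that k is a positive constant.

```python
import math


def inner_centroid(P, Q, R):
    """Centroid of the equilateral triangle erected on PQ on the same side as R."""
    mid = (P + Q) / 2
    normal = (Q - P) * 1j                     # PQ turned through 90 degrees, length |PQ|
    to_r = R - mid
    if normal.real * to_r.real + normal.imag * to_r.imag < 0:
        normal = -normal                      # make the normal point towards R
    # The centroid lies at distance |PQ| / (2*sqrt(3)) from the side PQ.
    return mid + normal / (2 * math.sqrt(3))


def finsler_hadwiger_sides(A, B, C):
    """Return (VW^2, (a^2 + b^2 + c^2 - 4*sqrt(3)*Delta) / 6) for triangle ABC."""
    a, b, c = abs(B - C), abs(C - A), abs(A - B)
    delta = abs(((B - A).conjugate() * (C - A)).imag) / 2
    V = inner_centroid(C, A, B)               # centroid of B''CA
    W = inner_centroid(A, B, C)               # centroid of C''AB
    lhs = abs(V - W) ** 2
    rhs = (a * a + b * b + c * c - 4 * math.sqrt(3) * delta) / 6
    return lhs, rhs


if __name__ == "__main__":
    print(finsler_hadwiger_sides(0j, 4 + 0j, 3j))
```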

I was certain that this method would give my two-triangle inequality. I 
erected triangles A’’BC, B’’CA, and C’’AB inwards on the sides BC, CA and AB
respectively of triangle ABC, each similar to a triangle A’B’C’, and then I had to
determine what would correspond to the centroid of an equilateral triangle for
these triangles A’’BC, B’’CA and C’’AB. Anyone who has worked with
triangles knows that the centroid has no useful angles associated with it, but the
circumcenter of a triangle has, and I found that the square of the distance be- 
tween the circumcenter V of B’’CA and the circumcenter W of C’’AB is given 
by the formula 


2(VW/a’)² = (R’/a’b’c’)² [Σ a²(−a’² + b’² + c’²) − 16ΔΔ’],


where R’ is the circumradius of triangle A’B’C’. Since the right-hand side is
symmetric in the three sides, this shows that the triangle UVW (U being the
circumcenter of A’’BC) is similar to the triangle A’B’C’, and also gives the two-triangle inequality


a²(−a’² + b’² + c’²) + b²(a’² − b’² + c’²) + c²(a’² + b’² − c’²) ≥ 16ΔΔ’,


with equality if and only if the two triangles ABC, A’B’C’ are similar. This 
further condition comes immediately from the construction. I imagine that if 
Finsler and Hadwiger had thought of this, they would have been rather excited, 
since they spent so much time on proving the inequality for one triangle. If one 
of the two triangles is already equilateral, the two-triangle inequality simplifies 
to the triangle inequality for the other triangle. There is a much easier method 
for obtaining it [5], of course. 
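
The circumcenter formula can be tested in the same way. The sketch below makes one assumption that the text leaves implicit: the triangle erected on BC has its apex angle equal to the angle A’ of the model triangle, the one on CA has apex angle B’, and the one on AB has apex angle C’. All names are illustrative. The function returns the three ratios VW/a’, WU/b’, UV/c’ together with the value predicted by 2(VW/a’)² = (R’/a’b’c’)²(Θ − 16ΔΔ’), using R’/a’b’c’ = 1/(4Δ’).

```python
import math


def area(a, b, c):
    """Area of the triangle with sides a, b, c (Heron's formula)."""
    s = (a + b + c) / 2
    return math.sqrt(s * (s - a) * (s - b) * (s - c))


def angle_opposite(x, y, z):
    """Angle opposite side x in a triangle with sides x, y, z."""
    return math.acos((y * y + z * z - x * x) / (2 * y * z))


def inward_circumcenter(P, Q, R, apex):
    """Circumcenter of a triangle on base PQ with the given apex angle,
    erected on the same side of PQ as R."""
    mid = (P + Q) / 2
    normal = (Q - P) * 1j
    to_r = R - mid
    if normal.real * to_r.real + normal.imag * to_r.imag < 0:
        normal = -normal
    # Signed distance from the base to the circumcenter is (|PQ| / 2) * cot(apex).
    return mid + normal / (2 * math.tan(apex))


def similarity_check(A, B, C, shape):
    """Ratios (VW/a', WU/b', UV/c') and the value predicted by the formula."""
    a1, b1, c1 = shape
    a, b, c = abs(B - C), abs(C - A), abs(A - B)
    U = inward_circumcenter(B, C, A, angle_opposite(a1, b1, c1))
    V = inward_circumcenter(C, A, B, angle_opposite(b1, c1, a1))
    W = inward_circumcenter(A, B, C, angle_opposite(c1, a1, b1))
    d, d1 = area(a, b, c), area(a1, b1, c1)
    th = (a * a * (-a1 * a1 + b1 * b1 + c1 * c1)
          + b * b * (a1 * a1 - b1 * b1 + c1 * c1)
          + c * c * (a1 * a1 + b1 * b1 - c1 * c1))
    predicted = math.sqrt(max(th - 16 * d * d1, 0.0) / 2) / (4 * d1)
    ratios = (abs(V - W) / a1, abs(W - U) / b1, abs(U - V) / c1)
    return ratios, predicted


if __name__ == "__main__":
    print(similarity_check(0j, 4 + 0j, 3j, (5, 6, 7)))
```

Equal ratios express the similarity of UVW and A’B’C’; the common value vanishes exactly when Θ = 16ΔΔ’.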

One final remark on this discovery. Since my note on the two-triangle in- 
equality mentioned orthogonal projection, it was rejected by a journal I sent 
it to, as being “unsuitable.” I rewrote the note, and gave the derivation from the 
method of Finsler and Hadwiger, illustrious names, after all, and obtained 
publication in a “Research Note” [6]. This was in 1943, and things published 
then did not travel, but I think that this discovery of the first interesting in- 
equality for two triangles does show how geometrical thinking, necessarily based 
on some knowledge of geometry, can be fruitful. 


2. My second look at geometrical thinking is bound up with what is called
the Six-Circle Theorem. We have four circles C₁, C₂, C₃, C₄ in the inversive
plane, and we suppose that C₁, C₂ intersect in the points P₁, Q₁, that C₂, C₃
intersect in P₂, Q₂, that C₃, C₄ intersect in P₃, Q₃, and that C₄, C₁ intersect in P₄, Q₄.
The theorem says that if the four points P₁, P₂, P₃, P₄ lie on a circle, then so do
the four points Q₁, Q₂, Q₃, Q₄. (Figure 1.)
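
A numerical experiment makes the statement concrete. In the sketch below, which is an illustration and not a proof, the points P₁, ..., P₄ are placed on the unit circle, each circle Cᵢ is drawn through its two prescribed points with a freely chosen center on their perpendicular bisector, and each Qᵢ is found by reflecting Pᵢ in the line joining the two centers. Four points are concyclic (or collinear) exactly when their cross ratio is real. The function names and the parameters `angles` and `ts` are assumptions of this sketch.

```python
import cmath


def center_through(Pa, Pb, t):
    """Center of a circle through Pa and Pb; t slides it along the perpendicular bisector."""
    return (Pa + Pb) / 2 + t * 1j * (Pb - Pa)


def reflect(z, c1, c2):
    """Mirror image of z in the line through c1 and c2."""
    d = c2 - c1
    return c1 + d * ((z - c1) / d).conjugate()


def cross_ratio(z1, z2, z3, z4):
    return (z1 - z3) * (z2 - z4) / ((z1 - z4) * (z2 - z3))


def six_circle_check(angles, ts):
    """Return the points Q1..Q4 and their cross ratio."""
    P = [cmath.exp(1j * a) for a in angles]        # P1..P4 on the unit circle
    # C1 passes through P4, P1; C2 through P1, P2; C3 through P2, P3; C4 through P3, P4.
    O = [center_through(P[i - 1], P[i], ts[i]) for i in range(4)]
    # P_i and Q_i are the two common points of consecutive circles, so Q_i is
    # the reflection of P_i in the line of their centers.
    Q = [reflect(P[i], O[i], O[(i + 1) % 4]) for i in range(4)]
    return Q, cross_ratio(*Q)


if __name__ == "__main__":
    Q, cr = six_circle_check([0.3, 1.4, 2.9, 4.6], [0.4, -0.7, 0.25, 1.1])
    print("cross ratio of Q1..Q4:", cr)
```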

FIG. 1


FIG. 2


This is a well-known theorem, and can be proved in a number of ways. It is
an exciting theorem, and I am not just representing my own state of mind about
it. It makes even the Russian writer Jaglom excited. He gives it in his recent
rather curious book Complex numbers in geometry [4, p. 35], where he says:
“This proposition seems rather elegant, but not particularly promising—an
ordinary theorem of which there are many in elementary geometry. However, 
the consequences which follow from this simple theorem can surely be called 
remarkable. As the first of these consequences we mention a whole series of 
theorems due to the English geometer Clifford.” One chain of theorems begins 
with three lines, which determine three points by their intersections. There is a 
unique circle through these points. Four lines determine four sets of three lines,
and therefore determine four circles. As a consequence of the Six-Circle Theorem 
these four circles meet in a point, which Jaglom calls the central point of the 
four lines. Five lines determine five sets of four lines, and each tetrad determines 
a central point. These five central points can be shown to lie on a circle, which 
Jaglom calls the central circle of the five lines, and so on. We are in the inversive 
plane, and must accept the fact that all lines contain the point at infinity, so 
that in the Six-Circle Theorem any one of the circles may be a line, and among 
the intersections of two lines there is always the point at infinity. 

I agree with Jaglom that complex numbers afford a ready means of proving 
theorems in plane geometry, but in his book he goes on to develop the theory of 
dual complex numbers at great length. These are numbers of the form $a + b\epsilon$,
where $a$, $b$ are real and $\epsilon^2 = 0$. The application is to oriented circles, that is to
circles with a sense, and he obtains, after many pages of work, another attractive
theorem, on oriented circles and their tangents. If $C_1$, $C_2$, $C_3$, $C_4$ are four oriented
circles, and the tangents to $C_1$, $C_2$ are $p_1$, $q_1$, the tangents to $C_2$, $C_3$ are $p_2$, $q_2$,
the tangents to $C_3$, $C_4$ are $p_3$, $q_3$, and the tangents to $C_4$, $C_1$ are $p_4$, $q_4$, then if
there is an oriented circle which touches the oriented lines $p_1$, $p_2$, $p_3$, $p_4$, there is
also an oriented circle which touches the oriented lines $q_1$, $q_2$, $q_3$, $q_4$. (Fig. 2.)

I shall show that both the Six-Circle Theorem and the theorem on oriented 
circles arise quite naturally if we map the circles of the Euclidean plane onto the 
points of $E_3$. We map the circle with equation

$$X^2 + Y^2 - 2pX - 2qY + r = 0$$

onto the point $(p, q, r)$ of $E_3$. This is a simple, yet fundamental mapping (it is
equivalent to stereographic projection), and it does illustrate what Henri 
Lebesgue has referred to as one of the important aspects of geometry, that it is 
almost a plastic art. Seeing theorems from different aspects, I believe, is one of 
the great charms of geometry. In the mapping I have just described, those sys- 
tems of circles called coaxial systems, or pencils of circles (given by the equation 
$kC + k'C' = 0$, where $C$ and $C'$ are given circles, and $k$, $k'$ vary over the real
numbers) are mapped onto the lines of $E_3$. Is there anything more natural than
a line in $E_3$? There appears a quadric $\Omega$ of equation $X^2 + Y^2 - Z = 0$, and the
points of this quadric represent the point-circles of the plane, those with zero 
radius. We soon find ourselves considering theorems which are obviously 
theorems of projective space, and there is a most attractive interplay between 
theorems for circles in the plane and theorems on points and lines in projective 
space $S_3$. All this is worked out in [7], and in more detail in [8].
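
The mapping can be tried out in a few lines of exact arithmetic. The following is only a sketch: the function names are illustrative choices, not notation from [7] or [8], and the circle is stored as the triple $(p, q, r)$ of the equation above. It checks that a point-circle lands on the quadric and that a member of a pencil lies on the line joining the two given circles.

```python
from fractions import Fraction

def circle_point(p, q, r):
    """Point (p, q, r) representing X^2 + Y^2 - 2pX - 2qY + r = 0."""
    return (Fraction(p), Fraction(q), Fraction(r))

def radius_squared(pt):
    """Squared radius p^2 + q^2 - r of the represented circle."""
    p, q, r = pt
    return p * p + q * q - r

def on_quadric(pt):
    """True when the point satisfies X^2 + Y^2 - Z = 0 (a point-circle)."""
    return radius_squared(pt) == 0

def pencil_member(c1, c2, k1, k2):
    """Point representing k1*C1 + k2*C2 = 0; assumes k1 + k2 != 0."""
    s = Fraction(k1) + Fraction(k2)
    return tuple((k1 * a + k2 * b) / s for a, b in zip(c1, c2))

def collinear(a, b, c):
    """True when three points of 3-space lie on one line (zero cross product)."""
    u = [b[i] - a[i] for i in range(3)]
    v = [c[i] - a[i] for i in range(3)]
    cross = (u[1] * v[2] - u[2] * v[1],
             u[2] * v[0] - u[0] * v[2],
             u[0] * v[1] - u[1] * v[0])
    return cross == (0, 0, 0)

# Two sample circles of radius 1, centred at (0, 0) and (2, 0).
c1 = circle_point(0, 0, -1)
c2 = circle_point(2, 0, 3)
member = pencil_member(c1, c2, 1, 2)
```

Dividing by $k_1 + k_2$ normalizes the leading coefficient to 1, which is why the pencil member is an affine combination of the two image points.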
The Six-Circle Theorem becomes the following theorem in $S_3$. Suppose $A$, $B$,
$C$, $D$ are four distinct points in $S_3$, suppose $\Omega$ is a given quadric, and suppose
$AB$ intersects $\Omega$ in $P_1$, $Q_1$. Assume also that the line $BC$ intersects $\Omega$ in the points
$P_2$, $Q_2$, the line $CD$ intersects $\Omega$ in the points $P_3$, $Q_3$, and finally the line $DA$
intersects $\Omega$ in the points $P_4$, $Q_4$. Then if the points $P_1$, $P_2$, $P_3$, $P_4$ lie in a plane, so do
the points $Q_1$, $Q_2$, $Q_3$, $Q_4$.

This theorem can be proved by using one of the most remarkable theorems 
on quadric surfaces in $S_3$, the theorem of the eight associated points. This says
that if three quadrics intersect in eight distinct points, then any quadric which
is made to pass through seven of these points will automatically pass through
the last point of the eight. The eight points $P_i$, $Q_i$ $(i = 1, 2, 3, 4)$ are associated,
since we can find three distinct quadrics which contain them. One is the quadric
$\Omega$ itself, another is the reducible quadric which consists of the planes $ABD$ and
$BCD$, and a third is the reducible quadric which consists of the planes $ABC$ and
$ADC$. The two reducible quadrics intersect in the lines $AB$, $BC$, $CD$, and $DA$,
so the three quadrics have only the eight points $P_i$, $Q_i$ in common. Let the plane
in which the four points $P_i$ lie be denoted by $\pi$, and let the plane $Q_1Q_2Q_3$ be
denoted by $\pi'$. Then the reducible quadric formed by the planes $\pi$ and $\pi'$ contains
seven of the eight associated points, and must therefore contain the eighth
point $Q_4$. If this does not lie in $\pi$, it must lie in $\pi'$. But it is seen immediately that
if $Q_4$ is coplanar with the points $P_1$, $P_2$, $P_3$, $P_4$, then the points $A$, $B$, $C$, $D$ are
coplanar, and the theorem is trivial.

Of course, since we are using the theorem of the eight associated points in 
all its strength, we must be certain that our proof of this theorem holds for all 
possible cases. Unfortunately, few textbooks give an acceptable proof. 

In $S_3$ we have a Principle of Duality, and so our theorem has an evident
dual. When we write this down, and consider how oriented circles can be represented
in $E_3$, this theorem for $S_3$, interpreted in $E_3$, gives us the theorem which
Jaglom only proves after many pages of work on dual complex numbers. I
shall not give the details here. They can be found in [8]. 


3. My last example on thinking geometrically is about conics, and I shall 
leave the question open as to whether the methods of the calculus would give a 
complete solution of the following problem. Given five distinct points in the 
Euclidean plane, we can draw a unique conic through them. If we begin with 
six distinct points, there are six conics which can be drawn to pass respectively 
through sets of five of the six given points. The problem is: can the six points be 
chosen so that (a) the six conics are all distinct ellipses, or (b) the six conics are 
all distinct hyperbolas, or (c) the six conics are all distinct parabolas? This 
question was asked some years ago by C. V. Durell, senior mathematics master 
at Winchester, one of Britain’s great public (private) schools. The solution was 
given at sight by Dr. Beniamino Segre when I showed him the problem. Dr. 
Segre, who was then in England, is an illustrious successor of the great Italian 
geometers Corrado Segre, Guido Castelnuovo and Francesco Severi. The Segre
theorem is that it is possible to find $n > 5$ points in the Euclidean plane so that
the $\binom{n}{5}$ conics through sets of five of them are either all distinct ellipses or
all distinct hyperbolas, but it is impossible to find six points so that the conics through
sets of five of them are all distinct and parabolas.

The proofs of the first two statements are similar, but the proof of the third 
part of the theorem has a completely different flavor, as we shall see. If we consider
five points $(x_i, y_i)$ $(i = 1, 2, 3, 4, 5)$, we can obtain the equation of the conic
which passes through them in various ways, by writing down a determinant,
for instance. We are interested only in the highest degree terms. If these are
$AX^2 + 2HXY + BY^2$, then the conic is an ellipse if and only if $H^2 - AB < 0$, and
an hyperbola if and only if $H^2 - AB > 0$. In terms of the coordinates of the five
points, we have the condition

$$\phi(x_1, y_1, \ldots, x_5, y_5) > 0,$$

where $\phi$ is a polynomial in the ten coordinates $(x_i, y_i)$, as the condition for an
ellipse, and $-\phi > 0$ as the condition for an hyperbola. Since a polynomial is a
continuous function of its variables, we know that we can draw small circles
with their centers at the points $(x_i, y_i)$ such that if $\phi > 0$ at the points $(x_i, y_i)$,
then we still have $\phi > 0$ at all points within the neighborhoods of the points.
Now, the geometrical idea is to begin with all the $n$ points on an ellipse. Then
all the conics through sets of five of the points are ellipses, the same ellipse, and
so $\phi > 0$ for each set of five points. We now only have to vary the $n$ points so as
to keep $\phi > 0$, but also so as to obtain $\binom{n}{5}$ distinct ellipses. The treatment for
hyperbolas is similar; of course, the details have to be filled in. You may say 
that this is essentially a calculus proof, that is, one which uses analysis. The 
proof that we cannot find six points to produce six parabolas is quite different. 
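
The determinant criterion described above can be sketched in exact rational arithmetic. This is an illustration under stated assumptions, not the computation in Segre's proof: the helper names are hypothetical, the five points are assumed to determine a unique conic, and degenerate conics (line pairs) are not distinguished from proper ones. The coefficients of the conic are taken as the signed $5 \times 5$ minors of the matrix whose rows are $(x^2, xy, y^2, x, y, 1)$.

```python
from fractions import Fraction

def det(m):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in m]
    n = len(m)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result
        result *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return result

def conic_through(points):
    """Coefficients of x^2, xy, y^2, x, y, 1 for the conic through five points."""
    rows = [[x * x, x * y, y * y, x, y, 1] for x, y in points]
    coeffs = []
    for k in range(6):
        minor = [row[:k] + row[k + 1:] for row in rows]
        coeffs.append((-1) ** k * det(minor))
    return coeffs

def classify(points):
    """Classify by the sign of H^2 - AB, where the top terms are AX^2 + 2HXY + BY^2."""
    a, two_h, b = conic_through(points)[:3]
    disc = two_h * two_h / 4 - a * b
    if disc < 0:
        return "ellipse"
    if disc > 0:
        return "hyperbola"
    return "parabola"

half = Fraction(1, 2)
on_circle = [(5, 0), (0, 5), (-5, 0), (3, 4), (4, -3)]
on_hyperbola = [(1, 1), (2, half), (-1, -1), (half, 2), (-2, -half)]
on_parabola = [(0, 0), (1, 1), (-1, 1), (2, 4), (-2, 4)]
```

Working with fractions rather than floating point matters here, because the parabola case is the boundary case $H^2 - AB = 0$ and would be lost to rounding.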

As soon as a geometer of what I must call the old school (I am one of them) 
sees six points in a plane, he thinks of the plane representation of a general 
cubic surface in $S_3$, in which plane sections of the surface are mapped onto cubic
curves in the plane which pass through the six given points. Conics through
five of the six given points intersect cubics through the six points in $2 \cdot 3 - 5 = 1$
variable point, and since the cubic curves represent plane sections of the surface, 
the conics represent lines on the surface. The points themselves through which 
the cubics are drawn also represent lines on the surface, so we have twelve lines 
of what is called a double-six on the surface; two sets of six lines, each of which 
meets five lines of the other set. This information is not relevant to the prob- 
lem, but merely indicates a fascinating branch of projective geometry. If the 
conics through sets of five of the six given points are all parabolas, then they all 
touch a special line in the plane. A line in the given plane meets the variable 
cubic curves through the six points in three variable points, and therefore repre- 
sents a space cubic curve on the cubic surface. We therefore have the situation, 
if there are indeed six parabolas, that the cubic surface contains a space cubic 
curve which touches the six lines of one half of a double six of lines on the surface.
We show that this assumption leads to a contradiction.
If we are skilled in the use of birational mappings of the projective plane
(Cremona transformations), we can map the sextuplet of lines on the surface 
represented by the conics through sets of five points of the plane onto six points 
of the plane, and see what happens. The space cubic curve, originally mapped 
onto a line of the plane, becomes an irreducible curve of order five with cusps at 
the six fundamental points which now represent the parabolas through the 
original sets of five points. The order is five, because cubics through the six new 
points still represent plane sections of the surface, and if $3N - 6 \cdot 2 = 3$, we have
N=5. The singularities must be cusps because the space cubic touches the 
lines which are now mapped onto points of the plane. 

But there is no irreducible curve of order five with cusps at six distinct 
points. If we apply Pluecker’s Formula to obtain the number of variable tan- 
gents which can be drawn to this quintic curve from an arbitrary point of the 
plane, we obtain the number $5 \cdot 4 - 6 \cdot 3 = 2$. It can be proved, however, that the
only irreducible curve to which two variable tangents can be drawn from an 
arbitrary point in the plane is a conic. We therefore have a contradiction. 


4. Not very long ago a mathematician, hot in the pursuit of modern mathematics,
complained to me that it is not easy to find a book which deals with
the plane representation of the cubic surface, and he needed it for his work. I 
met a young mathematician in Oxford recently who has studied the new alge- 
braic geometry, but feels he must now learn some geometry. And there are 
others who have the new algebraic techniques at their finger tips but are search- 
ing for geometric ideas to motivate their research. There is no doubt that 
geometry should not be neglected, although the feeling against it in some quar- 
ters is almost Freudian in its intensity. The grudges arise from having been 
taught too much geometry, or having been taught geometry very badly. There 
is no reason why a course in geometry should not be well taught, and the danger 
of having too much is no longer a serious one. The United States is passing 
through a transition period in the teaching of geometry, and it should soon 
catch up with other countries, such as the Soviet Union, which have always 
given a lot of attention to the teaching of geometry. If one could find nothing 
else to say about geometry, it has always been conceded that students enjoy it, 
and the Pleasure Principle should not be neglected in mathematics. I have 
tried to communicate this feeling in my recent book [8], in which I have put 
down the geometric ideas which turn up in mathematics. 


References 


1. Dieudonné, Linear Algebra and Geometry, Hermann, Paris, 1964.
2. Dieudonné, Foundations of Modern Analysis, Academic Press, New York, 1960.
3. Finsler and Hadwiger, Comment. Math. Helv., 10 (1937) 316-26.
4. Jaglom, Complex Numbers in Geometry, Academic Press, New York, 1968.
5. Pedoe, Problem E 1562, this MONTHLY, 70 (1963) 1012.
6. Pedoe, Proc. Cambridge Philos. Soc., 38 (1943) 397-98.
7. Pedoe, Circles, Pergamon Press, London, 1957.
8. Daniel Pedoe, A Course of Geometry for Colleges and Universities, Cambridge University
Press, England (in the press).
9. University of Minnesota College Geometry Project.
10. Weitzenboeck, Math. Zeit., 5 (1919) 137-146.


THE WILLIAM LOWELL PUTNAM 
MATHEMATICAL COMPETITION 


J. H. McKAY, Oakland University 


The following results of the thirtieth William Lowell Putnam Mathematical 
Competition held on December 6, 1969 have been determined in accordance 
with the regulations governing the Competition. This competition is supported 
by the William Lowell Putnam Intercollegiate Memorial Fund left by Mrs. 
Putnam in memory of her husband and is held under the auspices of the Math- 
ematical Association of America. 

The first prize, five hundred dollars, is awarded to the Department of 
Mathematics of Massachusetts Institute of Technology, Cambridge, Massachu- 
setts. The members of the team were Don Coppersmith, John J. Keary, and 
Jeffrey C. Lagarias; to each of these a prize of one hundred dollars is awarded. 

The second prize, four hundred dollars, is awarded to the Department of 
Mathematics of Rice University, Houston, Texas. The members of the team 
were Alan R. Beale, David A. Cox, and James B. Hobelman; to each of these 
a prize of seventy-five dollars is awarded. 

The third prize, three hundred dollars, is awarded to the Department of 
Mathematics of the University of Chicago, Chicago, Illinois. The members of 
the team were Robert B. Israel, David S. Fried, and Kiyoshi Igusa; to each of 
these a prize of fifty dollars is awarded. 

The fourth prize, two hundred dollars, is awarded to the Department of 
Mathematics of Harvard University, Cambridge, Massachusetts. The members 
of the team were George Sicherman, Gerald I. Myerson, and Mark A. Mostow; 
to each of these a prize of fifty dollars is awarded. 

The fifth prize, one hundred dollars, is awarded to the Department of 
Mathematics of Yale University, New Haven, Connecticut. The members of the 
team were Gregg J. Zuckerman, J. Lance W. Jayne, and Frederic B. Weissler; 
to each of these a prize of fifty dollars is awarded. 

The five persons ranking highest in the examination, named in alphabetical 
order, are Alan R. Beale, Rice University; Don Coppersmith, Massachusetts 
Institute of Technology; Gerald A. Edgar, University of California at Santa 
Barbara; Robert A. Oliver, University of Chicago; Steven Winkler, Massachusetts
Institute of Technology. Each of these has been designated a Putnam
Fellow by the Mathematical Association of America and is awarded a prize of
two hundred and fifty dollars.

The five persons ranking second highest in the examination, named in
alphabetical order, are Avner D. Ash, Harvard University; Martin J. Cohen,
University of California at Los Angeles; Alan S. Lederman, California Institute 
of Technology; Paul S. Vita, Harvard University; and Gregg J. Zuckerman, 
Yale University. To each of these a prize of one hundred dollars is awarded. 

The following teams, named in alphabetical order, won honorable mention: 
University of Alberta, the members of the team were Stephen Whitney, Eleanor 
McDonald, and John Mallet-Paret; University of California at Los Angeles, 
the members of the team were Martin J. Cohen, Paul F. Klembeck, and Michael 
D. Miller; California Institute of Technology, the members of the team were 
Thomas R. Davis, Jerry M. Feinberg, and Randolph S. Tuler; Princeton Uni- 
versity, the members of the team were Allen H. Back, Peter Malcolmson, and 
Steven H. Weintraub; Reed College, the members of the team were Thor O. 
Wenzel, R. Neil Vance, and Joe P. Buhler. 

Honorable mention is given to the following twenty-six individuals, named
in alphabetical order: Allen H. Back, Princeton University; Allen E. Barnes,
Drexel Institute of Technology; Kenneth A. Brakke, University of Nebraska;
Joe P. Buhler, Reed College; David M. Christie, Massachusetts Institute of
Technology; Thomas R. Davis, California Institute of Technology; Zbigniew
Fiedorowicz, Illinois Institute of Technology; Douglas A. Hensley, University of
Kansas; William L. Hibbard, University of Wisconsin at Madison; Dean
Hickerson, University of California at Davis; Kiyoshi Igusa, University of
Chicago; Robert Israel, University of Chicago; Robert J. Kimble, United States
Naval Academy; Nicholas Littlestone, Harvard University; Haynes R. Miller,
Harvard University; Gerald Myerson, Harvard University; James R. Paulson,
Princeton University; Adbeel N. Quiñones, New York University at Washington
Square; William H. Rowan, Stanford University; George Sicherman, Harvard
University; Walter Rees Stromquist, University of Kansas; Jon Sussman, San
Diego State College; Robert E. Tax, University of Chicago; Steven H. Weintraub,
Princeton University; Steven Zucker, Brown University; Michael Zuker, McGill
University.

The other individuals who were ranked in the top one hundred, arranged
by college, are: John J. Mallet-Paret and Stephen L. Whitney, University of
Alberta; Charles W. Kaufman, Bates College; Philip Trauber, Brooklyn College;
Douglas M. Lublin, Brown University; Daniel R. Farkas, University of California,
Santa Barbara; Peter Van Wyck Loomis, University of California, Davis;
Michael D. Miller, University of California, Los Angeles; Jon Gregor Bjornstad
and Steven G. Krantz, University of California, Santa Cruz; Jerry M. Feinberg
and David J. Smith, California Institute of Technology; James Richard Spriggs,
Case Western Reserve University; David S. Fried, David J. Saltman, and Frank
Anthony Wilczek, University of Chicago; Martin H. Ellis, City College of New
York; Robert B. Lumbert, Dartmouth College; Daniel E. Frohardt, Grinnell
College; Ira M. Gessel, Mark A. Mostow, John P. Robertson, Jonathan M. Rosenberg,
and Paul J. Weiner, Harvard University; James L. Hlavka, Harvey Mudd
College; James J. Callahan, Manhattan College; Charles E. Blain, Scott S.
Brown, Harold G. Donnelly, Daniel C. Galehouse, John J. Keary, Jeffrey C.
Lagarias, Wralf T. Penquin, and Kenneth P. Rietz, Massachusetts Institute of
Technology; Douglas A. Leonard and Robert L. Scott, University of Michigan;
Serge Hamelin, University of Montreal; Christopher A. Gurwood, New York
University, University Heights; Joseph L. Dunn, New York University, Washington
Square; Curtis D. Herink, North Central College; Kim B. Bruce and Everett
L. Bull, Jr., Pomona College; Alan M. Weinstein, Princeton University; Erik R.
Lockeberg, Queens University; R. Neil Vance, Reed College; David A. Cox and
James B. Hobelman, Rice University; Jerrold W. Grossman, Stanford University;
Benjamin J. Kuipers, Swarthmore College; Mark E. Kaminsky, The Cooper
Union; Daniel A. Gautreau, Daryl N. Geller, James M. Kavanagh, Hugh D.
Miller and Joseph G. Sunday, University of Toronto; George Schick Lueker,
Valparaiso University; Kent Brothers, University of Victoria; David G. Potter,
University of Waterloo; Paul F. Daniels, Williams College; Ronald R. Pannatoni,
University of Wisconsin, Madison; Kenneth P. Baclawski, University of Wisconsin,
Milwaukee; Frederick J. Bruch, University of Wisconsin, Parkside;
Jockum L. Aniansson, Dale H. Peterson and Frederic B. Weissler, Yale University.

One thousand five hundred and one students from two hundred eighty-eight 
colleges and universities participated in the examination on December 6, 1969. 

A listing of the top five hundred contestants may be obtained from the 
Director. The list, which includes addresses and expected dates of graduation, 
may be helpful to departments of mathematics in selecting graduate students. 

The Questions Committee, consisting of Leo Moser (chairman), Albert 
Wilansky, and Warren Loud, prepared the problems (listed below) for the 
competition. 


PROBLEMS. PART A 


A-1. Let f(x, y) be a polynomial with real coefficients in the real variables x and y defined over 
the entire x-y plane. What are the possibilities for the range of f(x, y)? 


A-2. Let $D_n$ be the determinant of order $n$ of which the element in the $i$th row and the $j$th
column is the absolute value of the difference of $i$ and $j$. Show that $D_n$ is equal to

$$(-1)^{n-1}(n-1)2^{n-2}.$$


A-3. Let $P$ be a non-self-intersecting closed polygon with $n$ sides. Let its vertices be $P_1$,
$P_2, \ldots, P_n$. Let $m$ other points, $Q_1, Q_2, \ldots, Q_m$, interior to $P$ be given. Let the figure be
triangulated. This means that certain pairs of the $(n+m)$ points $P_1, \ldots, Q_m$ are connected by line
segments such that (i) the resulting figure consists exclusively of a set $T$ of triangles, (ii) if two
different triangles in $T$ have more than a vertex in common then they have exactly a side in
common, and (iii) the set of vertices of the triangles in $T$ is precisely the set of $(n+m)$ points
$P_1, \ldots, Q_m$. How many triangles in $T$?


A-4. Show that

$$\int_0^1 x^x\,dx = \sum_{n=1}^{\infty} (-1)^{n+1} n^{-n}.$$

(The integrand is taken to be 1 at $x = 0$.)


A-5. Let $u(t)$ be a continuous function in the system of differential equations

$$\frac{dx}{dt} = -2y + u(t), \qquad \frac{dy}{dt} = -2x + u(t).$$

Show that, regardless of the choice of $u(t)$, the solution of the system which satisfies $x = x_0$, $y = y_0$
at $t = 0$ will never pass through $(0, 0)$ unless $x_0 = y_0$. When $x_0 = y_0$, show that, for any positive value
$t_0$ of $t$, it is possible to choose $u(t)$ so the solution is at $(0, 0)$ when $t = t_0$.


A-6. Let a sequence $\{x_n\}$ be given, and let $y_n = x_{n-1} + 2x_n$, $n = 2, 3, 4, \ldots$. Suppose that the
sequence $\{y_n\}$ converges. Prove that the sequence $\{x_n\}$ also converges.


PART B 


B-1. Let $n$ be a positive integer such that $n + 1$ is divisible by 24. Prove that the sum of all
the divisors of $n$ is divisible by 24.


B-2. Show that a finite group can not be the union of two of its proper subgroups. Does the 
statement remain true if “two” is replaced by “three”? 


B-3. The terms of a sequence $T_n$ satisfy

$$T_n T_{n+1} = n \quad (n = 1, 2, 3, \ldots) \qquad \text{and} \qquad \lim_{n \to \infty} \frac{T_n}{T_{n+1}} = 1.$$

Show that $\pi T_1^2 = 2$.


B-4. Show that any curve of unit length can be covered by a closed rectangle of area 1/4. 


B-5. Let $a_1 < a_2 < a_3 < \cdots$ be an increasing sequence of positive integers. Let the series

$$\sum_{n=1}^{\infty} 1/a_n$$

be convergent. For any number $x$, let $k(x)$ be the number of the $a_n$'s which do not exceed $x$. Show
that $\lim_{x \to \infty} k(x)/x = 0$.


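B-6. Let $A$ and $B$ be matrices of size $3 \times 2$ and $2 \times 3$ respectively. Suppose that their product
in the order $AB$ is given by

$$AB = \begin{pmatrix} 8 & 2 & -2 \\ 2 & 5 & 4 \\ -2 & 4 & 5 \end{pmatrix}.$$

Show that the product $BA$ is given by

$$BA = \begin{pmatrix} 9 & 0 \\ 0 & 9 \end{pmatrix}.$$
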

SOLUTIONS. PART A


The number in parentheses, immediately following the problem number, is the number of 
participants who received a score of 8, 9 or 10 (10 is maximum possible) on the problem. In the 
case of A-1, A-2, B-1 and B-2, this applies to all 1501 participants. For the other problems, the 
count applies only to the 839 qualifiers. 


A-1 (15). The continuity of f(x, y) implies that the range is connected (i.e., 
if $a$, $b$ are in the range and $a < c < b$ then $c$ is in the range). If the range is bounded
above and below, then the polynomial f(x, kx) is a constant for each value of 
k and thus f(x, y) is the constant f(0, 0). Thus the only possibilities are: (i) a 
single point; (ii) a semi-infinite interval with end-point; (iii) a semi-infinite 
interval without end-point; and (iv) all real numbers.

Examples are easily given for (i), (ii) and (iv). An example for (iii) is harder
to find. One way is to have each cross-section of the surface (for fixed $y$) be a
parabola with a minimum which decreases asymptotically toward some constant
as $y$ approaches $\pm\infty$. A suitable example is $(xy - 1)^2 + x^2$.
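
The behavior of this example can be checked exactly. In the sketch below (the function names are illustrative), the cross-section for fixed $y$ is the parabola $(1 + y^2)x^2 - 2yx + 1$ in $x$, whose vertex is at $x = y/(1 + y^2)$; the minimum value works out to $1/(1 + y^2)$, which is positive for every $y$ but tends to 0.

```python
from fractions import Fraction

def f(x, y):
    """The example polynomial (xy - 1)^2 + x^2."""
    return (x * y - 1) ** 2 + x ** 2

def min_over_x(y):
    """Exact minimizer and minimum of f(x, y) over x, for a fixed y."""
    y = Fraction(y)
    x_star = y / (1 + y * y)  # vertex of the parabola in x
    return x_star, f(x_star, y)

# Minimum values along a few cross-sections; each equals 1 / (1 + y^2).
minima = {y: min_over_x(y)[1] for y in range(-5, 6)}
```

Since every cross-section minimum is strictly positive, the value 0 is never attained, while values arbitrarily close to 0 are, so the range is the open half-line of positive reals.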


Comment: When the examination was printed it was believed that (iii) was not possible. The 
arguments used were erroneous. This extra complexity is the primary reason for the few high 
scores. 


A-2 (286). Subtract the first column from every other column. Then add the
first row to every other row. The last row now has all zeros except for $(n - 1)$
in the first column. $D_n$ is $(-1)^{n-1}(n-1)$ times the minor formed by deleting
the first column and last row from the transformed determinant. This minor has
only zeros below the main diagonal and thus is equal to the product of its
diagonal elements. Hence the minor has value $2^{n-2}$ and $D_n = (-1)^{n-1}(n-1)2^{n-2}$.


Alternate Solution: From the bottom row of $D_{n+1}$, subtract $1/(n-1)$ times
the first row and $n/(n-1)$ times the $n$th row. This shows that $D_{n+1}$
$= -[2n/(n-1)]D_n$, for $n > 1$. The result follows easily by iteration and the
observation that $D_2 = -1$.
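
The closed form is easy to confirm for small orders. The following sketch (helper names are illustrative) builds the matrix with entries $|i - j|$ and evaluates its determinant exactly over the rationals.

```python
from fractions import Fraction

def det(m):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in m]
    n = len(m)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result
        result *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return result

def d_n(n):
    """Determinant of the n x n matrix whose (i, j) entry is |i - j|."""
    return det([[abs(i - j) for j in range(n)] for i in range(n)])

def closed_form(n):
    """The value (-1)^(n-1) (n-1) 2^(n-2) claimed in the problem, for n >= 2."""
    return (-1) ** (n - 1) * (n - 1) * 2 ** (n - 2)
```

The same loop also confirms the recurrence $D_{n+1} = -[2n/(n-1)]D_n$ of the alternate solution, since both sides are computed exactly.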


A-3 (39). Let $t$ be the number of triangles. The sum of all the angles is $\pi t$
(since it is $\pi$ for each triangle) and it is also $2\pi m + (n - 2)\pi$. Hence $t = 2m + n - 2$.


Alternate Solution: Let $t$ be the number of triangles. In Euler's formula
$V - E + F = 2$, $F = t + 1$, and $V = n + m$. Since every edge is on two faces, $2E$
$= 3t + n$. Substitution leads directly to the answer $t = 2m + n - 2$.


Comment: It should have been stated in the problem that the interior of the polygon is
triangulated. If any of the additional line segments are outside of the polygon, the answer is different.


A-4 (30). A reasonable way to get a series (other than using Riemann sums,
which apparently doesn't work) is to write the integrand as a power of $e$ and
use the series expansion for the exponential. Then uniform convergence can be applied to
interchange integration and summation, and show that

$$\int_0^1 x^x\,dx = \sum_{m=0}^{\infty} \frac{1}{m!} \int_0^1 x^m (\log x)^m\,dx.$$

Let $F(m, k) = \int_0^1 x^m (\log x)^k\,dx$. Integration by parts shows, if applied to a typical
term in the summation, why we are interested in $F(m, k)$ and also shows that
$F(m, k) = -[k/(m+1)]F(m, k-1)$ for $m \ge 0$ and $k \ge 1$. As a result, $F(m, m)$
$= (-1)^m m!\,(m+1)^{-m} F(m, 0) = (-1)^m m!\,(m+1)^{-m-1}$. To get the given formula in
the problem, replace $m + 1$ by $n$ and adjust the limits on the summation accordingly.
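
A numerical comparison of the two sides is reassuring. The sketch below is only a sanity check, not part of the proof: it sums the first twenty terms of the series and compares the result with a midpoint-rule approximation of the integral (the step count is an arbitrary choice).

```python
def series_sum(terms=20):
    """Partial sum of sum_{n>=1} (-1)^(n+1) n^(-n)."""
    return sum((-1) ** (n + 1) * n ** (-n) for n in range(1, terms + 1))

def midpoint_integral(steps=20000):
    """Midpoint-rule approximation of the integral of x^x over [0, 1]."""
    h = 1.0 / steps
    return h * sum(((i + 0.5) * h) ** ((i + 0.5) * h) for i in range(steps))

series_value = series_sum()
integral_value = midpoint_integral()
```

The midpoint rule never evaluates the integrand at $x = 0$, so the convention that the integrand equals 1 there does not need special handling.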


A-5 (45). Subtracting the two equations eliminates $u(t)$ and provides the
simpler equation $d(x - y)/dt = 2(x - y)$, which has the solution $x - y = (x_0 - y_0)e^{2t}$.
If $x_0 \ne y_0$, the right hand side is never zero and so $x = y = 0$ can never occur.

For the second part, $x_0 = y_0$. In this case, $x(t) = y(t)$ and so every solution
is a parametrization of the line $x = y$. We can attempt to get a solution of the
form $x = x_0 - at$, $y = y_0 - at$. This will be a solution if $u(t) = 2(x_0 - at) - a$. By
taking $a = x_0/t_0$, $x = y = 0$ at $t = t_0$.


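A-6 (24). Let $\bar{y} = \lim_{n \to \infty} y_n$ and set $\bar{x} = \bar{y}/3$. We will show that $\bar{x} = \lim_{n \to \infty} x_n$.
For any $\epsilon > 0$ there is an $N$ such that for all $n > N$, $|y_n - \bar{y}| < \epsilon/2$. Then

$$\epsilon/2 > |y_n - \bar{y}| = |x_{n-1} + 2x_n - 3\bar{x}| = |2(x_n - \bar{x}) + (x_{n-1} - \bar{x})| \ge 2|x_n - \bar{x}| - |x_{n-1} - \bar{x}|.$$

This may be rewritten as $|x_n - \bar{x}| < \epsilon/4 + \frac{1}{2}|x_{n-1} - \bar{x}|$, which can be iterated to
give

$$|x_{n+m} - \bar{x}| < \frac{\epsilon}{4}\left(\sum_{i=0}^{m} 2^{-i}\right) + 2^{-(m+1)}|x_{n-1} - \bar{x}| < \epsilon/2 + 2^{-(m+1)}|x_{n-1} - \bar{x}|.$$

By taking $m$ large enough, $2^{-(m+1)}|x_{n-1} - \bar{x}| < \epsilon/2$. Thus for all sufficiently large
$k$, $|x_k - \bar{x}| < \epsilon$.
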
SOLUTIONS. PART B 
B-1 (229). The condition $24 \mid n + 1$ is equivalent to $n \equiv -1 \pmod 3$ and
$n \equiv -1 \pmod 8$. Let $d$ be a divisor of $n$, then $d \equiv 1$ or $2 \pmod 3$ and $d \equiv 1$, 3, 5 or
$7 \pmod 8$. Since $d(n/d) = n \equiv -1 \pmod 3$ or $\pmod 8$, the only possibilities are:

$d \equiv 1$, $n/d \equiv 2 \pmod 3$ or vice versa
$d \equiv 1$, $n/d \equiv 7 \pmod 8$ or vice versa
$d \equiv 3$, $n/d \equiv 5 \pmod 8$ or vice versa.

In every case, $d + n/d \equiv 0 \pmod 3$ and $\pmod 8$. Thus $d + n/d$ is a multiple of
24. Note that $d \ne n/d$ and thus no divisor is used twice in the pairing, so the sum
of all the divisors is a multiple of 24.
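
The pairing argument can be spot-checked by brute force. This sketch (the function name is illustrative) sums the divisors of $n$ by running through the pairs $d$, $n/d$ with $d^2 \le n$, just as in the solution, and tests every $n < 5000$ with $24 \mid n + 1$.

```python
def divisor_sum(n):
    """Sum of all positive divisors of n, collected in pairs d, n // d."""
    total = 0
    d = 1
    while d * d <= n:
        if n % d == 0:
            total += d
            if d != n // d:  # avoid counting a square root twice
                total += n // d
        d += 1
    return total

# All n below 5000 with n + 1 divisible by 24: n = 23, 47, 71, ...
candidates = range(23, 5000, 24)
all_divisible = all(divisor_sum(n) % 24 == 0 for n in candidates)
```

For such $n$ the guard against a repeated square root never fires, in agreement with the remark that $d \ne n/d$.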


B-2 (376). The number of elements in a subgroup is a divisor of the order of 
the group. Thus a proper subgroup can have no more than half of all the ele- 
ments. Two subgroups always have the identity in common and hence their 
union cannot be the entire group. 

An example for the second part of the problem is the Klein group, which has 
an identity and three elements $x$, $y$, $z$ of order two. The product of any two
distinct elements from $\{x, y, z\}$ is the third. This group is the union of three
proper subgroups. 


Alternate Solution: Let $G = H \cup K$, with $H$ and $K$ proper subgroups. There
exists $k \in K$ with $k \notin H$. None of the elements in $kH$ are in $H$ and so $kH \subset K$.
Hence $H \subset k^{-1}K = K$ and $K = H \cup K = G$, a contradiction.


B-3 (78). The first relation implies that

$$T_n = \frac{(n-1)(n-3)\cdots 3}{(n-2)(n-4)\cdots 2} \cdot \frac{1}{T_1} \quad \text{if } n \text{ is even},$$

$$T_n = \frac{(n-1)(n-3)\cdots 2}{(n-2)(n-4)\cdots 1} \cdot T_1 \quad \text{if } n \text{ is odd}.$$

If $n$ is odd,

$$\frac{T_n}{T_{n+1}} = (T_1)^2 \cdot \frac{2}{1} \cdot \frac{2}{3} \cdot \frac{4}{3} \cdot \frac{4}{5} \cdot \frac{6}{5} \cdots \frac{(n-1)}{(n-2)} \cdot \frac{(n-1)}{n}.$$

The Wallis product is $\pi/2 = \frac{2}{1} \cdot \frac{2}{3} \cdot \frac{4}{3} \cdot \frac{4}{5} \cdots$. After an even number of factors the
partial product is less than $\pi/2$ and after an odd number of factors the partial
product is greater than $\pi/2$. Thus for the case when $n$ is odd, $T_n/T_{n+1} < \frac{1}{2}\pi T_1^2$.
A similar calculation shows that, when $n$ is even, $T_n/T_{n+1} < 2/(\pi T_1^2)$. Since the
limit of $T_n/T_{n+1} = 1$, 1 is less than or equal to both $\frac{1}{2}\pi T_1^2$ and its reciprocal. This
implies that $\pi T_1^2 = 2$.
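
The role of the initial value can be seen numerically. In this sketch (floating point, with an illustrative function name) the recurrence $T_{n+1} = n/T_n$ is run forward from a given $T_1$; with $T_1 = \sqrt{2/\pi}$ the ratios approach 1, while with $T_1 = 1$ they settle near $\pi/2$ for odd $n$ and near $2/\pi$ for even $n$.

```python
import math

def ratios(t1, n_max):
    """Return [T_n / T_{n+1} for n = 1..n_max], given T_1 = t1 and T_n T_{n+1} = n."""
    out = []
    t = t1
    for n in range(1, n_max + 1):
        t_next = n / t
        out.append(t / t_next)
        t = t_next
    return out

good = ratios(math.sqrt(2 / math.pi), 10001)  # T_1 chosen so that pi * T_1^2 = 2
bad = ratios(1.0, 10001)                      # any other T_1 gives two limit points
```

The last two entries of each list correspond to $n = 10000$ (even) and $n = 10001$ (odd), which is enough to see both subsequences.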


B-4 (5). Place the curve so that its endpoints lie on the x-axis. Then take 
the smallest rectangle with sides parallel to the axes which covers the curve. 
Let its horizontal and vertical dimensions be $a$ and $b$ respectively. Let $P_0$ and $P_5$
be the endpoints of the curve, and let $P_1$, $P_2$, $P_3$, and $P_4$ be the points on the
curve, in the order named, which lie one on each of the four sides of the rectangle.
Draw the broken line $P_0P_1P_2P_3P_4P_5$. This line has length at most one.
The horizontal components of the segments of this broken line add up at least
to $a$, since one of the vertices of the broken line lies on the left end of the rectangle
and one on the right end. The vertical components add to at least $2b$ since
we start and finish on the x-axis and go to both the top and bottom sides. This
implies that the total length of the broken line is at least $(a^2 + 4b^2)^{1/2}$.

We now have that $a$ and $b$ both lie between 0 and 1 and that $a^2 + 4b^2 \le 1$.
Under these conditions the product $ab$ is a maximum for $a = \frac{1}{2}\sqrt{2}$, $b = \frac{1}{4}\sqrt{2}$ and
so the maximum of $ab$ is $\frac{1}{4}$. Thus the area of the rectangle we have constructed
is at most $\frac{1}{4}$.


B-5 (50). The following proof shows it is not necessary to stipulate that 
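the $a_n$ be integers. Suppose for some $\epsilon > 0$ there are $x_j \to \infty$ with $k(x_j)/x_j \ge \epsilon$. Note
that if $1 \le n \le k(x_j)$, then (because the $a_n$ increase) $a_n \le a_{k(x_j)} \le x_j$ and $1/a_n \ge 1/x_j$.
Now for any positive integer $N$,

$$\sum_{n=N}^{\infty} 1/a_n \ge \sup_j \sum_{n=N}^{k(x_j)} 1/a_n \ge \sup_j \frac{k(x_j) - N}{x_j} \ge \sup_j (\epsilon - N/x_j) = \epsilon.$$

But this contradicts the convergence of $\sum_{n=1}^{\infty} 1/a_n$, which implies

$$\lim_{N \to \infty} \sum_{n=N}^{\infty} 1/a_n = 0.$$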


Comment: The above solution was formulated by a grader on the basis of a student's solution.


B-6 (6). Observe that $ABAB = 9AB$. $AB$ is of rank two so $A$ and $B$ are both of
rank two. Hence there exist matrices $A'$ and $B'$ such that $A'A = I = BB'$,
where $I$ is the $2 \times 2$ identity matrix. Then $A'(ABAB)B' = BA = 9I$.


Alternate Solution: $(AB)^2 = 9AB$. The rank of $BA$ is greater than or equal
to the rank of $A(BA)B$, which is 2. Thus $BA$ is nonsingular. But $(BA)^3$
$= B(AB)^2A = B(9AB)A = 9(BA)^2$ and the result follows since $BA$ has an inverse.
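
A concrete instance makes the statement tangible. The factorization below is an illustrative assumption, not taken from the problem: one possible choice is $A$ with columns $(2, 2, 1)$ and $(2, -1, -2)$ and $B$ equal to the transpose of $A$. The sketch checks that this pair reproduces the given product $AB$, that $(AB)^2 = 9AB$, and that $BA = 9I$.

```python
def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

# Hypothetical factorization chosen for illustration: B is the transpose of A.
A = [[2, 2], [2, -1], [1, -2]]
B = [list(col) for col in zip(*A)]

AB = matmul(A, B)
BA = matmul(B, A)
AB_squared = matmul(AB, AB)
```

Any other factorization of the same $AB$ must also give $BA = 9I$, which is the content of the problem; the code only exhibits one case.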


CORRECTION TO “LIFE WITHOUT $T_2$”


ALBERT WILANSKY, Lehigh University 


It was pointed out to me by D. E. Sanderson and J. W. Taylor that part of
Example 2.2 (this MONTHLY, 77 (1970) 157-161) is not correct. Sanderson supplies
the following example of a Baire space with a retraction onto a non-Baire
subspace. Let $I$ be the integers. Subsets of the plane are defined thus:
let $S=\{(m/2^n, 0) : m, n\in I\}$, $Z_n=\{(x, 1/n) : x\ne m/2^n \text{ for all } m\in I\}$,
$Z=\bigcup\{Z_n : n\in I\}$. Then $Z\cup S$ is a Baire space, $S$ is not, and $r: Z\cup S\to S$ is a
retraction onto $S$, where $r$ maps each component of $Z$ onto the midpoint of its
projection on the X-axis.

The following substitute example was suggested by G. A. Stengle. Let $P$ be
the property of certain topological spaces $X$: “$X$ is homeomorphic with an
interval $[a, b]$, $a<b$”. Then $P$ is preserved under retraction but not under taking
continuous images or closed subspaces.


MATHEMATICAL NOTES 


EDITED BY DAVID DRASIN


Manuscripts for this Department should be sent to David Drasin, Division of Mathematical 
Sciences, Purdue University, Lafayette, IN 47907. 


A BOREL SET WHICH CONTAINS NO RECTANGLES 


RICHARD DARST AND CASPER GOFFMAN, Purdue University 


This note contains an elementary example of a set $E$ of positive two dimensional
Lebesgue measure $m_2$ in the plane which not only contains no measurable
rectangle of positive measure, but also has the property that every measurable
rectangle of positive measure intersects the complement $B$ of $E$ in a set of
positive measure. This latter property implies that if $E$ is augmented by a set $X$ of
planar measure zero to form a measurable set $Y$, then $Y$ will contain no
measurable rectangle of positive measure.

Recall that a measurable rectangle of positive measure is a set of the form
$U\times V$, where each of $U$ and $V$ is a measurable subset of the set $R$ of real
numbers, and the one dimensional Lebesgue measure $m_1$ of both $U$ and $V$ is positive.

Let $A$ be a measurable subset of $R$ such that both $A$ and its complement
$R-A$ have subsets of positive measure in every interval [1, p. 195, Ex. 4.1].
Denote by $B$ the measurable subset of $R^2$ whose points belong to the lines with
slope one which intersect the x-axis in a point of $A$, and let $E=R^2-B$.

Suppose that $C\times D$ is a measurable rectangle of positive measure. Then there
are measurable subsets $F$ and $G$ of $C$ and $D$ and a set $N$ of planar measure zero
such that

(1) every point of $F$ is a point of density of $F$,

(2) every point of $G$ is a point of density of $G$, and

(3) $C\times D=(F\times G)\cup N$.

(See [1, p. 174, Theorem 2] and [2, p. 129].) Without loss of generality, we
assume that $(0, 0)\in F\times G$.

Let $U$ be an interval of length $\varepsilon$, symmetric about zero, and such that

$$m_1(U\cap F)>.9\varepsilon\quad\text{and}\quad m_1(U\cap G)>.9\varepsilon.$$

Let $\varphi$ be defined on $R$ by
$\varphi(t)=m_1([U\cap F]\cap\{x : x=y+t \text{ for some } y\in(U\cap G)\})$.

Then $\varphi(0)=m_1([U\cap F]\cap[U\cap G])>.8\varepsilon$. The function $\varphi$ is continuous since
if $T_t=\{x+t : x\in T\}$ then for $S$ and $T$ measurable sets, $m_1(S\cap T_t)$ is continuous
in $t$. This is obvious since it clearly holds if $S$ and $T$ are intervals, and measurable
sets can be approximated by finite unions of intervals.

Let $\delta$ be a positive number such that $\varphi(t)\ge .8\varepsilon$ whenever $|t|<\delta$. Then for
$|t|\le\delta$, $m_1(\{x\in(U\cap F) : x-y=t \text{ for some } y\in(U\cap G)\})\ge .8\varepsilon$. Thus

$$m_2(B\cap[F\times G])\ge\sqrt2\,(.8\varepsilon)\,m_1(A\cap[-\delta,\delta])>0.$$


References 
1. Casper Goffman, Real Functions, Prindle, Weber and Schmidt, Boston, 1967. 


2. S. Saks, Theory of the Integral, translated by L. C. Young, Warszawa-Lwow, 1937. 


A CONTINUOUS ALMOST PERIODIC FUNCTION HAS EVERY CHORD
M. C. Tews, College of the Holy Cross


A continuous real-valued function $f$ with domain the set of real numbers is
said to be almost periodic if, for each positive number $\varepsilon$, the set

$$E\{\varepsilon, f\}=\{t : |f(x+t)-f(x)|<\varepsilon \text{ for all } x\}$$

is relatively dense, i.e., there is a positive number $m$ such that $E\{\varepsilon, f\}$ has at
least one element in each open interval of length $m$. That a continuous periodic
function with period $p$ is almost periodic follows by choosing $m$ equal to $p+1$,
since $E\{\varepsilon, f\}\supset\{np : n \text{ an integer}\}$ for all $\varepsilon>0$. It is the purpose of this note to
show that the technique used in [2] to prove that a continuous periodic function
has horizontal chords of arbitrary length will yield the same result for
almost periodic functions.

Before proceeding to the proof, we recall some facts about almost periodic
functions, all of which are stated and proved in Chapter 1, section 1 of [1].

(i) $t\in E\{\varepsilon, f\}$ if and only if $-t\in E\{\varepsilon, f\}$.

(ii) If $0<q\le m$, then $E\{q, f\}\subset E\{m, f\}$.

(iii) Almost periodic functions are uniformly continuous.

(iv) If $f$ and $g$ are almost periodic, then $E\{\varepsilon, f\}\cap E\{\varepsilon, g\}$ is relatively dense
for each $\varepsilon>0$.


The notation $(m, n)$ is used for the interval $\{x : m<x<n\}$.

Let $f$ be an almost periodic function and let $a$ be a positive real number. To
prove that $f$ has a chord of length $a$, we shall show that $f(x+a)-f(x)=0$ for
some real number $x$. If this is not the case, then the continuous function defined
by $x\to f(x+a)-f(x)$ is either always positive or always negative, by the
intermediate value theorem. We assume, without loss of generality, that $f(x+a)-f(x)>0$
for all $x$, and shall obtain a contradiction. Let $\varepsilon>0$. By (iii), we can
choose a positive number $\delta$ such that

$$|f(x)-f(y)|<\varepsilon/2,\quad\text{whenever } |x-y|<\delta.$$

The function $g$, with period $a$, defined by $g(x)=\sin(2\pi x/a)$ is used as a tool.
Let $p$ be any positive number less than the minimum of $\delta$ and $a/8$ and let
$\beta=g(p)$. Suppose $t$ is such that

$$|g(x+t)-g(x)|<\beta\quad\text{for all } x.$$

Choosing $x=0$, this inequality gives

$$\Bigl|\sin\Bigl(\frac{2\pi}{a}t\Bigr)\Bigr|<\beta=\sin\Bigl(\frac{2\pi}{a}p\Bigr),$$

hence

$$t\in\bigcup_{-\infty<n<+\infty}\Bigl(\frac{na}{2}-p,\ \frac{na}{2}+p\Bigr);$$

however, choosing $x=a/4$ gives

$$\Bigl|\sin\Bigl[\frac{2\pi}{a}\Bigl(\frac a4+t\Bigr)\Bigr]-\sin\Bigl(\frac{2\pi}{a}\cdot\frac a4\Bigr)\Bigr|<\sin\frac{2\pi}{a}p,$$

which is the same as $|\cos(2\pi t/a)-1|<\sin(2\pi p/a)$. Hence

$$\cos\Bigl(\frac{2\pi}{a}t\Bigr)>1-\sin\frac{2\pi}{a}p>0,$$

so that

$$t\in\bigcup_{-\infty<n<+\infty}\Bigl(na-\frac a4,\ na+\frac a4\Bigr).$$

Since $p$ is less than $a/8$, intersecting these two sets yields

$$t\in\bigcup_{-\infty<n<+\infty}(na-p,\ na+p)$$

and, since $p$ is less than $\delta$, $t\in\bigcup_{-\infty<n<+\infty}(na-\delta,\ na+\delta)$. Therefore,

$$E\{\beta, g\}\subset\bigcup_{-\infty<n<+\infty}(na-\delta,\ na+\delta).$$

Finally, let $\eta=\min\{\beta, \varepsilon/2\}$. We have proved that

$$E\{\eta, g\}\cap E\{\eta, f\}\subset E\{\eta, g\}\subset E\{\beta, g\}\subset\bigcup_{-\infty<n<+\infty}(na-\delta,\ na+\delta),$$


where the second set containment is by (ii). By (iv), the set on the left is
relatively dense, and hence contains arbitrarily large numbers. We can, therefore,
choose $t\in E\{\eta, g\}\cap E\{\eta, f\}$ such that $t=ma+q$, where $m$ is a positive integer
and $|t-ma|=|q|<\delta$. By the choice of $\delta$, for all $x$ we have

$$|f(x+ma-t)-f(x)|<\varepsilon/2$$

and, since by (i), $-t$ is also a member of $E\{\eta, f\}$, we have

$$|f(x+ma)-f(x+ma-t)|<\eta.$$

Recalling that $x\to f(x+a)-f(x)$ is positive, we obtain

$$0<f(x+a)-f(x)\le\sum_{k=1}^{m}[f(x+ka)-f(x+(k-1)a)]$$
$$=f(x+ma)-f(x)=[f(x+ma)-f(x+ma-t)]+[f(x+ma-t)-f(x)]<\eta+\varepsilon/2\le\varepsilon.$$

However, since $\varepsilon$ is arbitrary, this implies that $f(x+a)-f(x)$ is zero for all $x$,
which is our desired contradiction. We conclude that $f$ has a horizontal chord of
length $a$.
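
The existence statement can be illustrated numerically. The sketch below is not part of the proof: it takes one particular almost periodic function, $f(x)=\sin x+\sin(\sqrt2\,x)$ (an assumed example), scans for a sign change of $x\to f(x+a)-f(x)$, and refines it by bisection. The names `f` and `find_chord`, the search window, and the step count are all illustrative choices.

```python
from math import sin, sqrt

def f(x):
    """An almost periodic function: a sum of two incommensurable sines."""
    return sin(x) + sin(sqrt(2) * x)

def find_chord(func, a, lo=0.0, hi=50.0, steps=5000):
    """Return x in [lo, hi] with func(x + a) == func(x) (approximately),
    or None if no sign change of the difference is seen on the grid."""
    def h(x):
        return func(x + a) - func(x)

    width = (hi - lo) / steps
    left = lo
    for i in range(1, steps + 1):
        right = lo + i * width
        if h(left) == 0:
            return left
        if h(left) * h(right) < 0:
            # Bisection on a bracketing interval of the continuous map h.
            for _ in range(200):
                mid = (left + right) / 2
                if h(left) * h(mid) <= 0:
                    right = mid
                else:
                    left = mid
            return (left + right) / 2
        left = right
    return None

chord_starts = {a: find_chord(f, a) for a in (0.5, 1.0, 3.7)}
```

Each value `chord_starts[a]` is the left endpoint of a horizontal chord of length `a` for this sample function; the grid scan is only a heuristic and could miss a zero where the difference touches zero without changing sign.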

In [2], the “Dirichlet box principle” is used as a basic tool. That tool is used 
in this note but is hidden in the proof of 10° of [1] which is used to prove (iv). 


A CHARACTERIZATION AND A CANONICAL DECOMPOSITION OF
HURWITZIAN MATRICES


R. DATKO, Georgetown University, and V. SESHADRI, McGill University


A complex $n\times n$ matrix $A$ is called Hurwitzian if all its eigenvalues have
negative real parts. In this note we shall use a result of A. M. Lyapunov to obtain a
characterization and a canonical decomposition for such matrices. First some
notation will be established.


NOTATION: (a) $A, B, \dots$, etc. will denote complex $n\times n$ matrices. (b) $A^*$
will denote the conjugate transpose of a matrix $A$. (c) If $Q$ is a positive definite
Hermitian matrix, then we shall denote this by $Q>0$. (d) If $Q>0$ then $Q^{1/2}$ will
denote the unique positive definite square root of $Q$ (see, e.g., [1] p. 92).


The basis for the results obtained in this note is the following lemma (see,
e.g., [2] p. 155):


LEMMA. If $A$ is Hurwitzian, the solution in $B$ of the equation $A^*B+BA=-C$
is unique and is given by the formula

$$B=\int_0^{\infty}e^{A^*t}\,C\,e^{At}\,dt.$$

If $C>0$, then $B>0$. Moreover, if $C>0$ and $B>0$ and $A^*B+BA=-C$, then $A$
is Hurwitzian.


Let $A$ be Hurwitzian and choose $C=I$. Then on the basis of the lemma, there
is a unique $B>0$ such that

(1)    $A^*B+BA=-I.$

Define the matrix $S$ by the relation

(2)    $-\tfrac12B^{-1}+S=A.$

Substituting this expression for $A$ in (1) yields the equation $S^*B+BS=0$ or,
what is the same,

(3)    $-S^*B=BS.$

We multiply both sides of (3) by $B^{-1/2}$ on the left and then by $B^{-1/2}$ on the right
to obtain

(4)    $-B^{-1/2}S^*B^{1/2}=B^{1/2}SB^{-1/2},$

which shows that $S$ is similar to a skew-Hermitian matrix.


PROPERTY 1. Any Hurwitzian matrix $A$ can be written in the form

$$A=-Q+T,$$

where $Q>0$ and $T$ is similar to a skew-Hermitian matrix.

Proof. Let $A$ be a Hurwitzian matrix and let $B$ satisfy (1). Using equation (2)
we write $A=-\tfrac12B^{-1}+S$. By equation (4), $S$ is similar to a skew-Hermitian
matrix and since $B>0$, $B^{-1}/2>0$. Letting $Q=B^{-1}/2$ and $T=S$ establishes
Property 1.
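
The decomposition can be traced on a small real example, where the conjugate transpose is the ordinary transpose. The sketch below is illustrative: the matrix `A` (eigenvalues $-1$, $-1$) and the solution `B` of equation (1), worked out by hand for this example, are assumptions made for the check, and the helper functions are hypothetical names. Exact rational arithmetic avoids rounding questions.

```python
from fractions import Fraction as F

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(col) for col in zip(*X)]

def add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def scale(c, X):
    return [[c * x for x in row] for row in X]

def inverse_2x2(X):
    (a, b), (c, d) = X
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

# Sample Hurwitzian matrix (both eigenvalues equal -1).
A = [[F(-1), F(2)],
     [F(0), F(-1)]]

# Candidate solution of A*B + BA = -I for this A (computed by hand).
B = [[F(1, 2), F(1, 2)],
     [F(1, 2), F(3, 2)]]

lyapunov_lhs = add(matmul(transpose(A), B), matmul(B, A))   # should be -I
Q = scale(F(1, 2), inverse_2x2(B))                          # Q = B^{-1}/2
S = add(A, Q)                                               # from relation (2)
skew_check = add(matmul(transpose(S), B), matmul(B, S))     # should be 0
```

Here `Q` has positive leading minors ($3/2$ and $1/2$), so $Q>0$, and `S` has trace $0$ and determinant $1/2$, hence purely imaginary eigenvalues, as expected of a matrix similar to a skew-Hermitian one.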


PROPERTY 2. A matrix $A$ is Hurwitzian if and only if it is similar to a matrix
of the form

$$-Q+T,$$

where $Q>0$ and $T$ is skew-Hermitian.


Proof. If $A$ is Hurwitzian and $B$ satisfies (1) then, by equation (2),
$A=-\tfrac12B^{-1}+S$. Multiplying both sides of this last equation by $B^{1/2}$ on the left and
then by $B^{-1/2}$ on the right we obtain

(5)    $B^{1/2}AB^{-1/2}=-(B^{-1}/2)+B^{1/2}SB^{-1/2}.$

Letting $Q=B^{-1}/2$ and $T=B^{1/2}SB^{-1/2}$ and taking cognizance of equation (4)
establishes the necessity of the property.

Next, suppose $A$ is similar to $K=-Q+T$ where $Q>0$ and $T$ is skew-Hermitian.
Then $K+K^*=-2Q$. Hence $K$ satisfies the hypotheses of the lemma
where $B=I>0$ and $C=2Q>0$, so that $K$ is Hurwitzian. From this it follows that
$A$ is Hurwitzian.

Property 1 is the canonical decomposition mentioned above and Property 2 
is the characterization. 


References 


1. R. Bellman, Introduction to Matrix Theory, McGraw-Hill, New York, 1960. 
2. A. Halanay, Differential Equations, Academic Press, New York and London, 1966. 


ON GROUPS OF MOTIONS GENERATED BY TWO ROTATIONS 
LEOPOLD FLATTO, Yeshiva University 


1. Let $R_1$ and $R_2$ denote respectively the rotations of the Euclidean plane
about the distinct points $P_1$ and $P_2$, the angle of rotation being $2\pi/n$, where $n$
is a positive integer. It is a well-known fact that the group $G$ generated by $R_1$
and $R_2$ is discrete if and only if $n=1, 2, 3, 4, 6$. A simple geometric proof of this
fact was given by Barlow (see [1], page 60). In this note we give an algebraic
proof which actually yields somewhat more information. In section 2 we give
an application of Theorem 1 to the study of level curves of harmonic functions
in two variables.


THEOREM 1. Let $G$ be the group generated by two distinct two-dimensional
rotations, the angle of rotation being $2\pi/n$. If $n\ne 1, 2, 3, 4, 6$ then the subgroup $G_0$
consisting of the translations in $G$ is dense in the group $G_T$ of all translations.


Proof. We may assume that the centers of rotations of $R_1$ and $R_2$ in the
complex plane are given respectively by 0 and 1. Thus

$$R_1(z)=\zeta z,\qquad R_2(z)=\zeta z+(1-\zeta),$$

where $\zeta=\exp[2\pi i/n]$. Let $T_m(z)=R_1^{m}R_2R_1^{-(m+1)}(z)$. Direct computations yield
$T_m(z)=z+\zeta^m(1-\zeta)$.

$G_0$ contains the translations

$$T_0^{j}T_1^{k}T_{-1}^{k}(z)=z+(j+kx)(1-\zeta),$$
$$T_1^{j}T_2^{k}T_0^{k}(z)=z+(j+kx)\zeta(1-\zeta),$$

where $x=\zeta+\zeta^{-1}=2\,\mathrm{Re}\,\zeta$ and $j$, $k$ are arbitrary integers.

For $n>2$, $T_0^{j}T_1^{k}T_{-1}^{k}$ and $T_1^{j}T_2^{k}T_0^{k}$ are translations in two different directions.
If $x$ is irrational, then Kronecker’s theorem [2, page 370] states that the numbers
$\{j+kx\}$ $(-\infty<j, k<\infty)$ form a dense set of real numbers. Thus $G_0$ is dense
in $G_T$ provided $x$ is irrational. If $x$ is rational then $\zeta^2-x\zeta+1=0$ implies that the
minimal polynomial of $\zeta$ over the rational field has degree $\le2$. But the minimal
polynomial of $\zeta$ over the rational field is the cyclotomic polynomial of degree
$\phi(n)$ [3, page 162]. Thus $\phi(n)\le2$, so that $n=1, 2, 3, 4, 6$.

We have thus shown that $G_0$ is dense in $G_T$ except for $n=1, 2, 3, 4, 6$. These
latter values are indeed exceptional as $G$ is then known to be discrete [1].
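
The final step of the proof, that $\phi(n)\le2$ forces $n=1, 2, 3, 4, 6$, is easy to check by direct enumeration. The sketch below is a sanity check over a finite range only (the bound 200 and the names `euler_phi` and `x_value` are illustrative assumptions); it also evaluates $x=2\cos(2\pi/n)$ at the exceptional values, where it is an integer.

```python
from math import gcd, cos, pi

def euler_phi(n):
    """Euler's totient: the degree of the n-th cyclotomic polynomial."""
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)

def x_value(n):
    """x = zeta + 1/zeta = 2 Re(zeta) for zeta = exp(2*pi*i/n)."""
    return 2 * cos(2 * pi / n)

# Values of n (below an arbitrary search bound) for which phi(n) <= 2.
exceptional = [n for n in range(1, 200) if euler_phi(n) <= 2]

# At these n, x is an integer up to floating-point rounding.
x_at_exceptional = {n: round(x_value(n)) for n in exceptional}
```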


2. We now give an application of the previous discussion. We employ the
following terminology. A curve $\Gamma$ is said to be a level curve of a harmonic function
if there exists a function harmonic in the entire plane, except for isolated
singularities, which vanishes on $\Gamma$ but not identically. We obtain the following result:


THEOREM 2. Let $\Gamma$ be the image of the line $x=c$ $(c\ne0)$ under the map $w=z^n$,
$z=x+iy$. Then $\Gamma$ is a level curve of a harmonic function iff $n=1, 2, 3, 4, 6$.


Proof: Let $v(z)=u(z^n)$. If $u(z)$ is a harmonic function vanishing on $\Gamma$, then
$v(z)$ is harmonic and vanishes on the line $x=c$. It follows from the reflection
principle for harmonic functions that $v(z)=-v(R(z))$ where $R$ denotes the
reflection in the line $x=c$. Let $R_1(z)=\zeta z$ and $R_2(z)=RR_1^{-1}R(z)$, $\zeta=\exp[2\pi i/n]$.
Then $v(R_1(z))=v(R_2(z))=v(z)$ so that $v(g(z))=v(z)$ for $g\in G$, $G$ being the group
generated by $R_1$ and $R_2$. $R_1$ and $R_2$ denote respectively rotations by an angle
$2\pi/n$ about the points $0$, $2c$. It follows from Theorem 1 that for $n\ne1, 2, 3, 4, 6$,
$v(z)$ is a constant. Since $v(z)=0$ on the line $x=c$, $u(z)=v(z)=0$ for all $z$. Hence $\Gamma$ is not a
level curve for these values of $n$.

For $n=1, 2, 3, 4, 6$, $\Gamma$ is a level curve of a harmonic function. This may be
shown as follows. We establish the existence of a nonconstant function $f(z)$ with
the following properties:

1. $f(z)$ is meromorphic in the entire plane.

2. $f(\zeta z)=f(z)$ for all $z$, where $\zeta=e^{2\pi i/n}$.

3. $\mathrm{Im}[f(z)]=0$ on the line $x=c$.

It follows that $f(z^{1/n})$ is well defined and meromorphic in the entire plane.
Furthermore, $\mathrm{Im}[f(z^{1/n})]=0$ for $z\in\Gamma$. The nonconstant harmonic function
$u(z)=\mathrm{Im}[f(z^{1/n})]$ has only isolated singularities, as $f(z^{1/n})$ is meromorphic in the
entire plane, and $\Gamma$ is a level curve of $u(z)$.

It remains to establish the existence of $f(z)$ having the properties 1-3. Since
the existence of an $f(z)$ for a given $n$ implies the existence of an $f(z)$ for any
divisor of $n$, it suffices to consider the cases $n=4, 6$. Suppose that $n=4$. Let $\Omega$
denote the lattice consisting of the points $mc+nci$, where $-\infty<m<\infty$,
$-\infty<n<\infty$, and denote the points in $\Omega$ by $\omega$. Let $f(z)=\sum 1/(z-\omega)^4$, where the
summation extends over all $\omega\in\Omega$. $f(z)$ is a nonconstant elliptic function with the
periods $c$ and $ci$. We observe that as $\omega$ varies over $\Omega$, so do $-\bar\omega$ and $-i\omega$. It
follows that

(3.1)    $f(iz)=\sum_{\omega}\dfrac{1}{(iz-\omega)^4}=\sum_{\omega}\dfrac{1}{(z+i\omega)^4}=f(z),$

(3.2)    $\overline{f(iy)}=\sum_{\omega}\dfrac{1}{(-iy-\bar\omega)^4}=\sum_{\omega}\dfrac{1}{(iy+\bar\omega)^4}=\sum_{\omega}\dfrac{1}{(iy-\omega)^4}=f(iy).$

Thus $f(z)$ is real for $z=iy$. Since $f(z)$ has period $c$, $f(z)$ is real for $z=c+iy$. Thus
$f(z)$ satisfies properties 1-3.

The case $n=6$ is treated in a similar fashion. Let $\Omega$ denote the lattice
consisting of the points $mc+nc\zeta$ where $-\infty<m<\infty$, $-\infty<n<\infty$. Let

$$f(z)=\sum_{\omega}\frac{1}{(z-\omega)^6}.$$

$f(z)$ is a nonconstant elliptic function with the periods $c$, $c\zeta$. We observe that as
$\omega$ varies over $\Omega$, so do $-\bar\omega$ and $\zeta^{-1}\omega$. It follows that

(3.3)    $f(\zeta z)=\sum_{\omega}\dfrac{1}{(\zeta z-\omega)^6}=\sum_{\omega}\dfrac{1}{(z-\zeta^{-1}\omega)^6}=f(z),$

(3.4)    $\overline{f(iy)}=\sum_{\omega}\dfrac{1}{(-iy-\bar\omega)^6}=\sum_{\omega}\dfrac{1}{(iy+\bar\omega)^6}=\sum_{\omega}\dfrac{1}{(iy-\omega)^6}=f(iy).$

Thus $f(z)$ is real for $z=iy$. Since $f(z)$ has period $c$, $f(c+iy)$ is real. Thus $f(z)$
satisfies properties 1-3.

Finally, we remark that if $n=1$ or $2$, then $\Gamma$ is a level curve of a nonconstant
harmonic function without singularities. For we may choose $f(z)$ in these cases
to be respectively $i(z-c)$, $\cos \pi z/c$. This is no longer true for $n=3, 4, 6$. In the
latter cases the curve $\Gamma$ contains a closed loop. Thus if $u(z)$ is an everywhere
harmonic function vanishing on $\Gamma$, then $u(z)$ will vanish inside the loop and
hence identically.


ON THE LOCATION OF SINGULARITIES OF ANALYTIC FUNCTIONS
A. R. BLASS, Harvard University, and C. V. STANOJEVIĆ, University of Missouri at Rolla


Consider an analytic function given by its Taylor expansion about a point
which, for convenience, we take to be the origin

(1)    $f(z)=\sum_{n=0}^{\infty}a_nz^n.$

If this series has a finite radius of convergence, then there is at least one singular
point of $f$ on the circle of convergence [4, Sec. 10.2]. In principle, the location of
singular points is determined by the coefficients $a_n$; thus, the problem arises of
actually finding the singularities in terms of the $a_n$. A theorem due to Pringsheim
[5], [4, Sec. 5.7] asserts that, if all but finitely many of the $a_n$ are real and
nonnegative, or even satisfy only $|\arg a_n|<\beta$ for some $\beta<\pi/2$, [3], then the positive
real point on the circle of convergence is singular. A rather deep theorem of
Fabry [1], [2] asserts that, if $a_n/a_{n+1}$ approaches a limit, $a$, (so, by the ratio test,
$|a|$ is the radius of convergence) then $a$ is a singularity.

In this paper, we shall prove, in an elementary way, a theorem related to
that of Fabry. The hypothesis of our theorem assumes something more than
convergence of $a_n/a_{n+1}$, but our conclusion also asserts more than the conclusion
of Fabry’s theorem.

Before stating our theorem, we must introduce some definitions and
notations. Suppose $z$ is a fixed point inside the circle of convergence of (1). Then (1)
may be differentiated term by term as often as we want. Let us differentiate it
$k$ times and then multiply the result by $z^k/k!$. The resulting series

(2)    $\sum_{n=k}^{\infty}\binom{n}{k}a_nz^n$

converges absolutely. In particular, its partial sums are bounded. If $z$ is not a
zero of $f^{(k)}$, then we can find a number $M$ such that

(3)    $\Bigl|\sum_{n=k}^{N}\binom{n}{k}a_nz^n\Bigr|\le M\,\Bigl|\sum_{n=k}^{\infty}\binom{n}{k}a_nz^n\Bigr|$    for all $N\ge k$.

In general, $M$ will depend on $z$ and on $k$. If for some value of $z\ne0$, it happens
that $M$ may be chosen independent of $k$, then we call such a value of $z$ a special
point for the function $f$.

For brevity, we shall write “the circle about $p$” instead of “the circle of
convergence of the Taylor series of $f(z)$ about $p$.”

Let us remark that the hypothesis of Fabry’s theorem says precisely that
the series

(4)    $\sum_{n=1}^{\infty}\Bigl(\frac{a_n}{a_{n+1}}-\frac{a_{n-1}}{a_n}\Bigr)$

converges. Now we are in a position to state our theorem.


THEOREM. Let an analytic function $f$ have the expansion (1), and suppose none
of the $a_n$ are zero and the series (4) converges to $a-(a_0/a_1)$ and converges absolutely.
If $z\ne0$ is a special point for $f$, then the circle about $z$ passes through $a$.


Proof. Let us introduce the notations

$$S_k(N)=\sum_{n=k}^{N}\binom{n}{k}a_nz^n,\qquad S_k=\sum_{n=k}^{\infty}\binom{n}{k}a_nz^n=\lim_{N\to\infty}S_k(N).$$

Applying the ratio test to the expansion

$$f(w)=\sum_{k=0}^{\infty}\Bigl(\sum_{n=k}^{\infty}a_n\binom{n}{k}z^{n-k}\Bigr)(w-z)^k=\sum_{k=0}^{\infty}\frac{S_k}{z^k}(w-z)^k$$

we see that the circle about $z$ has radius $\lim_k|z(S_k/S_{k+1})|$ provided that this limit
exists. We shall show that it does exist and equals $|a-z|$. Observe that, since $z$
is special, we have

(3′)    $|S_k(N)|\le M|S_k|$    for all $N$ and $k\le N$.

We calculate:

$$z\bigl(S_k(N-1)+S_{k+1}(N-1)\bigr)-aS_{k+1}(N)$$
$$=z\Bigl(\sum_{n=k}^{N-1}\binom{n}{k}a_nz^n+\sum_{n=k+1}^{N-1}\binom{n}{k+1}a_nz^n\Bigr)-aS_{k+1}(N)$$
$$=\sum_{n=k}^{N-1}\binom{n+1}{k+1}a_nz^{n+1}-a\sum_{n=k+1}^{N}\binom{n}{k+1}a_nz^n$$
$$=\sum_{n=k+1}^{N}\binom{n}{k+1}\Bigl(\frac{a_{n-1}}{a_n}-a\Bigr)a_nz^n$$
$$=\sum_{n=k+1}^{N-1}\Bigl(\frac{a_{n-1}}{a_n}-\frac{a_n}{a_{n+1}}\Bigr)\sum_{m=k+1}^{n}\binom{m}{k+1}a_mz^m+\Bigl(\frac{a_{N-1}}{a_N}-a\Bigr)\sum_{m=k+1}^{N}\binom{m}{k+1}a_mz^m$$
$$=\sum_{n=k+1}^{N-1}\Bigl(\frac{a_{n-1}}{a_n}-\frac{a_n}{a_{n+1}}\Bigr)S_{k+1}(n)+\Bigl(\frac{a_{N-1}}{a_N}-a\Bigr)S_{k+1}(N).$$

If we let $N\to\infty$, the last term vanishes because $a=\lim_N a_{N-1}/a_N$, and $S_{k+1}(N)$ is
bounded, so we get

$$|zS_k-(a-z)S_{k+1}|=\Bigl|\sum_{n=k+1}^{\infty}\Bigl(\frac{a_{n-1}}{a_n}-\frac{a_n}{a_{n+1}}\Bigr)S_{k+1}(n)\Bigr|\le M|S_{k+1}|\sum_{n=k+1}^{\infty}\Bigl|\frac{a_{n-1}}{a_n}-\frac{a_n}{a_{n+1}}\Bigr|$$

by (3′). Thus

$$\Bigl|z\frac{S_k}{S_{k+1}}-(a-z)\Bigr|\le M\sum_{n=k+1}^{\infty}\Bigl|\frac{a_{n-1}}{a_n}-\frac{a_n}{a_{n+1}}\Bigr|=o(1),\qquad k\to\infty,$$

because (4) converges absolutely. This calculation shows that the circle about
$z$ has radius exactly $|a-z|$, and therefore passes through $a$.

If we combine the theorem just proved with the fact that the circle about
any point has a singularity on its circumference but none in its interior, and the
obvious fact that the circle about $O$ has radius $|a|$, we immediately find the
following results:

If (4) converges absolutely, and if there is a special point $z$ on the straight
line segment joining $O$ to $a$, then $a$ is a singularity of $f$ (as in Fabry’s theorem).
If there is a special point $z$ on the line segment joining $O$ to $-a$, then, not only
is $a$ a singularity, but it is the only singularity on the circle about $O$. If there
are two special points, one on the line segment from $O$ to $ia$, the other on the line
segment from $O$ to $-ia$, then either $a$ or $-a$ is singular (in fact we know, by
Fabry’s Theorem, that $a$ is singular), and no point other than these two on the
circle about $O$ is singular.


RESEARCH PROBLEMS
EDITED BY VICTOR KLEE


In this Department the Monthly presents easily stated research problems dealing with notions
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied
by relevant references (if any are known to the author) and by a brief description of known
partial results. Manuscripts should be sent to Victor Klee, Department of Mathematics,
University of Washington, Seattle, WA 98105.


ON SETS OF DISTANCES OF n POINTS
PAUL ERDŐS, Israel Institute of Technology, Haifa


Let $f(n)$ be the largest integer so that $n$ distinct points in the plane always
determine at least $f(n)$ distinct distances. It is easy to see that $f(3)=1$, $f(4)=f(5)=2$,
$f(6)=f(7)=3$. I proved [1]

(1)    $\sqrt{n-1}-1<f(n)<\dfrac{c_1n}{\sqrt{\log n}}.$

Moser proved [2]

(2)    $f(n)>\dfrac{n^{2/3}}{2\sqrt[3]{9}}-1,$

which is the best-known lower bound for $f(n)$.

It seems likely that $c_2n/(\log n)^{1/2}$ is the right order of magnitude for $f(n)$. In
fact perhaps the following result holds: Let $x_1, \dots, x_n$ be $n$ distinct points in the
plane. Then for at least one point $x_i$ there are at least $c_3n/(\log n)^{1/2}$ distinct
numbers amongst the distances $d(x_i, x_j)$, where $1\le j\le n$.

Assume next that the points $x_1, \dots, x_n$ are vertices of a convex $n$-gon. I
conjectured [1] and Altman [3] proved that the $n$ points determine at least
$[n/2]$ distinct distances. (The regular $n$-gon shows that this is best possible.) I
made two further conjectures [1]. Let $x_1, \dots, x_n$ be the vertices of a convex
$n$-gon. Then there always is an $x_i$ so that there are at least $[n/2]$ distinct
distances among the $d(x_i, x_j)$, where $1\le j\le n$ and $j\ne i$. This is probably true but
has not yet been settled. The second conjecture asserts that there always is an
$x_i$ so that there are no three vertices equidistant from it.
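
The remark that the regular $n$-gon attains the bound $[n/2]$ can be checked by direct computation. The sketch below is only an illustration: the function names, the tolerance `tol`, and the range of $n$ tested are assumptions, and distances are compared in floating point, so nearly equal values are merged.

```python
from math import cos, sin, pi, hypot

def regular_ngon(n):
    """Vertices of a regular n-gon inscribed in the unit circle."""
    return [(cos(2 * pi * k / n), sin(2 * pi * k / n)) for k in range(n)]

def distinct_distances(points, tol=1e-9):
    """Count pairwise distances, treating values within tol as equal."""
    dists = sorted(
        hypot(x1 - x2, y1 - y2)
        for i, (x1, y1) in enumerate(points)
        for (x2, y2) in points[i + 1:]
    )
    count = 0
    previous = None
    for d in dists:
        if previous is None or d - previous > tol:
            count += 1
        previous = d
    return count

# Number of distinct distances determined by the regular n-gon, n = 3..20.
ngon_counts = {n: distinct_distances(regular_ngon(n)) for n in range(3, 21)}
```

This confirms only that the regular $n$-gon determines exactly $[n/2]$ distances for the values tried; it says nothing about other configurations, which is where the difficulty of the problems lies.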

The second conjecture would clearly imply the first, but Danzer disproved
it (unpublished). In fact Danzer showed that for each $k$, there is a convex $n$-gon
with $n>n_0(k)$ so that every vertex has at least $k$ vertices which are equidistant
from it.

Let $g(n)$ be the largest integer so that there are $n$ points $x_1, \dots, x_n$ in the
plane for which there are $g(n)$ pairs $x_i$, $x_j$ with $d(x_i, x_j)=1$. I proved [1]

$$n^{1+c/\log\log n}<g(n)<cn^{3/2}.$$

It seems likely that the lower bound gives the correct order of magnitude, but
I could not even prove $g(n)=o(n^{3/2})$.

All these problems can be posed in the case the points are in $k$-dimensional
Euclidean space $E_k$. Curiously some of them become more tractable for $k\ge4$ [4].


Let 7 points be given in $E_2$. L. M. Kelly and I proved [5] that there are
always three of them which determine a nonisosceles triangle. The regular
pentagon and its center shows that 7 is best possible. Croft [6] proved that 9
points in $E_3$ gives the best possible answer and believes that $2k+3$ points in $E_k$
always determine the vertices of a nonisosceles triangle.

More generally one can ask the following question. Let $f(n, k)$ be the smallest
integer so that if $x_1, \dots, x_l$ are $l=f(n, k)$ points in $E_k$, one can always select $n$
of them so that all the $C_{n,2}$ distances are distinct. A good estimation for $f(n, k)$
seems difficult even for $k=1$. I conjecture $f(n, 1)=(1+o(1))n^2$. A result of
Turán and myself [7] shows that $f(n, 1)\ge(1+o(1))n^2$.


For some of these and other geometric problems, see my Hungarian paper in Mat. Lapok, 8
(1957) 86-92.


CLASSROOM NOTES


EDITED BY DAVID DRASIN


Manuscripts for this Department should be sent to David Drasin, Division of Mathematical 
Sciences, Purdue University, Lafayette, IN 47907. 


SIZING UP SETS AND CONTINUITY—DIFFERENTIABILITY RELATIONSHIPS 
S. G. WAYMENT, Utah State University 


1. Introduction. Let S be the closed interval [0, 1]. There are three standard 
methods of describing the “size” of a subset of S, i.e. (a) cardinality, (b) cate- 
gory, and (c) Lebesgue measure. The purpose of this paper is to demonstrate by 
examples the discrepancies which arise when comparing the sizes of sets with 
these methods. One of the resulting sets is used to construct a function which 
satisfies conditions “similar” to conditions under which no such function exists. 
More specifically, we construct a function which is continuous on a dense set D 
of measure zero and is discontinuous on S—D which has measure one. We also 
ask an interesting question concerning the existence of a continuous function 
whose derivative exists only on a countable dense set. In the final section of the 
paper we give a function which is a counterexample to some results stated in [1] 
and [2] on the derivative of a function with a dense set of discontinuities. 


2. Comparing some sets. Let $M_n$ be a set consisting of $n$ elements from $S$.
Let $H$ be the set of reciprocals of positive integers, let $R$ be the set of rational
points in $S$, let $C$ be the Cantor set, and let $P$ be the nonmeasurable set described
in [3]. Finally, suppose $(r_i)$ is an ordering of $R$ and let $I_{ki}$ be an open interval
centered at $r_i$ such that the length $l(I_{ki})<1/(k2^i)$. For $k\ge3$ let $O_k=\bigcup_i I_{ki}$. Then
the measure of $O_k$ satisfies $m(O_k)<1/k$. Further, $\tilde O_k=S-O_k$ is nowhere dense and
$\tilde G_\delta=\bigcup_{k\ge3}\tilde O_k$ is first category while $G_\delta=\bigcap_{k\ge3}O_k$ is of second category and $m(G_\delta)=0$.
If we assume the continuum hypothesis and recall that countability of $A\subset S$
implies that $m(A)=0$ and $A$ is of first category, then we can obtain an ordering
relative to “size” by each of the three mentioned methods. We shall use $<$ and
$=$ to represent this ordering under each method and shall insert parentheses to
indicate a rather loose classification as to “size.”
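
The two measure claims in this construction follow from countable subadditivity. A short verification, writing $I_{ki}$ for the open interval centered at the $i$th rational $r_i$ that is used to build $O_k$, and $G_\delta$ for the intersection of the sets $O_k$ with $k\ge3$:

```latex
\[
  m(O_k) \;\le\; \sum_{i=1}^{\infty} l(I_{ki})
         \;<\; \sum_{i=1}^{\infty} \frac{1}{k\,2^{i}}
         \;=\; \frac{1}{k},
\]
\[
  m(G_\delta) \;\le\; m(O_k) \;<\; \frac{1}{k}
  \quad\text{for every } k \ge 3,
  \qquad\text{hence}\qquad m(G_\delta) = 0 .
\]
```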

(a) With respect to cardinality we have (finite) $<$ (countably infinite)
$<$ (uncountably infinite) and

$$(\emptyset<M_1<\cdots<M_n<\cdots)<(H=R)<(C=O_k=\tilde O_k=G_\delta=\tilde G_\delta=P=S).$$

(b) With respect to category we have (nowhere dense) $<$ (first category)
$<$ (second category) and

$$(\emptyset=M_1=\cdots=M_n=\cdots=H=C=\tilde O_k)<(R=\tilde G_\delta)<(O_k=G_\delta=P=S).$$

(c) With respect to measure we have (measure zero) $<$ (measure between
zero and one) $<$ (measure one) and

$$(\emptyset=M_1=\cdots=M_n=\cdots=H=R=C=G_\delta)<(O_k<\tilde O_k)<(\tilde G_\delta=S).$$

Note that $P$ is not comparable with respect to measure.

The following relations are known to hold:

(i) $S$ (which is uncountable) is not the union of a countable class of countable
sets;

(ii) $S$ (which is of second category) is not the union of a countable class of
sets of first category; and

(iii) $S$ (which has measure one) is not the union of a countable class of sets
of measure zero.


It is of interest to note that $S=G_\delta\cup\tilde G_\delta$ is the union of two “small” sets, one
of measure zero (second category) and one of first category (measure one). In
view of the results mentioned, no other such decomposition of $S$ into “small” sets
holds.


3. On the existence of certain functions. A standard example of a function
which is continuous on the irrationals and is discontinuous on the rationals is
$f(x)=0$ when $x\in S-R$ and $f(x)=1/q$ when $x=p/q\in R$. (In this and the next
example $p/q$ is understood to be in lowest terms.) It is well known and easily
proved, using category arguments, that there does not exist a function which is
continuous on the rationals and discontinuous on the irrationals. We next
establish the existence of a function which is “almost” of this nature.


THEOREM. There exists a function $f$ defined on $S=[0, 1]$ such that $f$ is
continuous on a dense set $D$ of measure zero and is discontinuous on $S-D$.


Proof. Let $D=G_\delta$ and let $f_k(x)=0$ if $x\in O_k$ and $f_k(x)=1/2^k$ if $x\in S-O_k$. Then
$f=\sum_{k\ge3}f_k$ is the desired function. Note that if $x\in G_\delta$, then $f(x)=0$ and if $x\in\tilde G_\delta$
then $f(x)>0$. Since $\overline{G_\delta}=S$ (that is, $G_\delta$ is dense in $S$), this shows $f$ is discontinuous for $x\in\tilde G_\delta$. If $x_0\in G_\delta$ and
$\varepsilon>0$, then there exists $N$ such that $\sum_{i>N}1/2^i<\varepsilon$ and there exists an open
interval $I$ so that $x_0\in I\subset O_i$ for $i=3, \dots, N$. Hence we conclude that if $x\in I$ then

$$0\le f(x)-f(x_0)=f(x)\le\sum_{i>N}1/2^i<\varepsilon$$

and $f$ is continuous at $x_0$. Hence $f$ is continuous on a dense set of measure zero
and discontinuous elsewhere.

REMARK. If $f$ is continuous on a dense set, then the set of points of continuity
must be of second category. Hence there does not exist a function $f$ continuous
on $\tilde G_\delta$ and discontinuous on $G_\delta$ since $m(\tilde G_\delta)=1$ and hence $\tilde G_\delta$ is dense but of first
category.

Another standard example of a function that is continuous on the irrationals
and discontinuous on the rationals is given by $g(x)=\sum_{0<p/q\le x}1/q^3$ where
$p/q$ is rational. Since $g$ is increasing, $g$ is integrable and $G(x)=\int_0^x g$ is
continuous for each $x$. Furthermore $G'(x)$ exists and is equal to $g(x)$ at points where
$g$ is continuous. Also, $G'(x)$ is easily seen not to exist at any rational and hence
$G$ is a continuous function whose derivative exists at each irrational and fails to
exist at each rational. It is known that there exists a continuous function $h(x)$
on $[0, 1]$ such that $0<h(x)<1$ and the derivative fails to exist for each $x$. Hence
$h(x)\prod_{i=1}^{n}(x-r_i)^2$ is a continuous function whose derivative exists only on the
finite set $\{r_i\}$. This leads to the following question:

QUESTION. Does there exist a continuous function whose derivative exists at
each rational and fails to exist at each irrational?


4. A Counterexample. On pages 126-7 of [1] a proof is given to show that if
$f$ is a function with a dense set of discontinuities, then $f'$ exists only on a set of
first category. However, the author of [1] divides an inequality by $x_n-x$, and
apparently assumes this quantity always to be nonnegative since the inequality
is not reversed. The damage is irreparable as consideration of $f(x)=\sum_{0<p/q\le x}1/q^3$
illustrates. As already mentioned, $f$ is continuous at each irrational in $[0, 1]$ and
discontinuous at each rational in $[0, 1]$. Further, if $x$ is an irrational number and
$N$ is an integer, then there exists a $\delta$ such that $(x-\delta, x+\delta)$ contains no rational
with denominator (in lowest terms) less than $N$. We note also that the maximum
number of rationals with denominator $q$ in $(x, y)$ is $q(y-x)$. Hence if $x<y<x+\delta$,

$$\Delta f=f(y)-f(x)=\sum_{x<p/q\le y}1/q^3\le\sum_{q\ge N}[q(y-x)]\,1/q^3=(y-x)\sum_{q\ge N}1/q^2.$$

Thus by choosing $N$ sufficiently large ($\delta$ sufficiently small) we obtain

$$0\le\Delta f/(y-x)<\varepsilon$$

and since a similar argument is valid for $y<x$ we conclude $f'(x)=0$ for each
irrational $x\in[0, 1]$. The set of irrationals in $[0, 1]$ is of second category. The
proof in [1] actually shows that if the left hand limit or right hand limit fails to
exist on a dense set, then $f'$ can exist only on a set of first category.

It is further stated in [1] and [2] that “if a function is discontinuous at the
points of an everywhere dense set and differentiable (hence continuous) at the
points of another everywhere dense set, then it must be continuous and not
differentiable at the points of a set of second category.” This result is also invalid
since $f(x)=\sum_{0<p/q\le x}1/q^3$ is continuous and differentiable at each irrational.


THE NUMBER OF IRREDUCIBLE POLYNOMIALS OF DEGREE n OVER GF(p)
G. J. SIMMONS, Sandia Laboratories and University of New Mexico


Many algebra texts [1, 2, for example] introduce the number theoretic
Möbius inversion formula prior to discussing the cyclotomic extensions of a
field, since it provides a simple means for computing the cyclotomic polynomials.
It is apparently not well known, however, that the Möbius function can also be
used to derive an expression for the number of irreducible monic polynomials of
degree $n$ over GF($p$). Carmichael [3] gives a lengthy calculation of this quantity,
which he calls $N_n$, by applying the inclusion-exclusion principle to the number
of prime factors of the divisors of $n$. The following simple derivation of $N_n$ can
either be used in a course in number theory as an example at the time the
Möbius inversion formula is introduced, or in a course in algebra as a supplement
to the discussion of irreducible polynomials.

We include a brief derivation of the following theorem, which appears in
most algebra texts [3, p. 258; 4, Problem 45.8, p. 369, etc.] and a statement of
the Möbius inversion theorem to make this note self-contained.


THEOREM 1. The polynomial $x^{p^n}-x$ is the product of all monic irreducible 
polynomials over GF(p) whose degree d divides n. 


Proof. Let p(x) be a monic irreducible polynomial of degree d over GF(p). 
Then p(x) can be used to extend GF(p) to a field K containing $p^d$ elements. If 
p(x) divides $x^{p^n}-x$ then K may be embedded in the splitting field $GF(p^n)$ of 
$x^{p^n}-x$. Then $GF(p^n)$ is a vector space of dimension e over K. Viewing both fields 
as vector spaces over GF(p) and counting dimensions, de = n. Conversely, if 
de = n, then K is isomorphic to the fixed field of $GF(p^n)$ under the automorphism 
$x\to x^{p^d}$ since each $a\in K$ satisfies $a^{p^d}=a$. Thus p(x) has a root a in $GF(p^n)$. But a 
is also a root of $x^{p^n}-x$, hence p(x) divides $x^{p^n}-x$. 

Since each monic irreducible polynomial of degree d has d elements of $GF(p^n)$ 
as roots, and no element of $GF(p^n)$ is a root of more than one irreducible 
polynomial, counting shows that each irreducible polynomial occurs at most once in 
the factorization of $x^{p^n}-x$. 

If $N_d$ is the number of irreducible monic polynomials of degree d over GF(p), 
then the sum of the weighted degrees is given by 

(1)  $$p^n = \sum_{d\mid n} d\,N_d,$$

where d ranges over all divisors of n. 

The Möbius function, $\mu(d)$, of an integer d with the prime factorization 

$$d = p_1^{a_1} p_2^{a_2}\cdots p_k^{a_k}$$

is defined by 

$\mu(1) = 1$, 
$\mu(d) = 0$ if any $a_i > 1$, 
$\mu(d) = (-1)^k$ if $a_1 = a_2 = \cdots = a_k = 1$. 


The Möbius inversion formula for an arithmetic function is given by the 
following theorem [1, 2, 4]: 


THEOREM 2. Let f(n) and g(n) be arithmetic functions satisfying $f(n)=\sum_{d\mid n} g(d)$. 
Then $g(n)=\sum_{d\mid n}\mu(d)f(n/d)$, where $\mu(d)$ is the Möbius function. 

THEOREM 3. The number of irreducible monic polynomials of degree n over 
GF(p), $N_n$, is 

(2)  $$N_n = \frac{1}{n}\sum_{d\mid n}\mu(d)\,p^{n/d}.$$

Proof. Equation (2) is the Möbius inversion of equation (1). 
For example, $N_{12}$ for p = 2 is given by: 

$$N_{12} = \frac{1}{12}\,(2^{12} - 2^{6} - 2^{4} + 2^{2}) = 335.$$


The surprising rate at which $N_n$ increases for even small values of p is seen in 
the following table for p = 5 and 7. 


  n        p = 5          p = 7 

  2           10             21 
  3           40            112 
  4          150            588 
  5          624          3,360 
  6        2,580         19,544 
  7       11,160        117,648 
  8       48,750        720,300 
  9      217,000      4,483,696 

By considering the representation of $N_n$ given by equation (2) to be a p-ary 
number, it is easily shown that $N_n>0$ for all positive n and p. This result 
provides another proof that there exists a monic irreducible polynomial of each 
degree n over GF(p) and, consequently, that there exists a finite field with $p^n$ 
elements (obtained by extending GF(p) by the roots of the monic irreducible 
polynomial of degree n) for each prime p and integer n. 
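
Equation (2) is easy to evaluate mechanically. The following is a minimal Python sketch, not part of the original note; the function names `mobius` and `count_irreducible` are illustrative choices, and the Möbius function is computed by plain trial division. It reproduces the value of N_12 for p = 2 and entries of the table above.

```python
def mobius(d):
    """Moebius function mu(d), computed by trial division."""
    if d == 1:
        return 1
    result = 1
    p = 2
    while p * p <= d:
        if d % p == 0:
            d //= p
            if d % p == 0:
                return 0          # a squared prime divides the argument
            result = -result      # one more distinct prime factor
        p += 1
    if d > 1:
        result = -result          # remaining prime factor
    return result


def count_irreducible(p, n):
    """Number of monic irreducible polynomials of degree n over GF(p), by equation (2)."""
    total = sum(mobius(d) * p ** (n // d) for d in range(1, n + 1) if n % d == 0)
    assert total % n == 0         # the Moebius sum is always divisible by n
    return total // n


if __name__ == "__main__":
    print(count_irreducible(2, 12))
    for n in range(2, 10):
        print(n, count_irreducible(5, n), count_irreducible(7, n))
```

Integer arithmetic is used throughout, so the counts are exact for any prime p and degree n that fit in memory.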
ON THE CONVOLUTION OF DISTRIBUTION FUNCTIONS 
B. SCHWEIZER, University of Arizona 


Let D be the set of distribution functions of one-dimensional random vari- 
ables, i.e., functions defined on the real line R which are nondecreasing, left-con- 
tinuous, and have inf 0 and sup 1. The binary operation of convolution is jointly 
continuous with respect to weak convergence on D. This fact is well known and 
a proof using characteristic functions is trivial. Such a proof is, however, non- 
elementary. The aim of this note is to provide an elementary proof and, in the 
process, an application of the Lévy metric as well. We shall need the following 
simple facts: 

(1) The function $L: D\times D\to R$ defined by 

$$L(F, G) = \inf\{h : F(x-h) - h \le G(x) \le F(x+h) + h, \text{ for all } x\}$$

is a metric, namely the Lévy metric, on D. 

(2) Convergence in the metric space (D, L) is equivalent to weak convergence; 
that is to say, if $\{F_n\}$ is a sequence in D and F is also in D, then $L(F_n, F)\to 0$ 
if and only if $F_n$ converges weakly to F (i.e., $F_n$ converges pointwise to F at 
each continuity point of F). 

(3) Convolution is a commutative operation on D. (This is usually derived 
from the corresponding property for characteristic functions but is easy to prove 
directly.) 


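THEOREM. Convolution is jointly continuous on the metric space (D, L). 


Proof. Let $F, G\in D$ and $\varepsilon>0$ be given. Suppose $F', G'\in D$ are such that 
$L(F, F')<\varepsilon/2$ and $L(G, G')<\varepsilon/2$, so that, for all $x\in R$, 

$$F'\left(x-\frac{\varepsilon}{2}\right)-\frac{\varepsilon}{2} \le F(x) \le F'\left(x+\frac{\varepsilon}{2}\right)+\frac{\varepsilon}{2}$$

and 

$$G'\left(x-\frac{\varepsilon}{2}\right)-\frac{\varepsilon}{2} \le G(x) \le G'\left(x+\frac{\varepsilon}{2}\right)+\frac{\varepsilon}{2}.$$

Then 

$$(F*G)(x) = \int_{-\infty}^{\infty} F(x-y)\,dG(y) \le \int_{-\infty}^{\infty}\left[F'\left(x+\frac{\varepsilon}{2}-y\right)+\frac{\varepsilon}{2}\right]dG(y)$$

$$= \int_{-\infty}^{\infty} F'\left(x+\frac{\varepsilon}{2}-y\right)dG(y)+\frac{\varepsilon}{2}$$

$$= (F'*G)\left(x+\frac{\varepsilon}{2}\right)+\frac{\varepsilon}{2} = (G*F')\left(x+\frac{\varepsilon}{2}\right)+\frac{\varepsilon}{2}$$

$$= \int_{-\infty}^{\infty} G\left(x+\frac{\varepsilon}{2}-y\right)dF'(y)+\frac{\varepsilon}{2}$$

$$\le \int_{-\infty}^{\infty} G'(x+\varepsilon-y)\,dF'(y)+\frac{\varepsilon}{2}+\frac{\varepsilon}{2}$$

$$= (G'*F')(x+\varepsilon)+\varepsilon = (F'*G')(x+\varepsilon)+\varepsilon.$$

Similarly, $(F'*G')(x-\varepsilon)-\varepsilon \le (F*G)(x)$, whence $L(F'*G', F*G)\le\varepsilon$ and the 
theorem is proved. 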


MATHEMATICAL EDUCATION 
EDITED By J. G. HARVEY AND M. W. POWNALL 


Material for this Department should be sent to either of the editors: J. G. Harvey, Department 
of Mathematics, University of Wisconsin, Madison, WI 53706; M. W. Pownall, Department 
of Mathematics, Colgate University, Hamilton, NY 13346. 


THE QUESTION OF ACCREDITATION AND CERTIFICATION¹ 


Introduction. Identification of persons who are really well qualified to teach 
mathematics in two- and four-year colleges is a perennial problem; and it is 
being aggravated by the growing number and diversity not only of schools to 
be staffed, but of graduate programs producing applicants. Similar problems 
exist in identifying persons who are qualified to work as mathematicians in 
industry or in other nonacademic positions. For these and other reasons, some 
consideration of the possibility of establishing a system of accreditation or 
certification in mathematics seems timely. The evidence [1] that we may be 


1 This is a digest of a Report on Accreditation and Certification prepared in 1969 by the CUPM 
Panel on College Teacher Preparation at the request of the MAA Board of Governors, and dis- 
cussed by the Board on January 23, 1970. The Board appointed a committee of its members to 
survey mathematicians’ views on this issue. The present digest, prepared by this committee, 
provides some background for questions that will be circulated through section officers within the 
next few months. The members of the committee are R. G. Bartle, Grace Bates, D. Bushaw, E. A. 
Cameron, D. T. Finkbeiner (chairman), J. Hashisaki, L. H. Lange, and F. L. Wolf. 


PROBLEMS AND SOLUTIONS 


EDITED BY EMORY P. STARKE 


ASSOCIATE EDITORS: JOSHUA BARLAZ, GRATTAN P. MURPHY. COLLABORATING EDITORS: 
LEONARD CARLITZ, GULBANK D. CHAKERIAN, HASKELL COHEN, ISRAEL N. HERSTEIN, 
MURRAY S. KLAMKIN, DANIEL J. KLEITMAN, ROGER C. LYNDON, MARVIN MARCUS, 
ALBERT WILANSKY, OSWALD WYLER, and UNIVERSITY OF MAINE PROBLEMS GROUP: 
GEORGE S. CUNNINGHAM, CLAYTON W. DODGE, HOWARD W. EVES, WILLIAM R. GEIGER, 
CHARLES A. GREEN, ERIC S. LANGFORD, PHILIP M. LOCKE, JOHN C. MAIRHUBER, 
EDWARD S. NORTHAM and WILLIAM L. SOULE, JR. 


All problems (both elementary and advanced) proposed for inclusion in this Department 
should be sent to E. P. Starke, 1000 Kensington Ave., Plainfield, NJ 07060. Proposers of 
problems are urged to enclose any solutions or information that will assist the editors. Ordi- 
narily, problems in well-known textbooks and results in generally accessible sources are not 
appropriate for this Department. No solutions (except those accompanying proposals) should 
be sent to Professor Starke. 


ELEMENTARY PROBLEMS 


Solutions of Elementary Problems should be sent to Problems Group, Mathematics Depart- 
ment, University of Maine, Orono, ME 04473. To facilitate their consideration, solutions of 
Elementary Problems in this issue should be typed (with double spacing) and should be mailed 
before December 31, 1970. Contributors (in the United States) who desire acknowledgement of 
receipt of their solutions are asked to enclose self-addressed stamped postcards. 

An asterisk means that no solution was submitted. 


E 2247. Proposed by N. S. Mendelsohn, University of Manitoba 


Let $a_1, a_2, \ldots, a_n$ be a set of relatively prime positive integers. Let 
$F(a_1, a_2, \ldots, a_n)$ represent the largest integer which cannot be represented in the 
form $c_1a_1+c_2a_2+\cdots+c_na_n$ where $c_1, c_2, \ldots, c_n$ are nonnegative integers. Prove 
the following: 


(1) If (a, b) = 1 and a>0, b>0 and c is a positive integer such that c is 
nonrepresentable in the form Aa + Bb, with A, B nonnegative integers, then 
F(a, b, c) < F(a, b). 

(2) If (a, b) = 1, a>0, a<b and such that ta<b<(t+1)a where t is an 
integer, then F(a, b, ab-(t+1)a-b) = ab-a-2b. 


E 2248. Proposed by A. W. Walker, Toronto, Canada. 


If a, b, c, r, R are the side lengths, inradius, and circumradius of any plane 
triangle, then 

$$\frac{1}{2rR} \le \frac{1}{3}\left(\frac{1}{a}+\frac{1}{b}+\frac{1}{c}\right)^2 \le \frac{1}{a^2}+\frac{1}{b^2}+\frac{1}{c^2} \le \frac{1}{4r^2}.$$


E 2249.* Proposed by A. L. Holshauer, Charlotte, N. C. 

Given any three triangles $A_1, A_2, A_3$. Let $H_1, H_2, H_3$ be the orthocenters; 
$G_1, G_2, G_3$ the centroids; and $O_1, O_2, O_3$ the circumcenters. Prove that the centroids 
of the triangles $H_1H_2H_3$, $G_1G_2G_3$ and $O_1O_2O_3$ are collinear, and their circumcenters 
are likewise collinear. 

E 2250. Proposed by C. A. Kottman, Louisiana State University 


Prove that, if one is given any rectangular sheet of paper and a number 
$\varepsilon>0$, he may, by repeated foldings of the paper in half or in thirds (lengthwise or 
widthwise or both), arrive at a smaller rectangle with ratio r of length to width, 
satisfying $1-\varepsilon\le r\le 1+\varepsilon$. 


E 2251. Proposed by T. C. Brown, Simon Fraser University, Burnaby, British 
Columbia 


Consider a rectangular array of dots with an even number of rows and an 
even number of columns. Color the dots either red or blue in such a way that 
every row has the same number of red and blue dots, and likewise every column. 
Whenever two dots of the same color are adjacent in the same row or column, 
connect them with a line segment of that color. Show that the total number of 
blue segments equals the total number of red segments. 


E 2252. Proposed by Harry Lass, Jet Propulsion Laboratory, California 
Institute of Technology 


Given n urns numbered 1, ..., n and k objects, with k < n. Suppose each of 
the objects is placed at random in one of the urns. Find for r = 1, 2, ..., the 
probability that the number of objects in the first r urns is less than or equal to r. 


SOLUTIONS OF ELEMENTARY PROBLEMS 


Editorial Note. Through an oversight the following names were omitted from the list of solvers 
of Problem E 2180 [1970, 406]: E. N. Fischman, R. Garfield, and Chung-kiu Wong. 


An Equilateral Property of All Triangles 


E 2139 [1968, 1114; 1969, 1066]. Comment by A. W. Walker, Toronto, 
Canada. The concluding statement in the published solution is incorrect. For a 
(30°, 60°, 90°) triangle ABC, O is the midpoint of AB, H coincides with C, and 
the apex of one of the equilateral triangles on base OH is therefore the point B, 
while the apex of the other is the reflection of O in AC and obviously does not 
coincide with an escribed center of triangle ABC. 


A Product of Sixty-six Inversions 


E 2140 [1968, 1114; 1969, 1067]. Comment by A. W. Walker, Toronto, 
Canada. The published solution is incomplete in that it omits an important case, 
namely when the mentioned orthocenter H coincides with the common point P 
of the three circles A, B, C. If H and P are distinct points, then $(A_1, B_1, C_1)$ 
are circles with a common point $P_1$, as stated, but as $H\to P$, $P_1\to\infty$; and when H 
and P coincide, then $(A_1, B_1, C_1)$ are straight lines forming a triangle. The 
equivalence of the given relation and Thomsen's is then direct, and not inversive as 
stated in the solution. (For Thomsen's proof of his result, see Math. Zeitschrift, 
34, 1932, p. 690.) 

A Functional Equation 


E 2176 [1969, 554; 1970, 310]. Proposed by R. S. Luthar, University of 
Wisconsin at Waukesha 


Find all continuous real functions f such that 

(1)  $$f\left(\frac{x+y}{x-y}\right) = \frac{f(x)+f(y)}{f(x)-f(y)}.$$


Editorial Note. Part of the solution by L. E. Ward, Sr., was omitted in print- 
ing; the incompleteness of the published solution was pointed out by A. N. 
Aheart. A corrected solution follows. 

Placing y = 0 shows that f(1)[f(x) - f(0)] = f(x) + f(0), whence we infer that 
f(0) = 0 and f(1) = 1, since f cannot be constant. Replacing y by x-2 and then x 
by x-1 and y by 1, we obtain 

(2)  $$f(x-1) = \frac{f(x)+f(x-2)}{f(x)-f(x-2)}, \qquad f\left(\frac{x}{x-2}\right) = \frac{f(x-1)+1}{f(x-1)-1}.$$

It therefore follows that 

(3)  $$f\left(\frac{x}{x-2}\right) = \frac{f(x)}{f(x-2)}$$

and 

(4)  $$f(x) = f(x-2)\,\frac{f(x-1)+1}{f(x-1)-1}.$$

By (3), $f(4)=f^2(2)$; this together with successive use of (4) yields 

$$f(5) = \frac{f^2(2)+1}{[f(2)-1]^2}.$$

On the other hand, 

$$f(5) = f\left(\frac{3+2}{3-2}\right) = \frac{f(3)+f(2)}{f(3)-f(2)} = \frac{f^2(2)+1}{1+2f(2)-f^2(2)}.$$

Hence $[f(2)-1]^2 = 1+2f(2)-f^2(2)$ or $f^2(2)=2f(2)$. But $f(2)\ne 0$ since, from (1), 
f(2) = 0 would imply f((x+2)/(x-2)) = 1. Hence f(2) = 2. 

Now assume that f(x) = x for x = 0, 1, 2, ..., n ($n\ge 2$). Then, with x = n+1, 
the first equation of (2) gives at once f(n+1) = n+1. It follows that f(x) = x 
for all nonnegative integers. 

Putting y = cx yields 

$$f\left(\frac{1+c}{1-c}\right) = \frac{f(x)+f(cx)}{f(x)-f(cx)} = \frac{1+f(c)}{1-f(c)},$$

from which f(cx) = f(c)f(x). Now let x = p/q, a rational fraction, and c = q. Then 
f(p) = f(p/q)f(q), showing that f(x) = x for all positive rational numbers. By 
continuity it follows that f(x) = x for all positive numbers. 

Finally, placing y = -x in (1) and using f(0) = 0 gives f(x) + f(-x) = 0, so that 
f(-x) = -f(x) = -x. We conclude that f(x) = x. 


A Special Property of 3 
E 2190 [1969, 937]. Proposed by Harry Pollard, Purdue University 


Show that if m and n are positive integers, the smaller of the numbers $\sqrt[n]{m}$ 
and $\sqrt[m]{n}$ cannot exceed $\sqrt[3]{3}$. 


I. Solution by Douglas Lind, Stanford University. Let $f(x)=x^{1/x}$. Elementary 
calculus shows that $f(x)\to 0$ as $x\to 0$, $f(x)\to 1$ as $x\to\infty$, f is increasing in [0, e], 
and f is decreasing in $[e, \infty)$. Hence C = sup(f(k): k = 1, 2, ...) = max(f(2), f(3)). 
Since $3^2>2^3$, f(3) > f(2), whence $C=3^{1/3}$. Then $f(m)\le 3^{1/3}$ for all positive integers 
m. If $n\ge m$, then $m^{1/n}\le m^{1/m}\le 3^{1/3}$, whence $\min(m^{1/n}, n^{1/m})\le 3^{1/3}$. 


II. Solution by Charles Wexler, Arizona State University. Suppose first that 
m = n, meaning that we must show that $\sqrt[n]{n}\le\sqrt[3]{3}$ or that $3^n\ge n^3$. Proof by 
mathematical induction shows that this is true whenever $n\ge 1$. ($3^n\ge n^3 \Rightarrow 3^{n+1}\ge 
3n^3 = n^3+3n^2+3n+(n-3)n^2+(n^2-3)n$, which is greater than $(n+1)^3$ when 
$n\ge 3$. The cases n = 1, 2 are trivial.) 


Next suppose that $1\le n<m$. Then $n^{1/m}\le n^{1/n}\le 3^{1/3}$. 


Also solved by a hundred and seventeen other readers. 
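
The bound can be checked by brute force over a finite range. The sketch below is an illustration only and is not drawn from either published solution; the names `smaller_root` and `max_smaller_root` and the search limit are assumptions made for the example. It confirms that, among pairs up to the limit, the smaller of the two roots is largest at m = n = 3.

```python
def smaller_root(m, n):
    """The smaller of the n-th root of m and the m-th root of n."""
    return min(m ** (1.0 / n), n ** (1.0 / m))


def max_smaller_root(limit):
    """Largest value of smaller_root(m, n) for 1 <= m, n <= limit, with the pair attaining it."""
    best = (0.0, 0, 0)
    for m in range(1, limit + 1):
        for n in range(1, limit + 1):
            value = smaller_root(m, n)
            if value > best[0]:
                best = (value, m, n)
    return best


if __name__ == "__main__":
    value, m, n = max_smaller_root(40)
    print(value, m, n)
    print(3 ** (1.0 / 3))
```

A finite search proves nothing about all pairs, of course; the monotonicity argument of Solution I is what covers the integers beyond the limit.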


Another Variation of a Problem of Euler 


E 2191 [1969, 938]. Proposed by M. J. Zerger, Umpqua Community College, 
Roseburg, Oregon 


Find the solution set of $x^{x+1} = (x+1)^x$. 


Solution by G. A. Heuer, Concordia College. There is only one solution, even if 
all different combinations of branches are considered in the case where x <0. This 
solution is an irrational number between 2 and e. 

First assume x>0. A condition equivalent to the given one is that f(x) = 
f(x+1), where f(x) = (log x)/x. Inspection of f'(x) shows that f is strictly 
increasing on (0, e) and strictly decreasing on $(e, \infty)$. It follows easily that there is 
a unique $x_0<e$ for which $f(x_0)=f(x_0+1)$. Since f(2) < f(3), we know that $x_0>2$. 
The approximate value of $x_0$ is 2.2932. 

For x<0, (log x)/x = [log(x+1)]/(x+1) is still an equivalent condition. 
Now log x is not real, so log(x+1) is not; thus x+1<0 also. We consider all 
branches of the logarithm; thus we seek values of x, and integers m, n such that 

$$\frac{\log|x+1| + i\pi(2m+1)}{x+1} = \frac{\log|x| + i\pi(2n+1)}{x}.$$

By equating imaginary parts we obtain x = (2n+1)/(2m-2n), which requires x 
to be rational. By equating real parts we have 

$$(\log|x+1|)/|x+1| = (\log|x|)/|x|,$$

since x+1<0. The only possible solution here would be $|x|=x_0+1$, $|x+1|=x_0$; 
i.e., $x=-x_0-1$. We eliminate this possibility by showing $x_0$ to be irrational. 
Divide both members in the original equation by $x^x$ to obtain 

$$x = [(x+1)/x]^x,$$

and then $x^{1/x}=1+1/x$. Suppose $x_0=r/s$, where r and s are relatively prime 
integers. Then $(r/s)^{s/r}=1+s/r$; 

$$(r/s)^s = (1+s/r)^r = (r+s)^r r^{-r}; \qquad r^{r+s} = s^s(r+s)^r.$$

This is impossible in view of (r, s) = 1, unless s = 1. But $x_0$ is not an integer, so it 
is not rational. 
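
The positive root can be located numerically from the equivalent condition f(x) = f(x+1) with f(x) = (log x)/x. The sketch below uses plain bisection on the interval (2, e), where the sign change is established in the solution; the helper names `g` and `bisect_root` and the tolerance are assumptions of this illustration, not part of the published solution.

```python
import math


def g(x):
    """f(x) - f(x + 1) with f(x) = log(x) / x; zero exactly when x**(x+1) == (x+1)**x."""
    return math.log(x) / x - math.log(x + 1) / (x + 1)


def bisect_root(lo, hi, tol=1e-12):
    """Bisection for a root of g on [lo, hi], assuming g(lo) < 0 < g(hi)."""
    assert g(lo) < 0 < g(hi)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if g(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


x0 = bisect_root(2.0, math.e)

if __name__ == "__main__":
    print(x0)
    print(x0 ** (x0 + 1), (x0 + 1) ** x0)
```

Bisection is chosen here only for its simplicity; any bracketing root finder would serve equally well.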


Also solved by Anders Bager (Denmark), Walter Bluger, G. C. Dodds, R. A. Gibbs, Michael 
Goldberg, Emil Grosswald, Thomas Hughes, P. G. Kirmser, Bill Knight, Patricia L. LaFratta, 
R. S. Lee, Charles Linnett, O. P. Lossers (Netherlands), D. E. Myers, C. S. Ogilvy, Stan Pauli, 
Corrado Quintiliani, Steven Russ, Simeon Reich (Israel), E. F. Schmeichel, E. M. Stone, Student 
Problems Group of St. Olaf College, Roger Weitzenkamp, Charles Wexler, and P. H. Young. 


An Upper Bound 


E 2192 [1969, 938]. Proposed by R. S. Luthar, University of Wisconsin at 
Waukesha: 


It is easy to show that 

$$\sum_p \frac{1}{p^2} + \sum_p \frac{1}{p^3} + \sum_p \frac{1}{p^4} + \cdots < 1,$$

where the summation runs over all primes in each term of the above series. 


Determine a better upper bound. 


Solution by O. P. Lossers, Technological University, Eindhoven, Netherlands. 

$$\sum_{n=2}^{\infty}\sum_p \frac{1}{p^n} = \sum_p \frac{1}{p(p-1)} < \frac{1}{2} + \sum_{m=1}^{\infty}\left(\frac{1}{2m}-\frac{1}{2m+1}\right) = \frac{3}{2} - \log 2 < 0.8069.$$


Also solved by Anders Bager (Denmark), G. C. Dodds, R. B. Eggleton (Australia), T. E. 
Elsner, Leon Gerber, Michael Goldberg, Emil Grosswald, J. E. Hafstrom, C. T. Haskell, Dean 
Hickerson, Bill Knight, J. R. Kuttler, Douglas Lind, P. A. Lindstrom, D. C. B. Marsh, Arthur 
Marshall, Michael McCoy, C. B. A. Peck, R. S. R. Rao (India), Simeon Reich (Israel), Peter 
Ross, Steven Russ, E. F. Schmeichel, Ralph Schreiber, Peter Schroth (Germany), R. E. Shafer, 
J. S. Shipman, J. R. Smart, David Spear, S. E. Spielberg, Student Problems Group of St. Olaf 
College, Hugo Sun, H. H. Thoyre, T. N. Tiwari, L. K. Tolman, E. W. Trost (Switzerland), Steve 
von Abele, Roger Weitzenkamp, Charles Wexler, J. E. Wilkins, Jr., and P. H. Young. 

Note. Many contributors determined still better bounds. The citation for diligence goes to 
Wilkins who reports that the sum is 0.773156669049795128, correct to 18 decimal places. 
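
The double series collapses to the single sum of 1/(p(p-1)) over primes, which converges quickly enough to check numerically. The following sketch truncates that sum at a finite prime limit; the function names and the limit are assumptions of this illustration, and the truncated value is only a lower approximation to the full sum.

```python
import math


def primes_up_to(limit):
    """All primes <= limit, by the sieve of Eratosthenes."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, int(limit ** 0.5) + 1):
        if sieve[i]:
            # Clear every multiple of i starting from i*i.
            sieve[i * i::i] = bytearray(len(range(i * i, limit + 1, i)))
    return [i for i, flag in enumerate(sieve) if flag]


def prime_power_sum(limit):
    """Partial sum of 1/(p(p-1)) over primes p <= limit."""
    return sum(1.0 / (p * (p - 1)) for p in primes_up_to(limit))


LOSSERS_BOUND = 1.5 - math.log(2)   # the bound 3/2 - log 2 from the solution

if __name__ == "__main__":
    print(prime_power_sum(100000))
    print(LOSSERS_BOUND)
```

Since every term is positive, the partial sum increases toward the value reported in the Note and stays below the bound 3/2 - log 2.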


A Weak Betweenness Relation 


E 2193 [1969, 938]. Proposed by D. J. Simanaitis, Case Western Reserve 
University 


For A, B and C, three points in the Euclidean plane, define B to be 
weakly-between A and C if and only if angle $ABC\ge 120°$. Determine the minimal number 
of points required to insure the existence of at least one such weak-betweenness 
relation. 


I. Solution by Bill Knight, Student, University of Wyoming. Let N be the 
minimum number of points needed. The regular polygons of sides 3, 4 and 5 show 
that N must be at least 6. It will be shown that N = 6. 

If A, B and C are collinear with B between A and C, then angle ABC will be 
greater than 120°, and B will be weakly-between A and C. If a point is in the interior of 
any triangle, it will be seen that that point is weakly-between some two of the 
vertices of the triangle. 

Now assume that some six points are given in the plane in such a way as not 
to have a weak-betweenness relation. The points are not collinear, so consider 
the convex hull, such that each of the six points is a vertex (since no three can be 
collinear), or is interior to the hull. The sum of the angles of any six-sided polygon 
is always 720°, so that at least one of the angles must be greater than or equal to 
120°. It follows that one of the six points is in the interior. By connecting each 
pair of vertices of the hull, the interior is divided into triangles. Clearly, every 
point in the interior of the hull is also interior to one of these triangles and at 
least one weak-betweenness relation exists. Thus N = 6. 


II. Solution by Jordi Dou, Barcelona, Spain. The minimum number is 6. For 
among the 5 vertices of a regular pentagon there are no weakly-between points 
(w.b.p.), and if $A_i$ ($1\le i\le 6$) are 6 points, consider the intersection of the 
half-planes $A_iA_j$ ($i\ne j$) that contain the 6 points. If the intersection contains 
no interior point, it will be a convex hexagon and at least one vertex will be a 
w.b.p.; if it contains an interior point $A_i$, then $A_i$ will be interior to a triangle $A_jA_kA_l$ 
and will be a w.b.p. of two of its vertices. 


Also solved by Anders Bager (Denmark), J. C. Binz (Switzerland), Walter Bluger, W. G. 
Brady, E. P. Del Norte, Leon Gerber, Michael Goldberg, M. G. Greening (Australia), Ned Harrell, 
Dean Hickerson, D. C. Kay, J. R. Kuttler, Charles McCracken, R. A. Melter, Norman Miller, 
Robert Patenaude, Jiirg Ritz (Switzerland), Simeon Reich (Israel), Kenneth Rosen, Steven Russ, 
Bill Sands, E. F. Schmeichel, Ralph Schreiber, G. F. Schumm, David Spear, H. H. Thoyre, Steve 
von Abele, Roger Weitzenkamp, Ruth Worth, Mark Yu, and the proposer. 

Note. Melter notes that the solution follows from Exercise 2, p. 132 of Blumenthal, Distance 
Geometry. Rosen and Schmeichel found the problem to be substantially the same as Problem A-1 of 
the 1964 Putnam Competition (see this MONTHLY [1965, 735]). 

A Four Square Problem 
E 2194 [1969, 938]. Proposed by Marion B. Smith, University of Wisconsin, 
Baraboo 
Do there exist nonzero integers a, b, c, d such that $a^2+b^2=c^2-d^2$ and ab = cd? 
Solution by E. Rosenthall, McGill University. Using the idea of Fermat 
descent yields a negative answer. Clearly it suffices to determine that there are no 
solutions in positive integers. Suppose the system has a solution in positive 
integers and among all solutions choose one for which d is minimal. Then there 
are positive integers x, y, z, w with the conditions 

(1)  (x, z) = 1, (w, y) = 1 

such that a = kxy, b = kzw, c = kxw, d = kzy, since all solutions of ab = cd in 
positive integers are of this form. We also have $a^2+b^2=c^2-d^2$ and it follows that the 
positive integers x, y, z, w form a solution of 

$$(x^2+z^2)(y^2+w^2) = 2x^2w^2,$$

where (1) then implies that only either z or y is even and also that $(x^2+z^2, x^2)=1$ 
and $(y^2+w^2, w^2)=1$. Then we have 

$x^2+z^2=w^2$, $y^2+w^2=2x^2$, z even 
or 
$x^2+z^2=2w^2$, $y^2+w^2=x^2$, y even. 


Considering the first system (the other system can be similarly considered) 
it follows from the first equation that there are relatively prime integers m, n of 
opposite parity with m>n satisfying 

$$x=m^2-n^2, \quad z=2mn, \quad w=m^2+n^2,$$

which, upon substitution into the second equation of the system yields 

$$y^2+(2mn)^2 = (m^2-n^2)^2$$

and hence, since $(2mn, m^2-n^2)=1$, there are positive integers r, s with $r^2>s^2$ 
such that 

$$y=r^2-s^2, \quad mn=rs, \quad m^2-n^2=r^2+s^2.$$

Then r, s, m, n is another solution of the given system in positive integers. But 
$d=k(2mn)(r^2-s^2)$ which implies n<d, and this contradicts the minimal 
property of d. 


Also solved by Anders Bager (Denmark), Walter Bluger, W. J. Blundon, W. F. Fox, Michael 
Goldberg, M. G. Greening (Australia), Thomas Hughes, J. A. H. Hunter, O. P. Lossers (Nether- 
lands), D. C. B. Marsh, W. W. Meyer, Simeon Reich (Israel), K. A. Ribet, E. F. Schmeichel, 
Ralph Schreiber, David Spear, Allen Stenger, E. W. Trost (Switzerland), T. N. Tiwari, Steve von 
Abele, Charles Wexler, and the proposer. 
Note. Most solvers note that the hypothesis implies $(ad+bc)^2=c^4-a^4$ and that this 
Diophantine equation is well known as having no solution. 
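
As a sanity check on the negative answer, a small exhaustive search can be run. The sketch below is illustrative only (the function name and the search bound are assumptions); it looks for positive solutions with a, b up to a limit, using the fact that any solution has c > max(a, b) and hence d = ab/c < min(a, b), so d never exceeds the limit either.

```python
def four_square_solutions(limit):
    """Positive (a, b, c, d) with a <= b <= limit, a*b == c*d and a*a + b*b == c*c - d*d."""
    found = []
    for a in range(1, limit + 1):
        for b in range(a, limit + 1):
            product = a * b
            target = a * a + b * b
            for d in range(1, limit + 1):
                if product % d:
                    continue          # c = ab/d must be an integer
                c = product // d
                if c * c - d * d == target:
                    found.append((a, b, c, d))
    return found


if __name__ == "__main__":
    print(four_square_solutions(60))
```

An empty result over a finite range is consistent with the descent argument but is, by itself, no proof.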


A Divisibility Problem in Polynomial Rings 


E 2195 [1969, 938]. Proposed by T. J. Bruggeman, Xavier University, 
Cincinnati, Ohio 


Which polynomials of the form $\sum_{i=1}^{n} x^{a_i}$ are divisible by $\sum_{i=1}^{m} x^{i-1}$? That is, 
find necessary and sufficient conditions for $a_i$, i = 1, 2, ..., n. 


Solution by Robert Gilmer, Florida State University. We assume that $f(x)=\sum_{i=1}^{m} x^{i-1}$ 
and $g(x)=\sum_{i=1}^{n} x^{a_i}$ are elements of R[x], where R is a commutative 
ring with unity e. We write $a_i$ as $q_im+r_i$, where $0\le r_i<m$. Then, since $x^m\equiv e$ 
(mod f(x)), we have $g(x)\equiv\sum_{i=1}^{n} x^{r_i}$ (mod f(x)). For any j between 0 and m-1, 
we let $n_j$ be the number of $a_i$'s which are congruent to j modulo m. Then 
$\sum_{i=1}^{n} x^{r_i} = \sum_{j=0}^{m-1}(n_je)x^j = h(x)$; since f(x) is monic, it is clear that f(x) divides h(x) if and 
only if $n_0e=n_1e=\cdots=n_{m-1}e$. Thus, f(x) divides g(x) if and only if $n_i$ and $n_j$ are 
congruent modulo the characteristic of R for any i and j between 0 and m-1. 
One interesting corollary of this result is that m must divide ne if f(x) divides 
g(x). 


Also solved by Daniel Frohardt, Leon Gerber, Michael Goldberg, C. S. Ogilvy, Simeon Reich 
(Israel), E. F. Schmeichel, Ralph Schreiber, Sid Spital, Steve von Abele, and the proposer. 


Note. Problem E 2000 [1968, 908] is a special case. 


Multiplicative Function and its Dirichlet Inverse 
E 2196 [1969, 1062]. Proposed by R. Sivaramakrishnan, Trichur, India 


Given a multiplicative arithmetic function f(n), let its Dirichlet inverse be 
$f^{-1}(n)$. By definition, $f^{-1}(n)$ satisfies the equation 

$$\sum_{d\mid n} f(d)f^{-1}(n/d) = \begin{cases} 1 & (n=1) \\ 0 & (n>1). \end{cases}$$

Prove that $\sum_{d\mid n} f(d)f^{-1}(n/d)\,d = f(n)\phi(n)$ if and only if f(n) is completely 
multiplicative. $\phi(n)$ is the well-known Euler function. 


Solution by Dan Marcus, Harvard University. Rather than assume that f is 
multiplicative, we assume only that $f(1)\ne 0$; this guarantees the existence of $f^{-1}$. 

If A is the set of all arithmetic functions (complex-valued if we wish) which 
do not vanish at 1, then it is well known that A is a commutative group under 
the Dirichlet product 

$$(f*g)(n) = \sum_{d\mid n} f(d)g(n/d),$$

and that A is a commutative monoid under pointwise multiplication (fg)(n) = 
f(n)g(n). Obviously the pointwise identity is the constant function 1, and the 
Dirichlet identity is $(f*f^{-1})(n)=\delta_{1n}$ (Kronecker delta). If we let I(n) = n, then 
the problem asks us to show that $(fI)*f^{-1}=f\phi$ if and only if f is completely 
multiplicative; this is evidently equivalent to showing that $(f\phi)*f=fI$ if and only 
if f is completely multiplicative. 

Suppose that f is completely multiplicative. Then pointwise multiplication 
by f distributes over the Dirichlet product [J. Lambek, Arithmetical functions 
and distributivity, this MONTHLY 73 (1966) 969-973]. Since $\phi*1=I$ (a well-known 
number theoretic result), we have that 

$$fI = f(\phi*1) = (f\phi)*(f1) = (f\phi)*f,$$

as was to be shown. 
Conversely, if $fI=(f\phi)*f$, then in particular $f(1)=(f(1))^2$ so that f(1) = 1. For 
n>1, we have that 

(1)  $$(n-1-\phi(n))f(n) = {\sum}' f(d)\phi(d)f(n/d),$$

where the prime on the sum indicates that we sum over all divisors of n except n 
and 1. Now define g by defining g(p) = f(p) if p = 1 or p is prime, and extending g 
to all n so as to be completely multiplicative. By the first part of the problem 

(2)  $$(n-1-\phi(n))g(n) = {\sum}' g(d)\phi(d)g(n/d),$$

holding for all n>1. Equations (1) and (2) now imply that if n is composite and 
if f(d) = g(d) for all d<n, then f(n) = g(n), since $\phi(n)\ne n-1$ if n is composite. 
Thus f = g by complete induction, and f is completely multiplicative. 


Also solved by B. Averbach, Trygve Breiteig (Norway), L. Carlitz, N. J. Fine, M. G. Greening 
(Australia), Emil Grosswald, Douglas Holdridge, Wells Johnson, R. Sita Ramachendra Rao 
(India), David Rearick, Simeon Reich (Israel), E. F. Schmeichel, E. A. Smith, E. W. Trost 
(Switzerland), and the proposer. 


Editorial Note. Most solvers noted that if f is completely multiplicative, then $f^{-1}=f\mu$, where $\mu$ 
is the Möbius function. Thus 

$$\sum_{d\mid n} d\,f(d)f^{-1}(n/d) = \sum_{d\mid n} d\,f(d)f(n/d)\mu(n/d) = f(n)\sum_{d\mid n} d\,\mu(n/d) = f(n)\phi(n).$$

(The fact that $I*\mu=\phi$ is well known; M. W. Potter supplied the reference, Cashwell and Everett, 
The ring of number-theoretic functions, Pacific J. Math., 9 (1959) 975-985). The converse was 
generally shown by assuming that f was multiplicative and then proving by induction that $f(p^k)=(f(p))^k$ 
for p prime. Carlitz and Rearick noted that f need not be assumed to be multiplicative to show this. 

Lambek (ibid.) showed that f is completely multiplicative if and only if f distributes over all 
Dirichlet products: $f(g*h)=fg*fh$ for all g, h. The present problem asserts essentially that f is 
completely multiplicative if and only if $f(\phi*1)=f\phi*f1$, i.e., that f distributes over the particular 
Dirichlet product $\phi*1=I$. 
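
The equivalence can be observed numerically on small ranges. The sketch below is an illustration under stated assumptions: the helper names are invented for the example, the Dirichlet inverse is built directly from its defining recurrence with exact rational arithmetic, and the two test functions (the square function, which is completely multiplicative, and the divisor-counting function, which is multiplicative but not completely so) are choices made here rather than examples from the published solution.

```python
from fractions import Fraction
from math import gcd


def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]


def phi(n):
    """Euler's function, by direct count."""
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)


def dirichlet_inverse(f, N):
    """List inv with inv[n] equal to the Dirichlet inverse of f at n, for 1 <= n <= N."""
    inv = [None, Fraction(1) / f(1)]
    for n in range(2, N + 1):
        # Defining relation: sum over d | n of f(n/d) * inv[d] is 0 for n > 1.
        s = sum(f(n // d) * inv[d] for d in divisors(n) if d < n)
        inv.append(-s / f(1))
    return inv


def identity_holds(f, N):
    """Check sum_{d|n} f(d) f^{-1}(n/d) d == f(n) phi(n) for all n <= N."""
    inv = dirichlet_inverse(f, N)
    return all(
        sum(f(d) * inv[n // d] * d for d in divisors(n)) == f(n) * phi(n)
        for n in range(1, N + 1)
    )


def square(n):
    return n * n


def tau(n):
    return len(divisors(n))


if __name__ == "__main__":
    print(identity_holds(square, 60), identity_holds(tau, 60))
```

For the divisor-counting function the identity already fails at n = 4, where the left side is 5 and the right side is 6.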

The Functional Equation $F(x^m)=[F(x)]^n$ 
E 2197 [1969, 1063]. Proposed by M. S. Klamkin, Ford Scientific Laboratory, 
and D. J. Newman, Yeshiva University 


Solve the functional equation $F(x^m)=[F(x)]^n$. 


Solution by the proposers. Let $\log F(x)=G(x)(\log x)^a$, where a = (log n)/ 
(log m). Then $G(x^m)=G(x)$. A general solution for G(x) is G(x) = H(log log x), 
where H is periodic with period log m. Thus 

$$F(x) = \exp[H(\log\log x)\cdot(\log x)^a].$$

It is to be noted that the problem was deliberately incompletely formulated 
in that no class of functions F and no domain of x were specified, nor the 
constants m, n. If F(x) is to be real, then it is assumed that x>1. In the above it is also 
assumed that m, n>0 and $m\ne 1$. (The case m = 1 is easily handled. If mn = 0, 
the equation may be solved by inspection. The above solution is also valid when 
m, n are not both positive, provided a can be chosen so that $m^a=n$.) 


Also solved by Michael Amling, E. P. Del Norte, G. A. Edgar, Leon Gerber, G. A. Heuer, and 
E. W. Trost (Switzerland). 
Sidney Penner had earlier posed the special case m=n=2. 

All solutions of Advanced Problems should be sent to J. Barlaz, Rutgers, The State University, 
New Brunswick, N.J. 08903. Solutions of Advanced Problems in this issue should be 
typed (with double spacing) on separate, signed sheets and should be mailed before December 
31, 1970. Contributors (in the United States) who desire acknowledgment of receipt of their 
solutions are asked to enclose self-addressed stamped postcards. 


An asterisk means that no solution was submitted. 

5736 [1970, 532]. Correction. Proposed by M. S. Klamkin, Ford Scientific 
Laboratory 

Solve the nonlinear difference equation of rth order 

D, = a,Dn} + a2Dy_ 1D 9 +o + aD Dy-2 ++: n-rt1Dnnrs 

(m, r, $a_i$ constants). 

5746. Proposed by Leonard Carlitz, Duke University 

Let $GF(2^n)$ denote the finite field of order $2^n$. For $a\in GF(2^n)$ put 

$$e(a) = (-1)^{t(a)}, \qquad t(a) = a + a^2 + a^{2^2} + \cdots + a^{2^{n-1}},$$

$$S(a) = \sum e\left(x+y+z+\frac{a}{yz+zx+xy}\right),$$

where the summation is over all $x, y, z\in GF(2^n)$ such that $yz+zx+xy\ne 0$. Show 
that 

$$S(a) = (-1)^n 2^n \sum_{x\ne 0} e(x+ax'), \qquad (xx'=1).$$

5747. Proposed by H. M. Edgar and Martin Billik, San Jose State College 
Let $1<n_1<n_2<\cdots<n_k$ be integers. Let the integers $b_i$ satisfy $0\le b_i\le n_i-1$ 
for every value of i with $1\le i\le k$ and assume that $(n_i, n_j)$ is not a 
divisor of $(b_i-b_j)$ for all $i\ne j$, $1\le i, j\le k$. Prove that there must exist an integer 
x satisfying $x\not\equiv b_i$ (mod $n_i$) for all i with $1\le i\le k$. 


5748. Proposed by R. L. Graham, Bell Telephone Laboratories 
Let $0<a_n<a_{n+1}+a_{n^2}$, $n\ge 1$. Show that $\sum_{n=1}^{\infty} a_n$ diverges. 


5749.* Proposed by R. L. Graham, Bell Telephone Laboratories 
Let $0<a_1<\cdots<a_n$ be integers. If (a, b) denotes the greatest common 
divisor of a and b, show that 

$$\frac{a_i}{(a_i, a_j)} \ge n$$

for some i and j. 


5750. Proposed by John Horvath, University of Maryland 

Let $f$ be a decreasing, bounded function, defined on the interval $0 \le x \le 1$ of the real line. Prove that there exists a sequence $(f_n)$ of continuous, decreasing functions having the same bounds as $f$, which converges almost everywhere to $f$.

5751. Proposed by Marvin Marcus, University of California at Santa Barbara

Let $S_m$ denote the symmetric group of degree $m$. For $\sigma \in S_m$ let $c(\sigma)$ be the number of cycles (including cycles of length 1) in the disjoint cycle decomposition of $\sigma$. Prove that for $1 \le m \le n$,

\[ (1) \qquad \binom{n}{m} = \frac{1}{m!} \sum_{\sigma \in S_m} (\operatorname{sgn} \sigma)\, n^{c(\sigma)}. \]

Also prove that for any positive integers $m$ and $n$,

\[ (2) \qquad \binom{n+m-1}{m} = \frac{1}{m!} \sum_{\sigma \in S_m} n^{c(\sigma)}. \]
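For small $m$ and $n$, identities (1) and (2) of Problem 5751 can be checked by brute force over $S_m$. The following is only a sanity check, not a proof; the function names are illustrative, and permutations are represented as tuples of the indices $0, \ldots, m-1$.

```python
from fractions import Fraction
from itertools import permutations
from math import comb, factorial


def cycle_count(perm):
    """Number of cycles (fixed points included) of a permutation given as a tuple."""
    seen = [False] * len(perm)
    cycles = 0
    for start in range(len(perm)):
        if not seen[start]:
            cycles += 1
            j = start
            while not seen[j]:
                seen[j] = True
                j = perm[j]
    return cycles


def right_hand_sides(m, n):
    """Right-hand sides of (1) and (2), computed exactly."""
    signed = unsigned = 0
    for perm in permutations(range(m)):
        c = cycle_count(perm)
        sign = (-1) ** (m - c)  # sgn(sigma) = (-1)^(m - c(sigma))
        signed += sign * n ** c
        unsigned += n ** c
    return Fraction(signed, factorial(m)), Fraction(unsigned, factorial(m))


def identities_hold(m, n):
    """True when both binomial identities hold for this pair (m, n)."""
    rhs1, rhs2 = right_hand_sides(m, n)
    return rhs1 == comb(n, m) and rhs2 == comb(n + m - 1, m)
```

For example, `identities_hold(2, 3)` compares $(9 - 3)/2 = 3$ with $\binom{3}{2}$ and $(9 + 3)/2 = 6$ with $\binom{4}{2}$.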


5752.* Proposed by Kesiraju Satyanarayana, Rajahmundry, India

If

\[ \sigma_{m,r} = \sum_{p=0}^{m} (-1)^p \binom{2m+1}{p}^{r} \sum_{x=p+1}^{2m+1-p} \frac{1}{x} \]

is expressed as a fraction in lowest terms, show that

(1) $\sigma_{m,1} = 1/(2m+1)$.

(2) For $m = 1$ to 7, the following results hold:
(i) $\sigma_{m,2}$, $\sigma_{m,3}$ have the sign of $(-1)^m$.
(ii) The numerator of $|\sigma_{m,2}|$ is a power of 2.
(iii) The numerator of $|\sigma_{m,3}|/(2m+1)!$ is the product of some successive prime numbers immediately following $2m+1$.

(3) Do the results in (2) hold in general? 
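The quantities in Problem 5752 are easy to tabulate exactly. The sketch below assumes the definition reads $\sigma_{m,r} = \sum_{p=0}^{m} (-1)^p \binom{2m+1}{p}^r \sum_{x=p+1}^{2m+1-p} 1/x$, which is a reading of the printed formula rather than something the source states unambiguously. The function name `sigma` is illustrative.

```python
from fractions import Fraction
from math import comb


def sigma(m, r):
    """Exact value of sigma_{m,r} under the reading stated in the lead-in."""
    total = Fraction(0)
    for p in range(m + 1):
        # Inner harmonic-type sum over x = p+1, ..., 2m+1-p.
        inner = sum(Fraction(1, x) for x in range(p + 1, 2 * m + 2 - p))
        total += (-1) ** p * comb(2 * m + 1, p) ** r * inner
    return total
```

With $m = 1$ this gives $\sigma_{1,1} = 1/3$, $\sigma_{1,2} = -8/3$ and $\sigma_{1,3} = -35/3$, which agrees with parts (1) and (2) of the problem: $8$ is a power of 2, and $35/(3 \cdot 3!) = 35/18$ has numerator $5 \cdot 7$.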


SOLUTIONS OF ADVANCED PROBLEMS 
On ABA-Groups 


5673 [1969, 565; 834]. Proposed by Stanley E. Payne, Miami University, Ohio

Let $G$ be a group with subgroups $A$ and $B$ such that $(A : A \cap B) = (B : A \cap B) = k \ge 3$, and such that there is an $n < \infty$ and an $x \in G$ with $Ax^i$, $Bx^i$, $1 \le i \le n$, being all the distinct left cosets of $A$ and $B$ in $G$. Then prove that $ABA \cap BAB \ne AB \cup BA$ if $AB \cap BA = A \cup B$.


Solution by the proposer. Consider the coset geometry $\pi(G, A, B)$ in which points are all left cosets $Ax$ and lines are all left cosets $Bx$, $x \in G$. Then $Ax$ is incident with $By$ if and only if $Ax \cap By \ne \emptyset$. A chain of incident points and lines starting with $A$ must have the form $A, Ba_1, Ab_1a_1, Ba_2b_1a_1, \ldots$, for which the chain is irreducible if and only if each $a_i \notin B$ and each $b_i \notin A$. Here $(A : A \cap B) = k \ge 3$ implies $A$ is incident with at least three distinct lines $B$, $Bx^i$, $Bx^j$. Then
A, B, Ax-*, Bx-**3, Ax-**4, Bxi, A is a closed irreducible chain, whence A = 
$Ab_3a_3b_2a_2b_1a_1$ for $a_i \in A \setminus B$, $b_i \in B \setminus A$, which is essentially what we needed to show. (For reference see Higman and McLaughlin, Geometric ABA-groups, Illinois J. Math., 5 (1961) 382-397.)


Complete Metric and the Real Line 

5676 [1969, 566]. Proposed by Albert Wilansky, Lehigh University 

Let $X = \{0\} \cup (1, \infty)$ with the ordinary topology of the real line. Suppose that a complete metric is given for this topology. Must $(1, \infty)$ contain a closest point to $\{0\}$?

Solution by W. O. Alltop, China Lake, California. We answer the question negatively. Clearly $X$ is homeomorphic to $Y = \{P\} \cup R$, where $R$ is the real line with the usual topology, and $P$ is a point isolated from $R$, i.e., $\{P\}$ is an open set. For $x, y \in R$ let

\[ d(x, y) = |x - y| / (1 + |x - y|). \]

This well-known distortion of the usual metric leaves $(R, d)$ homeomorphic to $(R, |\ |)$. Moreover, the Cauchy sequences of $(R, d)$ are precisely those of $(R, |\ |)$. Thus, $(R, d)$ is complete. Now for $x \in R$, let

\[ d(P, x) = (2 + |x|) / (1 + |x|). \]

The distance from $P$ to $R$ is 1, but this distance is not achieved for any point of $R$. It remains only to show that $(Y, d)$ is a metric space, i.e., that the triangle inequality holds for $d$. For $x, y \in R$ we have

\[ d(P, x) + d(P, y) > 2 > 1 > d(x, y). \]

Also

\[ |d(P, y) - d(P, x)| = \frac{\bigl|\,|x| - |y|\,\bigr|}{(1 + |x|)(1 + |y|)} \le \frac{|x - y|}{1 + |x - y|} = d(x, y). \]

Thus, $d(P, y) \le d(P, x) + d(x, y)$, so $d$ is a metric for $Y$.
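The two triangle inequalities involving $P$, and the fact that $d(P, x)$ exceeds 1 while coming arbitrarily close to it, can be spot-checked with exact rational arithmetic. This sketch assumes the metric reads $d(x, y) = |x - y|/(1 + |x - y|)$ and $d(P, x) = (2 + |x|)/(1 + |x|)$. The names and the sample grid are illustrative, and a finite check is of course no substitute for the argument above.

```python
from fractions import Fraction
from itertools import product


def d_real(x, y):
    """Bounded metric on R: |x - y| / (1 + |x - y|)."""
    gap = abs(x - y)
    return gap / (1 + gap)


def d_point(x):
    """Distance from the isolated point P to x in R: (2 + |x|) / (1 + |x|)."""
    return (2 + abs(x)) / (1 + abs(x))


def triangle_ok(x, y):
    """Both triangle inequalities that involve P, for one pair x, y."""
    return (d_real(x, y) <= d_point(x) + d_point(y)
            and d_point(y) <= d_point(x) + d_real(x, y))


# Quarter-integer grid on [-10, 10], chosen only for illustration.
samples = [Fraction(k, 4) for k in range(-40, 41)]
all_ok = all(triangle_ok(x, y) for x, y in product(samples, repeat=2))
```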


Also solved by Linda W. Brinn, J. M. Cibulskis, D. E. Cooper, E. P. Del Norte, L. J. Dickson,
Michael Golomb, D. A. Hejhal, Ellen Hertzmark & W. C. Waterhouse, D. W. Hodwin, H. C.
Kranzer, M. D. Mavinkurve (India), W. G. McArthur, Torquist Memp, J. T. Rosenbaum, D. A.
Zave and Martin Ziegler (Clausthal).


The Spectral Radius of a Matrix 


5677 [1969, 701]. Proposed by B. W. Levinger, Case Western Reserve Uni- 
versity 


Let $A = (a_{ij})$ be a real $n \times n$ matrix with $a_{ij} \ge 0$, $1 \le i, j \le n$. Prove that $r(A) \le r[\tfrac12 (A + A^T)]$, where $r(C)$ denotes the spectral radius of a matrix $C$.


I. Solution by Emeric Deutsch, Polytechnic Institute of Brooklyn. By Frobenius' theorem (see e.g. P. Lancaster, Theory of Matrices, Academic Press, New York-London, 1969, p. 288), $r(A)$ is an eigenvalue of $A$. Now by Bendixson's theorem (see e.g. A. S. Householder, The Theory of Matrices in Numerical Analysis, Blaisdell, New York-Toronto-London, 1964, p. 69), we have $\beta_n \le r(A) \le \beta_1$, where $\beta_1 \ge \beta_2 \ge \cdots \ge \beta_n$ are the eigenvalues of $\tfrac12 (A + A^T)$. Hence $r(A) \le r[\tfrac12 (A + A^T)]$.
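In the $2 \times 2$ case the inequality $r(A) \le r[\tfrac12(A + A^T)]$ can be seen directly from the quadratic formula, since $(b + c)^2 \ge 4bc$. The sketch below is an illustration for nonnegative $2 \times 2$ matrices only and plays no part in the solution above; the function names are made up for this example.

```python
from math import sqrt


def spectral_radius_2x2(a, b, c, d):
    """Spectral radius of [[a, b], [c, d]] with nonnegative entries.

    The discriminant (a - d)^2 + 4bc is nonnegative, so both eigenvalues are
    real and the larger one, computed here, is the spectral radius.
    """
    return ((a + d) + sqrt((a - d) ** 2 + 4 * b * c)) / 2


def symmetric_part_radius_2x2(a, b, c, d):
    """Spectral radius of (A + A^T)/2, whose off-diagonal entry is (b + c)/2."""
    h = (b + c) / 2
    return spectral_radius_2x2(a, h, h, d)
```

For $A = \begin{pmatrix} 0 & 4 \\ 1 & 0 \end{pmatrix}$ the two values are $2$ and $5/2$.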


II. Solution by Bertram Walsh, University of California at Los Angeles. The result of the problem is generalized by the following two propositions:

Proposition I. Let $H$ be a (complex) Hilbert space, $A \in \mathcal{L}(H)$, and let $\mathrm{Re}\,A = \tfrac12(A + A^*)$. If $\lambda \in \sigma(A)$, then $\mathrm{Re}\,\lambda \le \|\mathrm{Re}\,A\| = r(\mathrm{Re}\,A)$.

Proposition II. Let $E$ be an ordered (real) Banach space, and let $K$ denote its cone of nonnegative elements. Suppose that $E = K - K$ and that $K$ is normal in the sense that the topology of $E$ is generated by a norm which is an increasing function on $K$. If $A \in \mathcal{L}(E)$ is nonnegative (i.e., if $A[K] \subset K$), then $r(A) \in \sigma(A)$, where the spectrum is taken in the complexification of $E$. [This may be found in H. Schaefer, Topological Vector Spaces, 1966, Appendix, Thm. 2.2, p. 263. The matrix version goes back to Frobenius (S.-B. Preuss. Akad. Wiss., Berlin 1908, 471-476; 1909, 514-518; 1912, 456-477.)]

To put these propositions together to solve the original problem, one need only give $R^n$ the Euclidean norm and the coordinatewise order; its complexification is the Hilbert space $C^n$, and one may set $\lambda = r(A)$ in Proposition I by virtue of Proposition II. More generally, one may take any nonnegative operator $A$ on $L^2_R(\mu)$, where $(X, S, \mu)$ is a given measure space, and derive the inequality $r(A) \le r(\mathrm{Re}\,A)$; the complexification of $L^2_R(\mu)$ in this setting is of course $L^2_C(\mu)$.


Also solved by G. P. Barker, J. R. Kuttler, Henryk Minc, R. K. Mueller, R. C. Thompson, 
and the proposer. 


Note. Barker notes that the result is generalized by replacing the condition $a_{ij} \ge 0$ by its consequence that $r(A)$ is an eigenvalue of $A$.


Convergence Set for $\sin(\lambda_n x + \epsilon_n)$


5679 [1969, 701]. Proposed by H. Kestelman, University College, London, 
England 


$\{\lambda_n\}$ and $\{\epsilon_n\}$ are real number sequences with $\lim_{n \to \infty} \lambda_n = \infty$. Prove that the set of $x$ for which the sequence $\{\sin(\lambda_n x + \epsilon_n)\}$ converges has Lebesgue measure zero but may have cardinal $c$.


Solution by H. E. Bray, Rice Institute. We may express $\epsilon_n$ as $2\pi k_n + \delta_n$, $k_n$ an integer, $0 \le \delta_n < 2\pi$. Thus $\sin(\lambda_n x + \epsilon_n) = \sin(\lambda_n x + \delta_n)$, where the sequence $\{\delta_n\}$, being bounded, contains a subsequence converging to a number $\delta$. Hence if $\sin(\lambda_n x + \epsilon_n)$ converges so does $\sin(\lambda'_n x + \delta'_n)$, where $\{\lambda'_n\}$, $\{\delta'_n\}$ are corresponding subsequences of $\{\lambda_n\}$, $\{\delta_n\}$ and $\lim \delta'_n = \delta$. We may now drop primes and suppose that $\sin(\lambda_n x + \delta_n)$ converges on a set $E$ which is of course measurable since the lim sup and lim inf of a sequence of measurable functions are measurable.

\begin{align*}
\sin(\lambda_n x + \delta_n) &= \sin \lambda_n x \cos \delta_n + \cos \lambda_n x \sin \delta_n \\
&= \sin \lambda_n x \cos \delta + \cos \lambda_n x \sin \delta + o(1).
\end{align*}

Hence $\int_E \sin(\lambda_n x + \delta_n)\,dx$ converges to zero over every measurable set $E$, by a well known property of the Lebesgue integral, and therefore $\sin(\lambda_n x + \delta_n)$ cannot converge to a positive (negative) function on a set of positive measure.

It remains to prove that $\sin(\lambda_n x + \epsilon_n)$ cannot converge to zero on a set $E$ of positive measure. Supposing on the contrary that $\sin^2(\lambda_n x + \epsilon_n)$ converges to zero on $E$ and writing:

\[ 2 \sin^2(\lambda_n x + \epsilon_n) = 1 - \cos(2\lambda_n x + 2\epsilon_n), \]

it follows that

\[ \cos(2\lambda_n x + 2\epsilon_n) = \sin(2\lambda_n x + 2\epsilon_n + \tfrac12 \pi) \]

converges to unity on $E$. But this is impossible if $|E| > 0$ as we have proved. Hence $\sin(\lambda_n x + \epsilon_n)$ cannot converge on a set of positive measure.

If now we consider all values of $x$ representable in the binary scale in the non-terminating form:

\[ x = .a_1 0 a_2 00 a_3 000 a_4 0 \cdots, \qquad a_i = 0 \text{ or } 1, \quad \sum a_i = \infty, \]

where $a_n$ is followed by $n$ zeros preceding $a_{n+1}$ and occupies the $(p_n)$th place, where $p_n = n(n+1)/2$, it follows that if $\lambda_n = 2\pi \cdot 2^{p_n}$ then

\[ \lambda_n x = 2k_n\pi + 2\pi(.000 \cdots a_{n+1} 00 \cdots a_{n+2} 00 \cdots) \]

where $k_n$ is an integer determined by $x$ and $n$.

And since the second term is less than $2\pi/2^n$ it follows that $\sin \lambda_n x$ converges to zero on a set of values of $x$ which are in one-to-one correspondence with the set of non-terminating numbers of the form:

\[ y = .a_1 a_2 a_3 \cdots, \qquad \sum a_n = \infty. \]

Each point of the interval $0 < y \le 1$ is representable in one and only one way by a number of this set. Hence $\sin \lambda_n x$ converges to zero on a set of measure zero and of cardinal number $c$.
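The key estimate in this construction is that multiplying $x$ by $2^{p_n}$, with $p_n = n(n+1)/2$, leaves a fractional part smaller than $2^{-n}$. The sketch below checks it exactly on a truncated expansion. Using a finite digit list is an assumption made for illustration (the text requires infinitely many nonzero digits), and the function names are hypothetical.

```python
from fractions import Fraction


def place(n):
    """Binary place p_n = n(n+1)/2 occupied by the digit a_n."""
    return n * (n + 1) // 2


def build_x(digits):
    """x = sum of a_n 2^(-p_n) for a finite list of digits a_1, a_2, ... (0 or 1)."""
    return sum((Fraction(a, 2 ** place(n)) for n, a in enumerate(digits, start=1)),
               Fraction(0))


def fractional_part_after_shift(x, n):
    """Fractional part of 2^(p_n) x, that is, of lambda_n x / (2 pi)."""
    shifted = x * 2 ** place(n)
    return shifted - (shifted.numerator // shifted.denominator)
```

Since $\lambda_n x$ differs from an integer multiple of $2\pi$ by $2\pi$ times this fractional part, the bound $2^{-n}$ is what forces $\sin \lambda_n x \to 0$.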


Also solved by G. J. Foschini, D. A. Hejhal, R. K. Meany, P. van der Steen (Netherlands),
and the proposer. 


Ideals in a Dedekind Domain 
5680 [1969, 701]. Proposed by Kamlesh Wasan, University of Delhi, India 


Consider a Noetherian integrally closed domain $R$ in which each semi-primary ideal (i.e., an ideal whose radical is a prime ideal) is irreducible. Prove $R$ is a Dedekind domain.


I. Solution by James R. Smith, Appalachian State University. By a theorem in Zariski and Samuel, Commutative Algebra, Vol. I, p. 275, all that is needed is to prove that every proper prime ideal of $R$ is maximal. So, let $P$ be a prime ideal of $R$, $M$ a maximal ideal which contains $P$. Then $P \cap M^n$ is a semi-primary ideal for all $n$, for if $x \in P$, $x \in M$, so $x^n \in P \cap M^n$. By hypothesis $P \cap M^n$ is therefore irreducible, and thus $P \cap M^n = P$ or $P \cap M^n = M^n$. If for some $n$, $M^n = P \cap M^n$, then $P = M$ since $P$ is prime, and we are finished. If for every $n$, $P \cap M^n = P$, then $P \subset \bigcap_{n \in N} M^n = 0$ (Zariski and Samuel, p. 216).


II. Solution by Robert Gilmer, Florida State University. The following more general result is true:

If $R$ is a Noetherian commutative ring with identity, and if for each maximal ideal $M$ of $R$, $M^2$ is irreducible, then each ideal of $R$ is a finite product of prime ideals.

Proof. K. Asano (Journal of the Mathematical Society of Japan, 3 (1951), pp. 82-90, Satz 5) has proved that each ideal of a commutative ring $S$ with identity is a finite product of prime ideals if and only if $S$ is Noetherian and there are no ideals properly between $M$ and $M^2$ for each maximal ideal $M$ of $S$. Hence we show that the given hypothesis on $R$ implies that there are no ideals properly between $M$ and $M^2$ for each maximal ideal $M$ of $R$.

$M/M^2$ is canonically a vector space over the field $R/M$ under the scalar multiplication $(r + M)(m + M^2) = rm + M^2$. Further, the mapping $A \to A/M^2$ is a one-to-one correspondence between the set of ideals of $R$ lying between $M$ and $M^2$ and the set of subspaces of $M/M^2$, and this correspondence preserves intersection. Since $M^2$ is an irreducible ideal of $R$, the zero subspace of $M/M^2$ is irreducible; this implies, however, that $M/M^2$ has dimension at most one and, consequently, there are no ideals of $R$ properly between $M$ and $M^2$.


Also solved by M. L. Laplaza, Ka Menehune, and the proposer. 


Zeros of Hermite-type Polynomials 
5681 [1969, 702]. Proposed by C. L. Sabharwal, St. Louis University 


Let $H_n(x)$ be a monic Hermite polynomial of degree $n$, given by

\[ H_n(x) = (-1)^n e^{x^2/2} \frac{d^n}{dx^n} \bigl(e^{-x^2/2}\bigr), \]

and let $q$ and $m$ be nonnegative integers with $q + m$ even. Show that

\[ P(x) = \sum_{\nu=0}^{\min(q,m)} (-1)^{\nu} \binom{q}{\nu} \frac{m!}{(m-\nu)!}\, x^{m-\nu} H_{q-\nu}(x) \]

has $q + m$ distinct nonzero real roots if $q > m$, and $2q$ distinct nonzero real roots if $q \le m$.


Solution by O. P. Lossers, Technological University, Eindhoven, The Netherlands. Substitute the Hermite polynomial in $P(x)$; then by means of Leibniz' formula one has

\[ (1) \qquad P(x) = (-1)^q e^{x^2/2} \sum_{\nu=0}^{\min(q,m)} \binom{q}{\nu} \frac{d^{\nu}}{dx^{\nu}}(x^m)\, \frac{d^{q-\nu}}{dx^{q-\nu}}\bigl(e^{-x^2/2}\bigr) = (-1)^q e^{x^2/2} \frac{d^q}{dx^q}\bigl(x^m e^{-x^2/2}\bigr). \]

We distinguish the following cases:

(i) $m = 0$. Then one has

\[ P(x) = (-1)^q e^{x^2/2} \frac{d^q}{dx^q}\bigl(e^{-x^2/2}\bigr) = H_q(x). \]

It is well known that the Hermite polynomial $H_q(x)$ has $q$ distinct real roots. If $q$ is even all roots are nonzero; if $q$ is odd one root is zero and the remaining $q - 1$ roots are different from zero.

(ii) $q = 0$. Then one has $P(x) = x^m$, i.e., $P(x)$ only has a zero root of multiplicity $m$.

(iii) $m > 0$, $0 < q \le m$. In view of (1), $P(x)$ has a zero root of multiplicity $m - q$. Because $x^m e^{-x^2/2} \to 0$ when $x \to \pm\infty$, it follows from Rolle's theorem, using induction on $q$, that $P(x)$ has $2q$ distinct nonzero real roots.

(iv) $m > 0$, $q > m$. According to Rolle's theorem $P(x)$ has $q + m$ distinct real roots. If $q + m$ is even all roots are nonzero; if $q + m$ is odd one root is zero and the remaining $q + m - 1$ roots are different from zero.
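The Leibniz step, which identifies the sum defining $P(x)$ with $(-1)^q e^{x^2/2}\, d^q/dx^q\,(x^m e^{-x^2/2})$, can be checked mechanically for small $q$ and $m$. The sketch below assumes the sum reads $\sum_{\nu} (-1)^{\nu} \binom{q}{\nu} \frac{m!}{(m-\nu)!} x^{m-\nu} H_{q-\nu}(x)$. Polynomials are stored as integer coefficient lists with the constant term first, and all helper names are illustrative.

```python
from math import comb, factorial


def derivative(p):
    """Coefficient list of p'."""
    return [k * c for k, c in enumerate(p)][1:] or [0]


def times_x(p):
    """Coefficient list of x * p(x)."""
    return [0] + p


def add(p, q, sign=1):
    """Coefficient list of p + sign * q."""
    n = max(len(p), len(q))
    p = p + [0] * (n - len(p))
    q = q + [0] * (n - len(q))
    return [a + sign * b for a, b in zip(p, q)]


def trim(p):
    """Drop trailing zero coefficients, keeping at least one entry."""
    while len(p) > 1 and p[-1] == 0:
        p = p[:-1]
    return p


def hermite(n):
    """Monic H_n, built from H_{k+1} = x H_k - H_k'."""
    h = [1]
    for _ in range(n):
        h = trim(add(times_x(h), derivative(h), sign=-1))
    return h


def p_from_sum(q, m):
    """P(x) from the finite sum, under the reading stated in the lead-in."""
    total = [0]
    for v in range(min(q, m) + 1):
        coeff = (-1) ** v * comb(q, v) * (factorial(m) // factorial(m - v))
        term = [coeff * c for c in hermite(q - v)]
        for _ in range(m - v):
            term = times_x(term)
        total = add(total, term)
    return trim(total)


def p_from_derivative(q, m):
    """(-1)^q e^{x^2/2} d^q/dx^q (x^m e^{-x^2/2}).

    Uses (g e^{-x^2/2})' = (g' - x g) e^{-x^2/2} to stay within polynomials.
    """
    g = [0] * m + [1]
    for _ in range(q):
        g = add(derivative(g), times_x(g), sign=-1)
    return trim([(-1) ** q * c for c in g])
```

For $q = m = 1$ both routes give $x^2 - 1$, whose two nonzero real roots match the count $2q$ in case (iii).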


Also solved by the proposer. 


Convergence Domains of Power and Dirichlet Series 


5682 [1969, 702]. Proposed by Robert Spira, Michigan State University 


What is the relation of the radius of convergence of $\sum a_n z^n$ and the abscissa of convergence of $\sum a_n n^{-z}$?


Solution by A. C. Hindmarsh, Livermore, California. The relation between the radius of convergence $R$ of $\sum a_n z^n$ and the abscissa of convergence $A$ of $\sum a_n n^{-z}$ is as follows:

(a). If $R > 1$, then $\limsup |a_n|^{1/n} < 1$. Thus, for some $\alpha < 1$ we have $|a_n| < \alpha^n$ for all large $n$. Hence $\sum a_n n^{-z}$ is dominated by $\sum \alpha^n n^{-x}$, with $x = \mathrm{Re}(z)$, and this converges for all $x$. Therefore $A = -\infty$.

(b). If $R < 1$, then $\limsup |a_n|^{1/n} > 1$, and for some $\alpha > 1$, we have $|a_n| > \alpha^n$ for infinitely many $n$. For such $n$, $|a_n n^{-z}| > \alpha^n n^{-x}$, which tends to $\infty$ as $n \to \infty$. Therefore $\sum a_n n^{-z}$ diverges for all $z$, or $A = +\infty$.

(c). If R=1, we may have A finite, as in the case a,=124, or A= — ©, as 
when a,=n7~'e", or A =-+ 0, as when a,=e". 


Also solved by D. A. Hejhal, M. D. Mavinkurve (India), and the proposer. 


Note. Mavinkurve calls attention to the following which appears in M. A. Evgrafov, Collection of Problems in the Theory of Analytic Functions, Moscow, 1969, 12.03. Let $\{\lambda_n\}$ be an increasing sequence of positive numbers and let $\limsup_{n \to \infty} (n/\lambda_n) = \alpha > 0$, and $\limsup |a_n|^{1/n} = \rho > 0$. Show that the sum of the series $\sum a_n e^{-\lambda_n z}$ is regular in the halfplane $\mathrm{Re}\, z > \alpha \log \rho$.


‘Packing’ the Unit Circle 


5684 [1969, 835]. Proposed by Steven Minsker, Massachusetts Institute of 
Technology. 


Let $U = \{z : |z| < 1\}$ in the complex plane. We pick a sequence of open disks $U_n$ of radius $r_n$ such that the following three conditions are satisfied:

\[ \text{(a) } U_n \subset U, \qquad \text{(b) } U_i \cap U_j = \emptyset, \qquad \text{(c) } \sum_{n=1}^{\infty} r_n < \infty, \]

$n = 1, 2, 3, \ldots$, $i \ne j$. Let $X = U - \bigcup_{n=1}^{\infty} U_n$. Prove that $X$ has positive Lebesgue measure.

Editorial Note. As pointed out by several readers, this problem has been solved and generalized. See number 5528 [1968, 913].

Also solved by R. O. Davies (England), D. A. Hejhal, R. A. Horn, Douglas Lind, J. B. Linder, 
O. P. Lossers (Netherlands), Simeon Reich (Israel), and the proposer. 


The Arithmetic-Geometric Mean Inequality 


5685 [1969, 835]. Proposed by D. E. Daykin, University of Malaya, Kuala Lumpur


Let $m$, $n$ be positive integers and $a_1, a_2, \ldots, a_n$ be positive reals. For $i = 1, 2, 3, \ldots$, put $a_{n+i} = a_i$ and $b_i = a_{i+1} + a_{i+2} + \cdots + a_{i+m}$. Then show that $m^n a_1 a_2 \cdots a_n < b_1 b_2 \cdots b_n$, except if all the $a_i$ are equal.


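Solution by E. F. Schmeichel, College of Wooster. The desired inequality is equivalent to $\prod_{i=1}^{n} (b_i/m) > a_1 a_2 \cdots a_n$. Noting that

\[ \frac{b_i}{m} = \frac{1}{m} \sum_{k=1}^{m} a_{i+k} \ge \Bigl( \prod_{k=1}^{m} a_{i+k} \Bigr)^{1/m}, \]

with equality only if all the $a_{i+k}$ are equal, we have

\[ \prod_{i=1}^{n} \Bigl( \frac{b_i}{m} \Bigr) > \prod_{i=1}^{n} \Bigl( \prod_{k=1}^{m} a_{i+k} \Bigr)^{1/m} = \Bigl( \prod_{i=1}^{n} a_i^m \Bigr)^{1/m} = \prod_{i=1}^{n} a_i \]

unless all the $a_i$ are equal. (It should be noted that for $m = 1$ and any set $\{a_i\}_{i=1}^{n}$ the inequality must be replaced by an equality.)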
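A small numerical illustration of the inequality, with the cyclic convention $a_{n+i} = a_i$ and $b_i = a_{i+1} + \cdots + a_{i+m}$ made explicit. The function names are illustrative, and integer inputs keep the arithmetic exact.

```python
from math import prod


def cyclic_window_sums(a, m):
    """b_i = a_{i+1} + ... + a_{i+m}, indices read cyclically (a_{n+i} = a_i).

    The list a is zero-based, so a[0] plays the role of a_1.
    """
    n = len(a)
    return [sum(a[(i + k) % n] for k in range(1, m + 1)) for i in range(n)]


def both_sides(a, m):
    """Return the pair (m^n a_1 ... a_n, b_1 ... b_n)."""
    b = cyclic_window_sums(a, m)
    return m ** len(a) * prod(a), prod(b)
```

For $a = (1, 2, 3)$ and $m = 2$ the window sums are $5, 4, 3$, so the two sides are $48$ and $60$. With all $a_i$ equal, or with $m = 1$, the two sides coincide.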


Also solved by J. C. Binz (Switzerland), John Bessman, M. T. Bird, L. Carlitz, C. S. Karuppan
Chetty (India), R. O. Davies (England), N. J. Fine, David Friedman, M. G. Greening (Australia),
Robert Heller, G. A. Heuer, M. S. Klamkin, R. A. Kopas, J. R. Kuttler, Gérard Letac (France), 
O. P. Lossers (Netherlands), Andrej Makowski (Poland), R. H. C. Newton (Wales), P. J. Owens 
(England), Simeon Reich (Israel), T. Tamura (Japan), A. C. Williams, Aleksandras Zujus, and 
the proposer. 


Klamkin uses an extension of Hölder's inequality

\[ \sum_{i=1}^{n} \prod_{j=1}^{m} u_{ij} \le \prod_{j=1}^{m} \Bigl( \sum_{i=1}^{n} u_{ij}^{p_j} \Bigr)^{1/p_j} \]

to demonstrate the problem. (See Beckenbach and Bellman, Inequalities, p. 20.)


Archimedean Ordered Subfields of $Q(t)$

5686 [1969, 835]. Proposed by J. K. Washenberger, Virginia Polytechnic 
Institute 


The field $Q(t)$ of quotients of polynomials with rational coefficients is the standard example of a non-Archimedean ordered field. In this field $p(t)/q(t) > 0$ if the leading coefficient of the product $p(t)q(t)$ is positive. Characterize the Archimedean ordered subfields of $Q(t)$.


Solution by G. A. Heuer, Concordia College. The only Archimedean ordered subfield of $Q(t)$ is $Q$. For, let $F$ be such a subfield. Then $Q$, the prime field of $Q(t)$, is a subfield of $F$. If $p(t)$ has larger degree than $q(t)$, then $p(t)/q(t)$ is larger than every element of $Q$, so neither $p(t)/q(t)$ nor $q(t)/p(t)$ can be in $F$. If $p(t)$ and $q(t)$ have the same nonzero degree, then $p(t)/q(t) = c + r(t)/q(t)$, where $c$ is in $Q$ and $r(t)$ has smaller degree than $q(t)$. Since $r(t)/q(t)$ is not in $F$, neither is $p(t)/q(t)$. Thus $F = Q$.


Also solved by J. W. Andrushkiw, John Bryant & Robert Gilmer, R. R. Guenther, Melvin 
Henriksen, M. L. Laplaza (Puerto Rico), O. P. Lossers (Netherlands), M. D. Mavinkurve (India), 
and the proposer. 
Introduction to Commutative Algebra. By M. F. Atiyah and I. G. Macdonald.
Addison-Wesley, Reading, Mass., 1969. 137 pp. $7.50. (Telegraphic Re- 
view, October 1969.) 


This is a short 126 page introduction to commutative ring theory. It as- 
sumes the student has had a very solid first course (graduate level in North 
America, Part II level in England) in general algebra. Few specific results are 
assumed, but the maturity and sophistication that comes from such a course 
are assumed. The material is approximately what one might cover in a short 
course on ideal theory in commutative rings. However, here the emphasis is 
more on modules and localization than on classical ideal theory. In no way is it 
a substitute for such texts as Zariski-Samuel Commutative Algebra I, II, or 
Bourbaki Algèbre Commutative.

The authors acknowledge that the subject has its origins in algebraic number 
theory and in algebraic geometry; but here the emphasis is algebro-geometric 
and a serious student of number theory will still have need to study such texts 
as Hecke: Algebraischen Zahlentheorie. The authors' main motivation is the
preparation of the student for a systematic study of homological algebra, the 
first step for serious study of modern algebraic geometry and class field theory. 
SUMS OF SQUARES
OLGA TAUSSKY, California Institute of Technology 


1. Introduction. Sums of squares is a major concept in mathematics going 
back to ancient days and yet of great current interest. It is a subject which 
links many different branches of mathematics and produces results which have 
a certain similarity but whose complete connection is still not understood. In 
recent years logicians have been much interested in the subject too. There are 
applications to and from logic to sums of squares. Statistics from its beginnings 
has been involved in sums of squares. 

This article describes some of these ideas, but is by no means comprehensive. 
In particular the chapter on number theory is very incomplete. The presenta- 
tion is a spotlight treatment, sometimes putting very deep results next to 
easier ones, although the latter may have a particular appeal and even im- 
portance. Proofs are included only when they are very brief. Some new ideas 
are incorporated. 

This account is devoted to algebra and number theory on the whole, apart 
from describing facts which link up with analysis and topology and ought not 
to be separated. But sums of squares also occur in the very definition of the 
Hilbert space and all its consequences, e.g. in Parseval’s theorem, in the defini- 
tion of L?-convergence, in normed algebras and such like. The composition of 
infinite quadratic forms will not be discussed. Not even the theory of finite 
euclidean and unitary space will be included, nor facts concerning orthonormal 
vectors, nor the theory of norms of finite matrices. Hence orthogonal and uni- 
tary matrices are not treated, nor automorphs of quadratic forms. 


2. Pythagorean triangles and Fermat’s last theorem. The first sum of 
squares we meet in our life is in Pythagoras’ theorem 


\[ (1) \qquad a^2 + b^2 = c^2, \]

where $a$, $b$ are the sides of a right angled triangle with hypotenuse $c$. Later we meet in elementary trigonometry

\[ (2) \qquad \cos^2 \alpha + \sin^2 \alpha = 1 \]

and much later we meet

\[ (3) \qquad \cos^2 z + \sin^2 z = 1 \]

for all complex values $z$. The Pythagorean triangles too turn up at an early stage in our education. They are right angled triangles whose sides have integral ratios like

3, 4, 5; 5, 12, 13; ....


They are already mentioned in an old-Babylonian text discovered by Neuge- 
bauer and Sachs. It is known that there are infinitely many such triangles and 
that they are obtained from a parametric formula 


\[ (4) \qquad \lambda(m^2 - n^2), \quad \lambda \cdot 2mn, \quad \lambda(m^2 + n^2) \]

with $\lambda$, $m$, $n$ any integers. Sometimes the same triangle can be obtained several times by this formula, e.g. the triangle 6, 8, 10 is given by $\lambda = 1$, $m = 3$, $n = 1$, and by $\lambda = 2$, $m = 2$, $n = 1$. That every expression (4) leads to a Pythagorean triangle is clear. The converse will now be proved. We start with some general remarks.
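The parametric formula (4), read as $\lambda(m^2 - n^2)$, $\lambda \cdot 2mn$, $\lambda(m^2 + n^2)$, and the example of the triangle 6, 8, 10 can be reproduced in a few lines. The function names are illustrative.

```python
def triangle(lam, m, n):
    """Sides given by formula (4): lam(m^2 - n^2), lam*2mn, lam(m^2 + n^2)."""
    return lam * (m * m - n * n), lam * 2 * m * n, lam * (m * m + n * n)


def is_pythagorean(a, b, c):
    """True when a^2 + b^2 = c^2."""
    return a * a + b * b == c * c
```

Here `triangle(1, 3, 1)` returns `(8, 6, 10)` and `triangle(2, 2, 1)` returns `(6, 8, 10)`: the same triangle, with the two legs listed in opposite order.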

At least one of $a$, $b$ is even. For, if $a = 2n + 1$, $b = 2m + 1$, $n$, $m$ integers, then $c^2 = 4N + 2$, $N$ an integer. This is impossible, for the square of an odd number is always of the form $4r + 1$, $r$ an integer, and that of an even number is divisible by 4. We shall assume that $b$ is even. If $a$ and $b$ have a common factor $d \ne 2$, then $d \mid c$. Hence $a/d$, $b/d$ also define a Pythagorean triangle. If, however, $d = 2$ and $b/2$ is odd then we cannot remove it, unless $a/2$ is even, in which case we interchange the role of $a$ and $b$. If, however, both $a/2$ and $b/2$ are odd then $(c/2)^2$ would again be of the form $(4M + 2)$, $M$ an integer, which is impossible.

The following elementary proof for the converse uses the fact that the expressions (4) suggest the formulas for $\cos 2\alpha$, $\sin 2\alpha$. Let

\[ a^2 + b^2 = c^2. \]

Define the angle $\alpha$ $(0 < 2\alpha < \pi/2)$ by

\[ \frac{a}{c} = \cos 2\alpha, \qquad \frac{b}{c} = \sin 2\alpha. \]

Since $\cos 2\alpha = \cos^2 \alpha - \sin^2 \alpha$ and $\cos^2 \alpha + \sin^2 \alpha = 1$ we have

\[ \cos^2 \alpha = \frac12 \Bigl(1 + \frac{a}{c}\Bigr) = r_1, \text{ a rational}, \]

\[ \sin^2 \alpha = \frac12 \Bigl(1 - \frac{a}{c}\Bigr) = r_2, \text{ a rational}. \]

Since $\sin 2\alpha = 2 \sin \alpha \cos \alpha$ we have

\[ \sin \alpha \cos \alpha = \frac12\, \frac{b}{c} = r, \text{ a rational}. \]

Hence

\[ r = \sqrt{r_1 r_2}. \]

Hence

\[ \sin \alpha = \frac{r}{r_1} \cos \alpha = \frac{n}{m} \cos \alpha, \]

where $n$, $m$ are integers which we may suppose without a common factor. Put

\[ \frac{\cos^2 \alpha}{m^2} = \lambda_0 \text{ (a rational) so that } \sin^2 \alpha = \lambda_0 n^2. \]

This gives

\[ a = \lambda(m^2 - n^2), \qquad b = \lambda \cdot 2mn, \]

where $\lambda = \lambda_0 c$. We claim that $\lambda$ is necessarily integral. For, if $\lambda$ were fractional with $p$, a prime, dividing the denominator, then $p \mid (m^2 - n^2)$, $p \mid 2mn$. If $p \ne 2$, then $p \mid m$ or $p \mid n$. Since $p \mid (m + n)$ or $p \mid (m - n)$ it follows that $p \mid m$ and $p \mid n$ which is a contradiction. If $p = 2$ and $p \nmid m$, $p \nmid n$ then $b$ is not even as was assumed.

In contrast to the various elementary proofs of (4) a proof using Galois 
theory will now be given. It is based on Hilbert’s Theorem 90 which concerns 
algebraic extension fields with a cyclic Galois group. This theorem is obtained 
nowadays as a special case of a theorem in Galois cohomology (see e.g. Jacob- 
son, Algebra III.) 

Let $F$ be a cyclic extension of a field $K$ of relative degree $l$. Let $S$ be a generator of the Galois group of $F$ over $K$. For any $\alpha \in F$ we write $\alpha^S$ for the image of $\alpha$ under the automorphism defined by $S$. We then have

\[ \operatorname{norm}_{F/K}(\alpha) = \operatorname{norm}_{F/K}(\alpha^S). \]

Hence by the multiplicativity of the norm we have

\[ \operatorname{norm}_{F/K}(\alpha/\alpha^S) = 1. \]

Hilbert's theorem states that, conversely, any element $\beta \in F$ with norm $\beta = 1$ is of the form $\beta = \alpha/\alpha^S$ for a suitable $\alpha \in F$.

Apply the theorem to the situation where $F$ is the extension obtained from the rational number field $Q$ by adjoining $\sqrt{-1}$, i.e., the set of elements $m + in$, $m, n \in Q$. This field is cyclic with respect to $Q$ and of degree 2. Further

\[ (m + in)^S = m - in \quad \text{and} \quad \operatorname{norm}(m + in) = m^2 + n^2. \]

Let then $a^2 + b^2 = c^2$ hold with $a$, $b$ and $c$ in $Z$, the ring of integers. This implies

\[ \operatorname{norm}\Bigl(\frac{a}{c} + i\,\frac{b}{c}\Bigr) = 1. \]

By Hilbert's theorem

\[ \frac{a}{c} + i\,\frac{b}{c} = \frac{m + in}{m - in} = \frac{(m + in)^2}{m^2 + n^2} \]

for some $m, n \in Z$. Comparing the real and imaginary parts (4) emerges.
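The last step, comparing real and imaginary parts of $(m + in)/(m - in) = (m + in)^2/(m^2 + n^2)$, can be made concrete with exact rationals. The helper name below is illustrative.

```python
from fractions import Fraction


def gaussian_quotient(m, n):
    """Real and imaginary parts of (m + in)/(m - in) = (m + in)^2 / (m^2 + n^2)."""
    norm = m * m + n * n
    return Fraction(m * m - n * n, norm), Fraction(2 * m * n, norm)
```

For $m = 2$, $n = 1$ the quotient is $3/5 + 4i/5$, an element of norm 1 corresponding to the triangle 3, 4, 5. In general the two parts are $(m^2 - n^2)/(m^2 + n^2)$ and $2mn/(m^2 + n^2)$, as in (4).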
We add three further comments to the study of Pythagorean triangles:

(1) The expressions (4) can be given another interpretation: It is clear that Pythagorean triangles have much in common with complex numbers. The product of two complex numbers $m_1 + in_1$, $m_2 + in_2$ is $m_1m_2 - n_1n_2 + i(m_1n_2 + m_2n_1)$. This associates two bilinear forms $m_1m_2 - n_1n_2$, $m_1n_2 + m_2n_1$, with the field of complex numbers. The corresponding quadratic forms

\[ m^2 - n^2, \qquad 2mn \]

are exactly the expressions (4).

Similarly, one can associate a set of $n$ bilinear (respectively quadratic) forms with any basis of an algebra. This idea is being investigated separately, particularly for algebraic number fields.

(2) The Pythagorean triangles form a group under a certain composition law: more precisely to every triangle $a$, $b$, $c$ consider the whole set $\lambda a$, $\lambda b$, $\lambda c$, with $\lambda = 1, 2, \ldots$, as an element of the group under consideration. Further identify all the four pairs $\pm a$, $\pm b$. We may therefore assume $a$, $b$ as coprime positive integers. Exactly one of these two integers is then even, because $a^2 + b^2 = c^2$ is a square, hence cannot be $\equiv 2 \pmod 4$. We will assume that $b$ is an even number.

Let $a_i$, $b_i$, $i = 1, 2$, be a pair which generate a Pythagorean triangle. Then it follows at once that

\[ A = a_1a_2 + b_1b_2, \qquad B = a_1b_2 - a_2b_1 \]

again generate a Pythagorean triangle. The set of triangles generated by $A$, $B$ is the "product" of $a_1$, $b_1$ and $a_2$, $b_2$. If $a_1 = a_2$, $b_1 = b_2$, we obtain

\[ A = a_1^2 + b_1^2, \qquad B = 0 \]

which is equivalent with $A = 1$, $B = 0$. We consider this as the unit element in our group. For, this element when composed with $a$, $b$ gives $a$, $-b$ which is equivalent with $a$, $b$. The associative law too follows for our composition if we again allow the above identification.
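The composition law can be tried out on small examples. The sketch assumes the law reads $A = a_1a_2 + b_1b_2$, $B = a_1b_2 - a_2b_1$, and models the identification of $(a, b)$ with $(\lambda a, \lambda b)$ and $(\pm a, \pm b)$ by a hypothetical `normalize` helper; none of these function names come from the text.

```python
from math import gcd, isqrt


def compose(p, q):
    """(a1, b1) * (a2, b2) = (a1 a2 + b1 b2, a1 b2 - a2 b1)."""
    (a1, b1), (a2, b2) = p, q
    return a1 * a2 + b1 * b2, a1 * b2 - a2 * b1


def normalize(p):
    """Representative of a pair under scaling and sign changes."""
    a, b = abs(p[0]), abs(p[1])
    g = gcd(a, b)
    return (a // g, b // g) if g else (0, 0)


def generates_triangle(p):
    """True when a^2 + b^2 is a perfect square."""
    s = p[0] ** 2 + p[1] ** 2
    return isqrt(s) ** 2 == s
```

Composing $(3, 4)$ with $(5, 12)$ gives $(63, 16)$, and $63^2 + 16^2 = 65^2$. Composing $(3, 4)$ with itself gives $(25, 0)$, which normalizes to the unit $(1, 0)$.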

(3) The two quadratic forms $f = x^2 - y^2$, $g = 2xy$ associated with Pythagorean triangles, have a special property. Let

\[ f_i = x_i^2 - y_i^2, \qquad g_i = 2x_iy_i, \qquad i = 1, 2. \]

Then the pair of forms

\[ f_1f_2 - g_1g_2, \qquad f_1g_2 + f_2g_1 \]

are again the same forms $f$, $g$, but applied to the indeterminates:

\[ x_1x_2 - y_1y_2, \qquad x_1y_2 + x_2y_1. \]

For $x_1 = x_2$, $y_1 = -y_2$ this gives the well-known relation $(x^2 - y^2)^2 + 4x^2y^2 = (x^2 + y^2)^2$.
The existence of these triangles made it desirable to know whether the equation

\[ (5) \qquad x^n + y^n = z^n \]

can be solved in integers $x$, $y$, $z$ for $n > 2$, apart from trivial cases. The not yet established statement that this is impossible is referred to as Fermat's last theorem. A similar question was raised concerning the relation (3), namely, do there exist two entire functions $f(z)$, $g(z)$, neither of them a constant, such that for some $n > 2$

\[ (6) \qquad (f(z))^n + (g(z))^n = 1. \]

A very brief proof was given by Iyer showing that such a pair does not exist. The equation (6) above is identical with

\[ \prod_{\zeta} \bigl(f(z) - \zeta g(z)\bigr) = 1 \]

when the product is taken over all solutions $\zeta$ of the equation $x^n + 1 = 0$. Since none of the $n$ factors can vanish the meromorphic function $f(z)/g(z)$ cannot assume any of the values of $\zeta$ which would be in contradiction with the 'big' Picard theorem.


3. Sums of squares in number theory. This is an enormous subject which 
can only be touched briefly here. Again a very old theorem comes to our mind 
immediately: every positive integer is a sum of four squares of integers. Then 
we have the characterizations of integers which are sums of three squares and 
the well-known fact that an integer is a sum of two squares if and only if its
square-free part is the product of primes $\equiv 1 \pmod 4$.

The quadratic form $x_1^2 + x_2^2 + x_3^2 + x_4^2$ is not the only form $\sum a_i x_i^2$, $a_i$ positive integers, which represents all positive integers. Forms of this nature are called universal and have been studied, e.g. by L. E. Dickson, Kloosterman, Linnik, Pall, and Ramanujan. Forms which represent all but one positive integer were examined by Halmos. Heilbronn showed that there exist four continuous functions $f_i(x)$ such that every rational $x$ is represented by $\sum f_i(x)^2$. To represent positive integers by more than four, in particular by many, squares has been studied too and the function giving the number of representations of the fixed integer $n$ as the sum of exactly $s$ squares has been of much interest for a long time. Recently Bateman pointed out that the function $f_s(n) = (2s)^{-1} r_s(n)$ is multiplicative precisely for $s = 1, 2, 4, 8$, where $r_s(n)$ is the number of representations of $n$ as a sum of $s$ integral squares.
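Bateman's observation can be explored numerically for small $s$ by counting representations directly. The sketch below takes $r_s(n)$ to count ordered $s$-tuples of integers, signs included, and $f_s(n) = r_s(n)/(2s)$. It is a brute-force illustration over a small range, with illustrative function names, and does not prove the statement.

```python
from fractions import Fraction
from itertools import product
from math import gcd, isqrt


def r(s, n):
    """Number of ordered s-tuples of integers whose squares sum to n."""
    bound = isqrt(n)
    values = range(-bound, bound + 1)
    return sum(1 for xs in product(values, repeat=s) if sum(x * x for x in xs) == n)


def f(s, n):
    """The normalisation f_s(n) = r_s(n) / (2s)."""
    return Fraction(r(s, n), 2 * s)


def multiplicative_up_to(s, limit):
    """Check f_s(ab) = f_s(a) f_s(b) for all coprime a, b with ab <= limit."""
    return all(f(s, a * b) == f(s, a) * f(s, b)
               for a in range(1, limit + 1)
               for b in range(1, limit // a + 1)
               if gcd(a, b) == 1)
```

For $s = 3$ the check already fails at $6 = 2 \cdot 3$: $f_3(2) = 2$ and $f_3(3) = 4/3$, whereas $f_3(6) = 4$.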

The problem of representing algebraic integers as sums of squares in the 
same field has been much studied, but will not be discussed here. 

Next we look at integral symmetric matrices, a subject not fully investigated 
so far. 

(The fact that a real symmetric matrix has real characteristic roots links 
their study with sums of squares, but this aspect will be discussed in Chapter 6.) 
Even the subject of rational symmetric matrices is not fully explored. For $2 \times 2$ matrices the characteristic roots of such a matrix must lie in a field $Q(\sqrt{m})$, $m$ a sum of two squares. This can be checked easily from the formula for the zeros of the (quadratic) characteristic polynomial. However, it can also be obtained as a special case of the following fact: Let $F$ be an extension of degree $n$ of the rationals and let it have a symmetric $n \times n$ $Q$-representation. Then this is equivalent with the existence of $n$ elements $\alpha_1, \ldots, \alpha_n$ in $F$ such that $\sum_i \alpha_i^{(r)} \alpha_i^{(s)} = 0$, $r, s = 1, \ldots, n$, $r \ne s$, where the upper suffixes denote the conjugate elements with respect to $Q$. First we show that this is necessary: Let $A$ be the $Q$-matrix which represents the primitive element $\alpha$. Since $A$ is symmetric its characteristic vectors are orthogonal. Since $A$ is a $Q$-matrix these vectors can be chosen in $Q(\alpha^{(i)})$, $i = 1, \ldots, n$, and as the conjugates of the vector corresponding to the characteristic root $\alpha$. Hence the components of this vector are of the form of the above $\alpha_1, \ldots, \alpha_n$. Next we show sufficiency: the matrix
a 
Gi) ae G), 
(a; ) (a; ) 
gyn) 


is rational and symmetric; here and later the prime indicates the transpose.
Hence we have obtained a rational symmetric representation of $Q(\alpha)$.

From the orthogonality it also follows that F is totally real. 

The case $n = 2$ gives an alternative proof for a fact mentioned above, for the two elements $\alpha_1$, $\alpha_2$ are expressible as

\[ \alpha_1 = a + b\sqrt{m}, \qquad \alpha_2 = c + d\sqrt{m}, \]

where $a$, $b$, $c$, $d$, $m$ are in $Q$ and $m$ is not a square in $Q$. Then

\[ a^2 - mb^2 + c^2 - md^2 = 0. \]

Hence

\[ a^2 + c^2 = m(b^2 + d^2). \]

Symmetric $n \times n$ matrices over the integers with a given characteristic polynomial of degree $n$, with integer coefficients and 1 as coefficient of $x^n$, can sometimes best be studied by looking first for general matrices, i.e. not necessarily symmetric ones. Such matrices fall into classes: two matrices being considered equivalent if they belong to the same integral unimodular similarity class. For irreducible polynomials this leads to an integral $n \times n$ representation of the ring generated by a zero of the polynomial and hence to a rational $n \times n$ representation for the algebraic number field generated by it. Let $A$ be a suitable matrix. Then $A'$, the transpose, is a matrix-zero of the same polynomial. Under special circumstances it can belong to the class of $A$. We then have

\[ A' = S^{-1}AS \]

when $S$ is integral and unimodular. If in addition $S$ is p.d. (positive definite) and even of the form $S = TT'$ with $T$ integral (this follows from p.d. for $n \le 7$ (see Chapter 4)), then

\[ T^{-1}AT = (T^{-1}AT)'. \]

Hence the class of $A$ contains a symmetric matrix and conversely. Faddeev,
Shapiro, Bender studied symmetric matrices over algebraic number fields with 
given characteristic polynomials. However, not all polynomials which can turn 
up for symmetric matrices have been characterized so far. 
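
The passage from $A' = S^{-1}AS$ with $S = TT'$ to a symmetric matrix in the class of $A$ can be checked on a small case. The following Python sketch is only an illustration: the matrices `T` and `B` are chosen here for convenience and do not come from the text, and exact rational arithmetic is used so that the comparisons are not affected by rounding.

```python
from fractions import Fraction

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(row) for row in zip(*X)]

def inverse_2x2(X):
    """Exact inverse of a 2x2 matrix with non-zero determinant."""
    (a, b), (c, d) = X
    det = Fraction(a * d - b * c)
    return [[d / det, -b / det], [-c / det, a / det]]

# Illustrative choices (assumptions, not taken from the text):
T = [[1, 2], [0, 1]]                      # integral, determinant 1
B = [[0, 1], [1, 1]]                      # integral and symmetric
A = matmul(matmul(T, B), inverse_2x2(T))  # a non-symmetric matrix in the class of B
S = matmul(T, transpose(T))               # S = TT' is p.d. and unimodular

# Check A' = S^{-1} A S, and that T^{-1} A T is symmetric (it is B again).
similar_to_transpose = transpose(A) == matmul(matmul(inverse_2x2(S), A), S)
symmetric_member = matmul(matmul(inverse_2x2(T), A), T)
print(similar_to_transpose, symmetric_member == B)
```

Here $A$ itself is not symmetric, yet its class contains the symmetric matrix $T^{-1}AT$.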

This whole chapter belongs partly to the theory of positive definite matrices. 
They are treated in the next chapter which also includes further number 
theoretic results. Also the similarity between a matrix and its transpose and the 
connection with symmetric matrices will turn up there again for the case of the 
real number field. 


4. Positive definite (p.d.) matrices. These are real symmetric matrices with 
positive characteristic roots. Again a very old fact is our starting point: a posi- 
tive definite quadratic form with real coefficients is a sum of squares of linear 
forms. Positive definite hermitian forms are expressible as $\sum_i l_i(x)\,\bar{l}_i(x)$, where
$l_i(x)$ is a linear form and $\bar{l}_i(x)$ is the form with the conjugate coefficients.

For the matrix itself this means in the real case that it can be factorized as
$AA'$ where $A'$ is the transpose of $A$ and in the complex case as $BB^*$ where
$B^* = \bar{B}'$ is the transposed complex conjugate.

One of the most important uses of p.d. matrices is to generalize facts in 
many different branches of mathematics, by replacing the identity matrix by 
a given p.d. matrix. We begin with two specific examples. 


(1) The orthogonal matrices. They leave $\sum x_i^2$ unchanged. This was
generalized in the study of the ‘automorphs’ of p.d. quadratic forms.


(2) The field of values of a complex $n \times n$ matrix $A$. It is the set of numbers in
the complex plane given by $x^*Ax/x^*x$, where $x \ne 0$ is an arbitrary complex
$n$-vector. Givens introduced the generalized field of values $x^*AHx/x^*Hx$ when
$H$ is a p.d. form. This was recently even extended to operators.

In differential geometry Riemannian geometry extends Euclidean geometry. 
In the subject of partial differential equations the theory of elliptic equations 
extends that of the Laplace equation. There are many examples in the calculus 
of variations. There are examples in number theory on all levels. 

We begin with the discussion of products of real symmetric matrices. It can 
be shown that every real matrix is the product of two symmetric matrices— 
more generally a matrix with elements in an arbitrary field F can be expressed 
as the product of two symmetric matrices in the same field. In the case of the 
reals: if one of the two symmetric factors is positive definite then the product is 
similar to a real diagonal. If both factors are p.d. then the product is similar to 
a p.d. diagonal. Connected with this is the following fact: while every matrix 
with elements in a field F is similar to its transpose via a symmetric matrix
over F, the latter can be chosen in the form AA’ (and therefore p.d. for the
reals) if the original matrix is similar to a symmetric matrix, (i.e. has real 
characteristic roots and is diagonalizable in the case of the reals). Real matrices 
with positive determinant can always be expressed as products of p.d. matrices, 
in fact only four factors are needed, unless the matrix is a negative scalar in 
which case five factors are required, in general. This was shown by Ballantine 
who also characterized products of three p.d. factors. Products of two p.d. 
matrices had been characterized by Taussky. 

Positive definite matrices can be employed to determine the signs of the 
real parts of the characteristic roots of a general matrix. This is an important 
practical problem. By a theorem of D. C. Lewis, Jr., one can find for any 
matrix $A$ with simple elementary divisors a p.d. hermitian $G$ such that the
roots $\lambda$ of $\det(GA + A^*G - 2\lambda G) = 0$ are the real parts of the characteristic
roots of $A$. Also there is the matrix version of Lyapunov’s stability criterion:

A matrix $A$ is stable if and only if a p.d. $G$ exists such that $AG + GA^*$ is
negative definite.
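
A minimal sketch of how such a certificate $G$ can be checked for a real $2 \times 2$ matrix follows. It assumes that "stable" means that all characteristic roots have negative real parts, and that in the real case $A^* = A'$; the matrices `A` and `G` are illustrative choices, not examples from the text.

```python
def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(row) for row in zip(*X)]

def lyapunov_expression(A, G):
    """Return AG + GA' for real square matrices A and G."""
    AG = matmul(A, G)
    GAt = matmul(G, transpose(A))
    n = len(A)
    return [[AG[i][j] + GAt[i][j] for j in range(n)] for i in range(n)]

def is_negative_definite_2x2(M):
    """A symmetric 2x2 matrix is negative definite iff M[0][0] < 0 and det M > 0."""
    symmetric = M[0][1] == M[1][0]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    return symmetric and M[0][0] < 0 and det > 0

A = [[-1, 2], [0, -3]]   # triangular, characteristic roots -1 and -3 (illustrative)
G = [[1, 0], [0, 1]]     # the identity matrix is p.d.
L = lyapunov_expression(A, G)
certified = is_negative_definite_2x2(L)
print(L, certified)
```

A failed check for one particular $G$ proves nothing; the criterion only asserts that some p.d. $G$ exists when $A$ is stable.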

This was generalized to give statements concerning the signs of the real 
parts of the characteristic roots. 

The fact that every real symmetric matrix can be transformed to diagonal
form by an orthogonal matrix can be generalized by saying that a pair of
symmetric matrices, one of which is p.d., can be transformed to diagonal form
simultaneously. This again is generalized by saying that a pair of symmetric
matrices $S_1$, $S_2$ which generate a pencil $\lambda S_1 + \mu S_2$ which contains a p.d.
matrix can be simultaneously diagonalized. Such pencils have been studied
recently and were linked up with the convex cone formed by the p.d. $n \times n$
matrices in the $n(n+1)/2$ dimensional space.

The cone of p.d. matrices H is invariant under the transformation 


AHA’ 


when $A$ is any non-singular matrix of the same dimension. Thus they form a
‘positivity domain’ like the positive vectors which are invariant under
transformation by a positive matrix (in this case even a non-negative ($\ge 0$) matrix).
The transformation defined by A on the linear space of symmetric matrices 
can be regarded as a ‘positive operator’ and the finite version of the Krein- 
Rutman theorem concerning such operators can be applied to it. From this it 
follows that the matrix which corresponds to the operator has a positive domi- 
nant characteristic root. The matrices H which are ‘characteristic vectors’ are 
of interest too. 

We now turn to consideration of number theory and we mention some facts 
concerning the factorization XX’ and the problem of representing integers by 
p.d. forms. 

Positive definite matrices play a big role in number theory. In particular,
unimodular integral matrices $A$ have been studied by Hermite and Minkowski
who showed that for $n \le 7$ every such matrix is of the form $BB'$, with $B$ an
integral $n \times n$ matrix. For $n = 8$ this is not any longer true as an example by
Korkine and Zolotareff shows. Mordell showed that there are two classes in
this case if we count two matrices $A$, $B$ as belonging to the same class if $B = SAS'$
where S is an integral and unimodular matrix. The number of classes is finite 
in all cases. 

A special case of p.d. unimodular matrices arises from the set of group
matrices for the finite group $G$ (a ring isomorphic with the integral group ring of
$G$). These matrices are of the form $(a_{rs^{-1}})$, where $r$, $s$ range over the elements
of $G$ in a fixed order. The unimodular ones correspond to the units in the group
ring. A factorization of the above type is possible only if $B$ is a group matrix for
the same group, times an integral matrix $P$ with $PP' = I$. For $n \le 7$ such a
factorization is possible always, for $n = 8$ there are again two classes. For $n = 9$,
however, there is only one class. These results were obtained by Taussky, M.
Newman, R. C. Thompson, M. Kneser, some still unpublished.

The results concerning the factorization of positive definite unimodular 
matrices have been extended to hermitian matrices and to matrices over 
quadratic number fields. For instance, any matrix

$$A = \begin{pmatrix} a & \alpha \\ \bar{\alpha} & b \end{pmatrix}$$

with $a$, $b$ positive integers, $\alpha = x + iy$ a Gaussian integer and $ab - \alpha\bar{\alpha} = 1$ can be
factorized into $BB^*$ with $B$ a square matrix of Gaussian integers. This result
leads to an alternative proof for Lagrange’s theorem on expressing a positive
integer $a$ as a sum of four squares. The following fact is needed: Given a number
$a$, integers $b > 0$, $x$, $y$ can be found such that

$$x^2 + y^2 = -1 + ab.$$


Starting with the positive integer a the matrix A is now determined. The 
factorization A =BB* then expresses a as a sum of four squares. (This was 
pointed out by M. Newman during a discussion with the author.) 
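
The auxiliary fact can be explored by direct search. The sketch below is a hypothetical helper, not part of the argument above: it looks for the smallest $b > 0$, and then the smallest $x$, with $x^2 + y^2 = ab - 1$, and confirms that the resulting matrix $A$ has determinant 1. The factorization $A = BB^*$ itself is not attempted here.

```python
from itertools import count
from math import isqrt

def find_b_x_y(a):
    """Search for integers b > 0, x, y with x^2 + y^2 = a*b - 1, for a positive integer a."""
    for b in count(1):
        target = a * b - 1
        for x in range(isqrt(target) + 1):
            y = isqrt(target - x * x)
            if x * x + y * y == target:
                return b, x, y

a = 7
b, x, y = find_b_x_y(a)
# Determinant of [[a, alpha], [conj(alpha), b]] with alpha = x + iy.
determinant = a * b - (x * x + y * y)
print(b, x, y, determinant)
```

The search terminates because, by the fact quoted above, a suitable $b$ always exists.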

Even if the p.d. integral matrix is not of the form XX’, X an integral 
square matrix, it may still be expressible in this form with X a rectangular 
matrix with more columns than rows. For the corresponding quadratic form 
this means that it is still expressible as a sum of squares of integral linear forms, 
but with more than $n$ terms. However, even this is not always possible. This
was studied by Mordell, Erdős, Chao Ko, Pall and Taussky.

Alternatively, the problem of expressing a matrix A in the form XBX’ has 
been studied in full generality by Siegel, where now A and B do not need to 
have the same dimension. Hence this includes the representation of numbers by 
p.d. quadratic forms. 

The extension of the problems to algebraic number fields led Siegel to a
conjecture which was established quite recently. It concerns ‘classes’ and
‘genera’ of forms. It is known that over the rationals the forms
$\sum_{i=1}^{n} x_i^2$, $n \le 8$, are in a genus of one class. Over totally real fields the
form $\sum_{i=1}^{3} x_i^2$ lies in a genus of one class only for $Q(\sqrt{2})$, $Q(\sqrt{5})$. This is
what Siegel had conjectured. After some initial progress by Dzewas it was
established by Barner.


5. Formally real fields. The congruence 

$$x^2 + y^2 \equiv -1 \pmod{p}, \qquad p \text{ any prime},$$


used in Chapter 4, can be used as a link with this theory since the integers 
modulo a prime form a field. A formally real field is characterized by the fact 
that —1 cannot be expressed as a sum of squares in that field. The fact that 
the above congruence can be solved for any p not only shows that the residues 
mod p do not form a formally real field (a fact obvious from the finite char- 
acteristic), but it gives the actual expression for —1. 
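
For small primes the congruence can be solved by exhaustive search. The following sketch (the function name is ours) returns one pair $x$, $y$ with $x^2 + y^2 \equiv -1 \pmod{p}$ and so exhibits $-1$ as a sum of two squares in the field of residues modulo $p$.

```python
def minus_one_as_two_squares(p):
    """Return (x, y) with x^2 + y^2 = -1 modulo the prime p, found by exhaustive search."""
    for x in range(p):
        for y in range(x, p):
            if (x * x + y * y + 1) % p == 0:
                return x, y
    raise ValueError("no solution found; p is assumed to be prime")

solutions = {p: minus_one_as_two_squares(p) for p in (2, 3, 5, 7, 11, 13)}
print(solutions)
```

A quadratic search of this kind is adequate only for small $p$.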

If the field is not formally real and if its characteristic $\ne 2$ then every element
in the field is a sum of squares.

If $-1$ is a sum of squares then it is of interest to study the minimum number
of terms in the representation of $-1$ for various fields $F$. It is easy to see that 3
cannot occur as a minimum. For, assume that 3 is the minimum and let

$$-1 = x_1^2 + x_2^2 + x_3^2, \qquad x_i \in F, \quad x_i \ne 0.$$

This implies

$$0 = x_0^2 + x_1^2 + x_2^2 + x_3^2, \qquad x_i \in F.$$

Hence

$$\begin{aligned}
0 &= (x_0^2 + x_1^2)^2 + (x_0^2 + x_1^2)(x_2^2 + x_3^2) \\
  &= (x_0^2 + x_1^2)^2 + (x_0x_2 - x_1x_3)^2 + (x_0x_3 + x_1x_2)^2,
\end{aligned}$$

so that transferring $(x_0^2 + x_1^2)^2$ to one side and dividing across by it (using the
fact that $x_0^2 + x_1^2 \ne 0$ by assumption) we get a representation of $-1$ as a sum of
two squares. This contradicts our assertion. This proof depends on the
multiplication of complex numbers and their norms. A similar proof using the
multiplication of quaternions and the multiplicativity of their norms shows
that 5, 6, 7 cannot occur as a minimum. The same idea, using Cayley numbers,
shows that the numbers $9, \dots, 15$ cannot occur as a minimum. The question
concerning the possible minima had been raised by van der Waerden in 1932
and was settled only quite recently by Pfister. He showed that only powers of
2 can occur as a minimum and that every such power does occur for some field.
Pfister uses results of Cassels on quadratic forms for his proof. The relevant
theorems are:

Let $F$ be a field of characteristic $\ne 2$. Let $d \in F$ and $x$ be an indeterminate.
Necessary and sufficient for $x^2 + d$ to be a sum of $n > 1$ squares in $F(x)$ is that either
$-1$ or $d$ is a sum of $n - 1$ squares in $F$.

Let $R$ be the field of real numbers and let $x_1, \dots, x_n$ be indeterminates over $R$.
Then $x_1^2 + \cdots + x_n^2$ is not a sum of $n - 1$ squares in $R(x_1, \dots, x_n)$.


As the cases 1, 2, 4, 8 show, the result is connected with the composition of 
sums of squares which will be discussed in the next chapter. 

By defining ‘positive’ in real fields as ‘sums of squares’ an ordering can be 
introduced in such fields. 

Symmetric matrices over formally real fields have been studied. The set of 
all their characteristic roots form a field which is real closed. Krakowski and 
recently also Bender studied symmetric matrices over arbitrary fields with 
given minimum polynomial. 

An application of formally real fields appeared in a very unexpected
connection: R. C. Thompson proved (generalizing a theorem by Shoda obtained for
algebraically closed fields) that, with the exception of certain $2 \times 2$ matrices
over GF(2), every unimodular matrix $A$ with elements in a field $F$ is a
commutator $B^{-1}C^{-1}BC$ in $F$. He also studied the question: when can the factors $B$,
$C$ themselves be chosen unimodular? For the case that $A$ is a scalar matrix, this
depends, among other things, on whether $-1$ is a sum of two squares in $F$.
Later Thompson examined the case when the factors $B$, $C$ have given
determinants $b$, $c$ and then the representation of $-1$ in the form $bx^2 + cy^2$ becomes
critical.


6. Composition of sums of squares, anticommuting matrices, composition
algebras. Hurwitz showed that $n = 1, 2, 4, 8$ are the only values of $n$ for which
identities of the following type hold:

$$\sum_{i=1}^{n} x_i^2 \; \sum_{i=1}^{n} y_i^2 = \sum_{i=1}^{n} [l_i(x, y)]^2$$

where $l_i$ are bilinear forms in the $x_i$, $y_i$. Pfister’s results concerning $-1$ as a sum
of squares are linked with an extension of this problem: he allows the $l_i(x, y)$
to be rational functions. In this way he obtains an identity for any $n$ which is
a power of 2. For $n = 8$ such an identity had been obtained independently by
Taussky by a different method and this result was extended to $n = 16$ by
Eichhorn and Zassenhaus.

The usual way to obtain the above mentioned identities is from the product
$\alpha\beta$ of two complex numbers, respectively quaternions, or Cayley numbers, and
using the fact that norm $\alpha\beta$ = norm $\alpha$ norm $\beta$ in all these cases. The identities
found by Taussky and Eichhorn and Zassenhaus were, however, derived for
$n = 2$ from the reals, for $n = 4$ from the complex field, for $n = 8$ from quaternions,
for $n = 16$ from Cayley numbers. The method is based on the generalization of
the relation $\det X \det X^* = \det XX^*$ when $X^*$ means the transpose conjugate.

The identities are special cases of Gauss’ concept of composition of quadratic
forms: two $n$-ary quadratic forms $f(x_i)$, $g(x_i)$ are said to permit composition
if the product $fg$ can again be expressed as a quadratic form under a bilinear
transformation of the indeterminates.
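
The cases $n = 2$ and $n = 4$ of the composition identity can be tested numerically from the products of complex numbers and quaternions. The sketch below uses the standard Hamilton product; the function names and the random integer test data are our own choices.

```python
import random

def complex_product(x, y):
    """Coordinates of the product of the complex numbers x and y."""
    a, b = x
    c, d = y
    return (a * c - b * d, a * d + b * c)

def quaternion_product(x, y):
    """Coordinates (1, i, j, k) of the Hamilton product of the quaternions x and y."""
    a1, b1, c1, d1 = x
    a2, b2, c2, d2 = y
    return (a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,
            a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,
            a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,
            a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2)

def norm(v):
    """Sum of the squares of the coordinates."""
    return sum(c * c for c in v)

def identity_holds(product, n, trials=200, seed=0):
    """Check norm(x) * norm(y) == norm(xy) on random integer vectors of length n."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = [rng.randint(-9, 9) for _ in range(n)]
        y = [rng.randint(-9, 9) for _ in range(n)]
        if norm(x) * norm(y) != norm(product(x, y)):
            return False
    return True

two_square_ok = identity_holds(complex_product, 2)
four_square_ok = identity_holds(quaternion_product, 4)
print(two_square_ok, four_square_ok)
```

Each coordinate of the product is a bilinear form in the coordinates of the factors, so these are identities of the type considered by Hurwitz.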

Hurwitz’ proof is based on matrix theory and leads to the enumeration of 
skew symmetric $n \times n$ matrix pairs which are anticommuting. Such pairs are of
interest in many connections and will be mentioned again in Chapter 7. Using 
an idea of Jordan, von Neumann, and Wigner the theory of group representa- 
tions was employed by Eckmann (after he replaced Hurwitz’ matrices by ab- 
stract elements and —1 by an element of order 2) to give a proof of Hurwitz’ 
theorem. 

Freudenthal uses a projective geometry over the field of two elements and 
Desargues’ theorem to prove Hurwitz’ theorem and Chevalley uses Clifford 
algebras. An account of this and connected ideas is in a paper by van der Blij.

Later Albert, Kaplansky, and Jacobson imbedded the problem into the 
study of composition algebras. We start by defining a normed algebra. Let 
$e_1, \dots, e_n$ be a basis for the algebra. Let

$$a = a_1e_1 + \cdots + a_ne_n$$

be an element of the algebra. Define

$$\operatorname{norm} a = |a| = \sum a_i^2.$$

The algebra will be called normed, if

$$|ab| = |a|\,|b|.$$


If the algebra is over the reals, has an identity and is normed then it can be 
shown that it is either the reals, the complex field, the quaternions or the 
Cayley numbers. This gives a proof for the Hurwitz theorem. Jacobson treated 
a more general situation. He starts with a quadratic form N(x) defined on a 
vector space $V$ over a field of characteristic $\ne 2$. He assumes that $N(x)$ permits
composition, i.e., there exists a bilinear composition $xy$ in $V$ such that

$$N(x)N(y) = N(xy), \qquad x, y \in V.$$

The product $xy$ makes an algebra out of $V$ which is then called a composition
algebra. However, Albert had shown that forms of dimension $2^n$ permit
composition even for fields of characteristic 2.

The Hilbert identities

$$\Bigl(\sum_{i=1}^{5} x_i^2\Bigr)^{n} = \sum_{r} \rho_r\,(a_{1r}x_1 + \cdots + a_{5r}x_5)^{2n}$$

with $\rho_r$ positive rationals and $a_{ir}$ integers were used in Hilbert’s original solution
of the Waring problem, i.e., the proof of the following assertion: every positive 
integer is a sum of $n$-th powers of integers and the number of these is determined
solely by $n$. A simpler proof of the identities goes back to Hausdorff.
They have the flavour of a composition identity and could possibly be
generalized for products

$$\sum x_i^2 \; \sum y_i^2.$$


The associative algebras among the above-mentioned four algebras over the 
reals are characterized by other properties, e.g., Pontryagin obtains them from
topological properties of fields in which addition and multiplication are con- 
tinuous functions under some topology. The Gelfand-Mazur theorem states 
that they are the only normed algebras which are also fields. They will be dis- 
cussed further in the next chapter. 


7. Division algebras over the reals, n-dimensional spheres, n-dimensional 
Laplace differential equations. Frobenius proved that the reals, the complex 
numbers and the quaternions are the only division algebras over the reals if 
commutativity is not required, but associativity retained. In spite of arduous 
attempts there is still no algebraic proof available for the fact that n=1, 2, 4, 8 
are the only numbers of base elements for which real division algebras exist, if 
associativity is not required any longer. The only proofs available so far rely 
heavily on deep algebraic topology. This approach was initiated by H. Hopf and 
continued by Stiefel and others with a final break-through by Bott, Kervaire, 
Milnor, Adams. 

The norm in the case of the best known division algebras, the complex 
numbers, quaternions, Cayley numbers, is the sum of the squares of the coor- 
dinates. 

The role of these hypercomplex systems in almost any part of mathematics 
is well known. Quaternions have applications in most abstract parts of math- 
ematics, but also in concrete ones from where they stem. They were introduced 
by Hamilton and are of great use in theoretical physics, as well as in ring theory, 
group theory and number theory. If the coordinates are permitted to be com- 
plex then the system is no longer a division algebra, but it has other applica- 
tions. Generalized quaternions are in much use too, they are again algebras with 
four base elements, but the role of —1 in the products is taken over by other 
scalars. 

In abstract group theory the quaternions are linked with the quaternion 
group which is the hamiltonian group of lowest order. A hamiltonian group is 
a non-abelian group, all of whose subgroups are normal. The real group algebra 
of the quaternion group is homomorphic with the real quaternions. 

The norms of these hypercomplex systems link them immediately with the 
n-dimensional spheres. In particular the fact that no division algebra over the 
reals with three base elements exists is connected with the fact that the
3-dimensional sphere, the set of triples $x_1$, $x_2$, $x_3$ with $x_1^2 + x_2^2 + x_3^2 = 1$, cannot be
made into a topological group. The latter result follows easily from Brouwer’s
fixed point theorem. E. Cartan proved that the $n$-dimensional sphere is a group
space only for $n = 1, 2, 4$. It was the study of the topological properties of the higher
dimensional spheres which led to the success in the investigation of the real 
division algebras. That the associative case follows easily from Cartan’s result 
was noticed by Taussky. Recently a new proof of Cartan’s result was given by 
M. Curtis and Dugundji. An earlier proof came from Samelson. 

The ring $R[x_1, \dots, x_n]$ of polynomials in $n$ variables with real coefficients
‘modulo the unit sphere’, i.e. the ring

$$R[x_1, \dots, x_n]/(x_1^2 + \cdots + x_n^2 - 1)$$

has been studied in several connections:

Swan showed that a ‘unimodular’ vector $(x_1, \dots, x_n)$ over this ring cannot
always be completed to a unimodular matrix, unless $n = 1, 2, 4$, or 8.

Estes and Butts showed that for n=3 composition of quadratic forms is not 
possible in this ring. 

Another subject connected with the classical division algebras is the Laplace
differential equation

$$\Bigl(\frac{\partial^2}{\partial x_1^2} + \frac{\partial^2}{\partial x_2^2} + \cdots + \frac{\partial^2}{\partial x_n^2}\Bigr) u = 0.$$

For $n = 2$ the Laplace equations for two functions of two real variables are a
consequence of the Cauchy-Riemann equations. The question was raised (and
answered) by Taussky as to whether an analogous situation exists for other
values of $n$. It turns out this can happen only for $n = 2, 4, 8$ and in fact it is a
rather simple consequence of the result concerning the division algebras over
the reals. However, a purely algebraic proof was obtained by Stiefel
quently. The 4-dimensional case leads to a set of generalized Cauchy-Riemann 
equations which were much studied by Fueter and his school for the purpose of 
generalizing complex function theory. In slightly changed form they appear in 
theoretical physics as the Dirac equations. 

Eichhorn, who had previously contributed to the study of generalized
Cauchy-Riemann equations, considered the following more general problem
recently: Let $X$ be a vector space over a field $F$ of characteristic $\ne 2$. Let $x \in X$.
Consider linear mappings $L(x)$ of $X$ into Hom$(X, X)$ such that another such
mapping $M(x)$ exists so that

$$M(x)L(x) = \mu(x)I$$

when $I$ is the identity mapping and $\mu(x) \ne 0$ is a mapping of $X$ into $F$. The
function $\mu(x)$ can be interpreted as a quadratic form and if this form is positive
definite and of rank $n$ and $X$ an $n$-dimensional vector space we are back in
the problem of the generalized Cauchy-Riemann equations. However, it is now
shown that for any form, whether p.d. or not, as long as it has full rank $n$, the
only possible values of $n$ are 1, 2, 4, 8. But also the case of lower rank $r < n$ is
considered. For $n = p2^q$, $p$ odd, Eichhorn obtains $r \le 2q + 2$. The results are
based on the following result: Let $n = p2^q$, $p$ odd. Let $F$ be an arbitrary field of
characteristic $\ne 2$. Let $A_i$, $i = 1, \dots, r - 1$, be a set of $n \times n$ matrices with
elements in $F$ with the properties: $A_i^2 = a_iI$, $a_i \in F$, $a_i \ne 0$ and $A_iA_k + A_kA_i = 0$,
$i \ne k$. Then $r \le 2q + 2$.

Connected with this fact is a result by Adams, Lax and Phillips: Let
$A_1, \dots, A_k$ be a set of real $n \times n$ matrices such that $\sum \lambda_iA_i$ is non-singular for
all real $\lambda_i$, except $\lambda_1 = \cdots = \lambda_k = 0$. Let $n = 2^{b+4c}(2a+1)$, $0 \le b \le 3$. Then
$k \le 2^b + 8c$.
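
The bound in the result of Adams, Lax and Phillips is easy to tabulate. The sketch below assumes the reading $n = 2^{b+4c}(2a+1)$ with $0 \le b \le 3$ and bound $2^b + 8c$; the function name is ours.

```python
def adams_lax_phillips_bound(n):
    """Return 2^b + 8c, where n = 2^(b+4c) * (2a+1) and 0 <= b <= 3."""
    exponent = 0
    while n % 2 == 0:       # strip the power of 2 from n
        n //= 2
        exponent += 1
    c, b = divmod(exponent, 4)
    return 2 ** b + 8 * c

bounds = {n: adams_lax_phillips_bound(n) for n in (1, 2, 3, 4, 8, 16, 32)}
print(bounds)
```

For $n = 1, 2, 4, 8$ the bound equals $n$ itself, and for every other $n$ it is smaller than $n$.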


Anticommuting matrices are much connected with the various problems 
studied in this and the preceding chapter. 

Dieudonné studied the following generalization of results by Eddington and
by M. H. A. Newman. Let $F$ be a not necessarily commutative field of
characteristic $\ne 2$, $V$ an $n$-dimensional right vector space over $F$, let $\sigma$ be an
automorphism of $F$ and $\gamma$ an element of $F$ such that $\gamma^{\sigma} = \gamma$ and $\xi^{\sigma^2} = \gamma^{-1}\xi\gamma$
for $\xi \in F$. Determine the maximal number of semi-linear transformations
$u_k$ of $V$, relative to the automorphism $\sigma$, satisfying the relations

$$u_k^2(x) = x\gamma \quad \text{for } x \in V, \ \forall k,$$
$$u_hu_k = -u_ku_h, \qquad h \ne k.$$


Quaternions and Cayley numbers have many applications in number theory. 
Lipschitz had studied the ring of quaternions with rational integral coordinates, 
but Hurwitz later noticed that they do not form a maximal order. By order we 
understand a subring containing 1 and a basis for the algebra. He constructed 
the following basis for the maximal order: $(1 + i + j + k)/2$, $i$, $j$, $k$. He then
studied the factorization of rational primes in this ring. Similar problems have been
studied for Cayley numbers by various authors, e.g. Coxeter, Lamont, Linnik, 
Mahler, Pall, Pall and Taussky, and Rankin. 

If $e_1, \dots, e_n$ ($n = 4$, respectively 8) is a basis for an order and $x_1, \dots, x_n$
indeterminates then

$$(e_1x_1 + \cdots + e_nx_n)(\bar{e}_1x_1 + \cdots + \bar{e}_nx_n)$$

is the norm form of the order if $\bar{e}_i$ is the conjugate of $e_i$. The problems associated
with these forms can then be studied via the associated orders. Here the work of 
Brandt was basic and recently Kaplansky, Estes and Pall have made con- 
tributions. 


8. Positive definite polynomials. This chapter is closely linked with the 
earlier chapters on number theory and on formally real fields. A positive ra- 
tional number can be expressed as a sum of squares and Hilbert asked whether 
a real positive definite polynomial, i.e., one which assumes positive values 
only, can be expressed as a sum of squares of polynomials or, failing this, 
whether it can be expressed as a sum of squares of rational functions. He gave 
an example of a positive definite polynomial which cannot be expressed as a 
sum of squares of polynomials. Recently Motzkin gave a rather simple example
of such a polynomial, namely $(x_1^2 + x_2^2 - 3x_3^2)x_1^2x_2^2 + x_3^6$. He found this in
connection with a study of inequalities in which he expresses the difference of the two
sides as sums of squares. Other examples were found recently by R. Robinson:

$$x^2(x^2 - 1)^2 + y^2(y^2 - 1)^2 - (x^2 - 1)(y^2 - 1)(x^2 + y^2 - 1),$$
$$x^2(x - 1)^2 + y^2(y - 1)^2 + z^2(z - 1)^2 + 2xyz(x + y + z - 2).$$
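
The Motzkin polynomial can be sampled exactly on a grid of rational points. The sketch below takes the polynomial in the form $(x_1^2 + x_2^2 - 3x_3^2)x_1^2x_2^2 + x_3^6$; the grid is an arbitrary choice of ours. Such sampling illustrates that no negative value occurs and that the value 0 is attained, but it proves nothing and says nothing about representations as sums of squares.

```python
from fractions import Fraction
from itertools import product

def motzkin(x1, x2, x3):
    """Value of (x1^2 + x2^2 - 3 x3^2) x1^2 x2^2 + x3^6."""
    return (x1 ** 2 + x2 ** 2 - 3 * x3 ** 2) * x1 ** 2 * x2 ** 2 + x3 ** 6

# Half-integer grid on [-3, 3]^3, evaluated in exact rational arithmetic.
grid = [Fraction(k, 2) for k in range(-6, 7)]
smallest_value = min(motzkin(a, b, c) for a, b, c in product(grid, repeat=3))
print(smallest_value, motzkin(1, 1, 1))
```

The zero at $(1, 1, 1)$ reflects equality in the inequality between the arithmetic and geometric means of $x_1^4x_2^2$, $x_1^2x_2^4$, $x_3^6$.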


Artin solved Hilbert’s question completely, showing that an expression by 
rational functions does indeed exist. Recently the subject was reactivated by 
asking for a quantitative result, namely the minimum number of terms and 
explicit representations. It was shown by Pfister that a definite rational
function of $n$ variables in a real closed field is a sum of $2^n$ squares in this field. This is
only an upper bound, but for $n = 2$ a smaller number will not suffice. Quite
recently Cassels, Ellison and Pfister showed that the Motzkin polynomial is
not a sum of 3 squares. The lower bound $n + 1$ follows from Cassels’ Theorem
showing that $1 + x_1^2 + \cdots + x_n^2$ is not a sum of $n$ squares. Ax had shown earlier
that Artin’s own work can be used to imply the bound $2^n$ if a further condition
is fulfilled which he showed was in fact true for $n = 3$.

For polynomials with integral coefficients the following facts have been
studied: If the polynomial $f(x)$ assumes square values for all integral arguments
then $f(x)$ is itself the square of an integral polynomial. A much deeper result
was obtained in a diophantine formulation by Siegel: An integral polynomial,
not a square, can attain a square value for a finite number of integral values
only. Recently LeVeque generalized the question: if $f(x)$ assumes only values
which are sums of two squares, is $f(x)$ a sum of two squared polynomials?
This was answered affirmatively by several authors.


9. Sums of squares in Galois theory. The converse problem of Galois theory, 
namely to find a normal algebraic extension with a given Galois group, leads 
in some special cases connected with the number 2 to sums of squares. The two 
following cases will be discussed: 

(i) The cyclic group of order 4. 

(ii) The quaternion group. 

Let $k$ be a given field and $F$ a separable normal extension. In case (i) there
exists exactly one quadratic field $F_0$ between $k$ and $F$. This field is generated by
the square root of an element $\mu \in k$. It can be shown that $\mu$ is a sum of two
squares in $k$. Conversely, any sum of two squares occurs in this connection.
There are various proofs for this. If $\mu$ is a sum of squares in $k$, say $\mu = \mu_1^2 + \mu_2^2$,
$\mu_i \in k$, then $F_0 = k(\sqrt{\mu_1^2 + \mu_2^2})$ has the property that $-1 = \operatorname{norm}_{F_0/k}(\rho)$ where
$\rho \in F_0$. For, every element in $F_0$ is of the form $\alpha + \beta\sqrt{\mu_1^2 + \mu_2^2}$, $\alpha, \beta \in k$, and the
norm of this element is $\alpha^2 - \beta^2(\mu_1^2 + \mu_2^2)$. This can be made equal to $-1$ for
$\alpha^2 = \mu_1^2/\mu_2^2$ and $\beta^2 = 1/\mu_2^2$. Conversely, if $-1 = \operatorname{norm}_{F_0/k}(\rho)$, $\rho \in F_0$, then $F_0$ is
generated by the square root of a sum of two squares. For $k = Q$ it can happen
that $-1$ is even the norm of a unit.
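
The norm computation can be checked with exact rational arithmetic. In the sketch below the field $k$ is taken to be $Q$ and the values of $\mu_1$, $\mu_2$ are arbitrary illustrative choices with $\mu_2 \ne 0$; an element $\alpha + \beta\sqrt{\mu}$ is represented by the pair $(\alpha, \beta)$.

```python
from fractions import Fraction

def norm(alpha, beta, mu):
    """Norm of alpha + beta*sqrt(mu) over the ground field: alpha^2 - beta^2 * mu."""
    return alpha ** 2 - beta ** 2 * mu

mu1, mu2 = Fraction(3), Fraction(5)   # illustrative values, mu2 must be non-zero
mu = mu1 ** 2 + mu2 ** 2              # mu is a sum of two squares
alpha, beta = mu1 / mu2, 1 / mu2      # the choice alpha^2 = mu1^2/mu2^2, beta^2 = 1/mu2^2
value = norm(alpha, beta, mu)
print(value)
```

With this choice the norm is $(\mu_1^2 - \mu_1^2 - \mu_2^2)/\mu_2^2 = -1$ whatever the values of $\mu_1$, $\mu_2$.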

In case (ii) the field $F$ contains three quadratic fields $F_1$, $F_2$, $F_3$ between $k$
and $F$. Let $F_i$ be generated by $\sqrt{\mu_i}$, $\mu_i \in k$. Then $\mu_1\mu_2 = \mu^2\mu_3$ where $\mu \in k$. It can
be shown by elementary field theory that each $\mu_i$ can be represented as a sum
of three squares in $k$. However, two expressions which are sums of three squares
do not in general have a product with the same property. Hence $\mu_1$, $\mu_2$ cannot
be arbitrary sums of three squares. Pairs of sums of three integer squares whose
product is of the same type can be characterized easily from the known
characterization of such integers. Not all such pairs, however, qualify for quaternion
fields. A parametric representation for $\mu_1$, $\mu_2$ was given by G. Bucht. The ‘sum
of three squares’ character of such a representation will now be explained (The
following treatment is due to Cassels arising out of a discussion with the
author.):

The elements $\mu_1$, $\mu_2$ can be obtained as:


$$\mu_1 = \frac{u^2l^2 + s^2u^2 + s^2m^2}{l^2v^2 + l^2r^2 + s^2v^2} = \frac{u^2(l^2 + s^2) + s^2m^2}{v^2(l^2 + s^2) + l^2r^2},$$

$$\mu_2 = \frac{u^2r^2 + m^2r^2 + m^2v^2}{l^2r^2 + l^2v^2 + s^2v^2} = \frac{m^2(r^2 + v^2) + u^2r^2}{l^2(r^2 + v^2) + s^2v^2}.$$

It can be shown, conversely, that any pair of elements $\mu_1, \mu_2 \in k$ for which
these relations hold have the property that $k(\sqrt{\mu_1}, \sqrt{\mu_2})$ can be extended to a
quaternion field. That $\mu_1$ is a sum of three squares follows from the fact that
both denominator and numerator are norms in the field generated by $i\sqrt{l^2 + s^2}$,
hence their quotient is again such a norm, hence clearly a sum of 3 squares. An
analogous fact is true for $\mu_2$. Next we show that $\mu_1\mu_2$ is a sum of three squares.
Since $\mu_1$, $\mu_2$ have the same denominator we need only worry about the
numerators. Both of them are norms of the field generated by $i\sqrt{u^2 + m^2}$; hence their
product is a norm in this field. Finally, we give an example showing that not
every set $\mu_1$, $\mu_2$, $\mu_1\mu_2$, all sums of three squares, comes in question. Take
$\mu_1 = 3 \cdot 73$, $\mu_2 = 3 \cdot 37$. Then

$$\mu_1v^2 + \mu_2l^2 = u^2 + m^2,$$

hence

$$(*) \qquad 3(73v^2 + 37l^2) = u^2 + m^2.$$

This implies that $u \equiv m \equiv 0 \ (3)$, hence $v^2 + l^2 \equiv 0 \ (3)$, hence $v \equiv l \equiv 0 \ (3)$. Remove a
factor 3 from $u$, $m$, $v$, $l$ in $(*)$ and repeat the process. This will finally lead to a
contradiction.

A proof for the fact that the $\mu_i$ are sums of three squares by non-elementary
methods was given by Reichardt.


10. Rational arctangents.* 
We now return to a problem concerning the sums of two squares, which, in 
principle, goes back to Gauss. 


* This chapter was written by John Todd.

In elementary trigonometry one encounters relations of the following form:

$$\arctan 239 = 4\arctan 5 - 5\pi/4,$$
$$\arctan 99 = \arctan 12 - 2\arctan 5 - \arctan 2 + 5\pi/4.$$

We take up the question of generating all such relations or rather a basis for
them. (One of the reasons for the study of those relations is to find convenient
methods of calculating $\pi$.)


We shall write $(x)$ for that value of $\arctan x$ between 0 and $\tfrac{1}{2}\pi$, so that, in
particular, $(1) = \pi/4$. We ask, to begin with, can we have relations of the form

$$(2) = r(1),$$
$$(3) = s(1) + t(2),$$

where $r$, $s$, $t$ are integers?


Suppose the first relation holds. Then the complex numbers $1 + 2i$ and $(1 + i)^r$
necessarily have the same argument so that their ratio

$$(1 + 2i)/(1 + i)^r$$

is necessarily real. Since the real part of the denominator is an integer, $m$ say,
it follows that this ratio must be $1/m$. If we take the squares of the absolute
values of each side of the equation

$$m(1 + 2i) = (1 + i)^r$$

we obtain the equation

$$5m^2 = 2^r$$

which manifestly has no solutions.

Similar considerations applied to the second relation lead to the equation

$$10m^2 = 2^s5^t$$

which has a solution $s = 3$, $t = 1$, $m = 2$. This gives us the relation

$$(3) = 3(1) - (2).$$
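
Both steps can be checked with exact integer arithmetic on Gaussian integers. In the sketch below a Gaussian integer $a + bi$ is stored as the pair `(a, b)`; representing the term $-(2)$ by the conjugate factor $1 - 2i$ is our reading of the argument, and the helper names are ours.

```python
def gauss_mul(z, w):
    """Product of the Gaussian integers z = (a, b) and w = (c, d)."""
    a, b = z
    c, d = w
    return (a * c - b * d, a * d + b * c)

def gauss_pow(z, n):
    """n-th power of the Gaussian integer z, for n >= 0."""
    result = (1, 0)
    for _ in range(n):
        result = gauss_mul(result, z)
    return result

# (1 + i)^3 (1 - 2i) has the argument 3(1) - (2); it should be a positive
# real multiple of 1 + 3i.
product = gauss_mul(gauss_pow((1, 1), 3), (1, -2))

# 5 m^2 = 2^r is impossible since 5 never divides a power of 2.
no_first_relation = all(pow(2, r, 5) != 0 for r in range(200))
print(product, no_first_relation)
```

The product is $2 + 6i = 2(1 + 3i)$, in agreement with $m = 2$.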


We now introduce the formal definition: (7) is called reducible if it can be 
expressed in the form 


(m) = Do fr (me), 


where the n, are positive integers less than » and the f, are integers (it can be 
shown that no change occurs if we allow the f, to be rational) ; if no such relation 
exists we call (7) irreducible. Thus (2) is irreducible, while (3) is reducible. We 
find that (4), (5), (6) are irreducible but that 


(7) = — (1) + 2(2) 
(8) = 5(1) — (2) — @). 
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Consideration of these examples suggested the following theorem: 


THEOREM. A condition necessary and sufficient for the reducibility of $(n)$ is
that the largest prime factor $l(n)$ of $1 + n^2$ should be less than $2n$.


We can verify this in the early cases quoted: 


n    1+n^2   l(n)        2n
2       5      5    >     4
3      10      5    <     6
4      17     17    >     8
5      26     13    >    10
6      37     37    >    12
7      50      5    <    14
8      65     13    <    16
9      82     41    >    18
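The table can be regenerated directly from the criterion in the theorem. This is a small sketch using trial division; the function names are illustrative, and the code only tests the criterion $l(n) < 2n$, it does not carry out the reduction itself.

```python
def largest_prime_factor(m: int) -> int:
    """Largest prime factor of m >= 2, found by trial division."""
    largest = 1
    p = 2
    while p * p <= m:
        while m % p == 0:
            largest = p
            m //= p
        p += 1
    # Whatever remains above 1 is a prime larger than every factor removed.
    return m if m > 1 else largest


def is_reducible(n: int) -> bool:
    """Criterion of the theorem: (n) is reducible iff l(n) < 2n."""
    return largest_prime_factor(1 + n * n) < 2 * n


for n in range(2, 10):
    print(n, 1 + n * n, largest_prime_factor(1 + n * n), 2 * n, is_reducible(n))
```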


This theorem can be established constructively by elementary methods: an
algorithm for carrying out the reduction of $(n)$ when it is possible can be given,
granted that the factorization of $1 + r^2$ is known for $r \le n$. A listing of the
reductions of $(n)$ for $n \le 2089$ is available.

The irreducible $(n)$ are analogous to the ordinary prime numbers and it is
possible to ask whether there are theorems about them similar to theorems
about prime numbers. In the first place there is an analog of Euclid’s theorem:
there is an infinite number of irreducible arctangents. We can also ask whether
there is an infinite number of reducible arctangents; this is also true. Both
these results are elementary. Gauss conjectured that the number of ordinary
prime numbers $< n$ is approximately $n/\log n$ and this “Prime Number Theorem”
was proved much later. Observation of the density of the irreducible arctangents
suggests that their density is $\log 2 = .6931$; some theoretical evidence in support
of this is available, but the result seems very difficult to prove.

It can also be shown that the arctangent of any rational number can be ex- 
pressed in terms of the irreducible integral arctangents. For instance 


(100/17) = (6) + (290) — (4836). 


This work was carried out (in part) under NSF grant 3909 and 11236. The author is indebted 
for some references to A. A. Albert, D. Estes, I. Kaplansky, G. Pall, J. Robinson. 

This is an account of part of a course with the same title given at California Institute of Tech- 
nology in 1966. It is an extended version of a lecture given at University of California at Santa 
Barbara and Riverside, at University of Miami and Rice University. 
HARDY’S “A MATHEMATICIAN’S APOLOGY”
L. J. MORDELL, St. Johns College, Cambridge, England 


A reprint of this most interesting book appeared in 1967 with a foreword by 
C. P. Snow, Hardy’s friend of long standing. 

It has often been reviewed and highly praised, but there are some
opinions expressed by Hardy which, perhaps, have not been adequately dealt
with by other reviewers. Furthermore, Snow’s foreword calls for some comment,
especially his references to Ramanujan (1887-1920). He writes that after Hardy 
and Littlewood read the manuscript sent by Ramanujan to Hardy (probably 
Jan. 16, 1913), “they knew, and knew for certain” that he “was a man of ge- 
nius....It was only later that Hardy decided that Ramanujan was, in terms of 
natural mathematical genius, in the class of Gauss and Euler; but that he could 
not expect, because of the defects of his education, and because he had come on 
the scene too late in the line of mathematical history, to make a contribution on 
the same scale.” While one would readily accept that Ramanujan was a man of 
genius, the comparison with Gauss and Euler is very farfetched. I have some 
difficulty in believing that Hardy made such a statement, or at any rate made 
it in this form. What does natural mathematical genius mean? Undoubtedly 
Ramanujan was outstanding in some aspects of mathematics and had great 
potentialities. But this is not enough. What really matters is what he did, and 
one cannot accept such a comparison with Euler and Gauss, whose many-sided 
contributions were of fundamental importance and changed the face of mathe- 
matics. In fact in 1940, in the book on Ramanujan, Hardy said “I cannot imag- 
ine anybody saying with any confidence, even now, just how great a mathema- 
tician he was and still less how great a mathematician he might have been.” 

Snow says that Ramanujan, as is commonly believed, was the first Indian 
to be elected (2 May, 1918) a fellow of the Royal Society. He was the second. 
The first was Ardaseer Cursetjee (1808-1877), shipbuilder and engineer, F.R.S. 
27 May, 1841. Snow notes that Ramanujan was elected a fellow of Trinity four 
years after his arrival in England and continues, “it was a triumph of academic 
uprightness that they should have elected Hardy’s protégé Ramanujan at a 
time when Hardy was only just on speaking terms with some of the electors 
and not at all with others.” It is well that the merits of a fellowship candidate 
are judged by the quality of his original work and not by the political views of 
his sponsors. 

Let us examine some of the views expressed by Hardy. They are sometimes 
stated too categorically, regardless of exceptions and limitations. A number of 
them had their origin in what he says, most gloomily, in the very first section 
of the Apology: “It is a melancholy experience for a professional mathematician 
to find himself writing about mathematics. The function of a mathematician 
is to do something, to prove new theorems, to add to mathematics, and not to 
talk about what he or other mathematicians have done.” 

His practice many years ago does not conform with this statement. He recalls 
in Section 6 that he did talk about mathematics in his 1920 Oxford inaugural
lecture, which actually contains an apology for mathematics. Further in 1921, 
he gave an address on Goldbach’s theorem to the Mathematical Society of 
Copenhagen. In this, he did talk about what he and other mathematicians had 
done. Such talks render a real service to mathematics and many have found 
great pleasure and inspiration in listening to or reading such expositions. Hardy 
had followed the practice of many eminent mathematicians in giving them. 
These have contributed to the richness and vividness of mathematics and make 
it a living entity. Without them, mathematics would be much the poorer. 

No mathematician can always be producing new results. There must inevit- 
ably be fallow periods during which he may study and perhaps gather ideas and 
energy for new work. In the interval, there is no reason why he should not occupy 
himself with various aspects of mathematical activity, and every reason why 
he should. The real function of a mathematician is the advancement of mathe- 
matics. Undoubtedly the production of new results is the most important thing 
he can do, but there are many other activities which he can initiate or partici- 
pate in. Hardy had his full share of these. He took a leading part in the reform 
of the mathematical tripos some sixty years ago. Before then, it was looked upon 
as a sporting event, reminding one of the Derby, and was out of touch with 
continental mathematics. A mathematician can engage in the many admin- 
istrative aspects of mathematics. Hardy was twice secretary and president of 
the London Mathematical Society and, while so occupied, must have done an 
enormous amount of unproductive work. He served on many committees dealing 
with mathematics and mathematicians. He wrote a great many obituary notices. 
He was well aware that a professor of mathematics is a representative of his 
subject in his University. This entails many duties which cannot be called doing 
mathematics. 

His reference to a melancholy experience shows how much he took to heart 
and suffered from the loss of his creative powers. The result is, as Snow says, 
that the Apology is a book of haunting sadness. 

Further in this first section, he says despairingly, “if then I find myself
writing not mathematics but ‘about’ mathematics, it is a confession of weakness, 
for which I may rightly be scorned or pitied by younger and more vigorous 
mathematicians. I write about mathematics because, like any other mathe- 
matician who has passed sixty, I have no longer the freshness of mind, the 
energy, or the patience to carry on effectively with my proper job.” He had been 
for many years a most active mathematician and his collected works now being 
published will consist of seven volumes. It seems almost nonsense to say that 
anyone would scorn or pity him, and the use of the term ‘rightly’ is even more 
nonsensical. 

We all know only too well that with advancing age we are no longer in our 
prime, and that our powers are dimmed and are not what they once were. Most 
of us, but not Hardy, accept the inevitable. There are still many consolations. 
We can perhaps find pleasure in thinking about some of our past work. We can 
read what others are doing, but this may not be easy since many new techniques
have been evolved, sometimes completely changing the exposition of classical 
mathematics. Various reviews, however, may give one some idea of what has 
been done. (We can still be of service to younger mathematicians.) 

His statement about a mathematician who has passed sixty is far too sweep- 
ing and any number of instances to the contrary can be mentioned, even among 
much older people. One need only note some recent Cambridge and Oxford 
professors. Great activity among octogenarians is shown by Littlewood, his 
lifelong collaborator, Sydney Chapman, his former pupil and collaborator, and 
myself. There is also Besicovitch in the seventies. Davenport, who had passed 
sixty, was as active and creative as ever, and his recent death is a very great 
loss to mathematics since he could have been expected to continue to produce 
beautiful and important work. 

The question of age was ever present in Hardy’s mind. In Section 4, he says, 
“No mathematician should ever allow himself to forget that mathematics, more 
than any other art or science, is a young man’s game.” It seems that he could not 
reconcile himself to growing old. For further on, he says, “I do not know an
instance of a major mathematical advance initiated by a man past fifty.” This 
may be so, but much depends on the definition of the advance. But there is no 
need to be troubled about it. Much important work has been done by men after 
the age of fifty. 

A number of Hardy’s statements must be qualified. In Section 2, he says, 
that “good work is not done by ‘humble’ men. It is one of the first duties of a 
professor, for example, in any subject, to exaggerate a little both the impor- 
tance of his subject and his own importance in it. A man who is always asking, 
‘Is what I do worthwhile?’ and ‘Am I the right person to do it?’ will always be 
ineffective himself and a discouragement to others.” 

Though one may naturally have a better opinion of one’s work than others 
have, there are many exceptions to his statement. I never knew Davenport to 
exaggerate or emphasize the importance of his work, but he was a most effective 
mathematician and a very successful supervisor of research. Prof. Frechet told 
me a few years ago, that when Norbert Wiener was working with him a long 
time ago, he was always asking, “Is my work worthwhile?”, “Am I slipping?”, 
etc. S. Chowla is as modest and humble a mathematician as I know of, but he 
inspires many research students. 

We comment on some more of Hardy’s statements about mathematics. 
One of the most surprising is in Section 29, “I do not remember having felt, as 
a boy, any passion for mathematics, and such notions as I may have had of the 
career of a mathematician were far from noble. I thought of mathematics in 
terms of examinations and scholarships; I wanted to beat other boys, and this 
seemed to me to be the way in which I could do so most decisively.” 

It has often been said that mathematicians are born and not made. Most 
great mathematicians developed their keenness for mathematics in their school 
days. Their ability revealed itself by comparison with the performances of their 
schoolmates. Their ambition was to continue the study of mathematics and to
take up a mathematical career. Probably no other motive played any part in 
the decision of most of them. 

In Section 3, he considers the case of a man who sets out to justify his exis- 
tence and his activities. I see no need for justification any more than a poet or 
painter or sculptor does. As Trevelyan says, disinterested intellectual curiosity 
is the life blood of real civilization. It is curiosity that makes a mathematician 
tick. When Fourier reproached Jacobi for trifling with pure mathematics, 
Jacobi replied that a scientist of Fourier’s calibre should know that the end of 
mathematics is the great glory of the human mind. Most mathematicians do 
mathematics for the very good reason that they like and enjoy doing it. Daven- 
port told me that he found it “exciting” to do mathematics. 

Hardy says that the justifier has to distinguish two different questions. The 
first is whether the work which he does is worth doing and the second is why he 
does it, whatever its value may be. He says to the first question: The answer of 
most people, if they are honest, will usually take one or the other of two forms; 
and the second form is merely a humbler version of the first, which we need to 
consider seriously. “I do what I do because it is the one and only thing I can do 
at all well.” It suffices to say that the mathematician felt no need to do any- 
thing else. 

Hardy is very appreciative of the beauty and aesthetic appeal of mathema- 
tics. “A mathematician,” he says in Section 10, “like a painter or poet, is a maker 
of patterns... ,” and these “... must be beautiful. The ideas... must fit 
together in a harmonious way. Beauty is the first test: there is no permanent 
place in the world for ugly mathematics .... It may be very hard to define 
mathematical beauty ... ” but one can recognize it. He discusses the aesthetic 
appeal of theorems by Pythagoras on the irrationality of $\sqrt{2}$, and Euclid on the
existence of an infinity of prime numbers. He says in Section 18, “there is a 
very high degree of unexpectedness, combined with inevitability and economy. 
The arguments take so odd and surprising a form; the weapons used seem so 
childishly simple when compared with the far-reaching results.” 

I might suggest among other attributes of beauty, first of all, simplicity of 
enunciation. The meaning of the result and its significance should be grasped 
immediately by the reader, and these in themselves may make one think, what 
a pretty result this is. It is, however, the proof which counts. This should prefer- 
ably be short, involve little detail and a minimum of calculations. It leaves the 
reader impressed with a sense of elegance and wondering how it is possible that 
so much can be done with so little. 

Somehow, I do not think that Hardy’s work is characterized by beauty. It 
is distinguished more by his insight, his generality, and the power he displays 
in carrying out his ideas. Many of the results that he obtains are very important 
indeed, but the proofs are often long and require concentrated attention, and 
this may blunt one’s feelings even if the ideas are beautiful. 

Hardy does not define ugly mathematics. Among such, I would mention 
those involving considerable calculations to produce results of no particular
interest or importance; those involving such a multiplicity of variables, con- 
stants, and indices, upper, lower, right, and left, making it very difficult to gather 
the import of the result; and undue generalization apparently for its own sake 
and producing results with little novelty. I might also mention work which 
places a heavy burden on the reader in the way of comprehension and verifica- 
tion unless the results are of great importance. 

Hardy had previously said that he could “quote any number of fine theo- 
rems from the theory of numbers whose meaning anyone can understand, but 
whose proofs, though not difficult, may be found tedious.” It often happens that 
there are significant results apparently of some depth, the proof of which can be 
grasped by those with a minimum of mathematical knowledge. Perhaps I may 
be pardoned if I give one of my own. The theorem of Pythagoras suggests the 
problem of finding the integer solutions of the equation $x^2 + y^2 = z^2$. This was
done some 1000 years ago and is not difficult. But suppose we consider the more
general equation $ax^2 + by^2 = cz^2$. This is a real problem in the theory of numbers.
Legendre at the end of the eighteenth century gave necessary and sufficient
conditions for its solvability. Then when the equation is taken in the normal
form, i.e. $abc$ is square-free and $a > 0$, $b > 0$, $c > 0$, Holzer showed in 1953 that a
solution existed with $|z| \le \sqrt{ab}$, from which it follows that $|x| \le \sqrt{bc}$, $|y| \le \sqrt{ca}$.
I recently found a proof of this result that no one would call tedious by showing
that if a solution $(x_1, y_1, z_1)$ existed with $|z_1| > \sqrt{ab}$, then there was another with
$|z_2| < |z_1|$. This arose by taking an appropriate line through the point $(x_1, y_1, z_1)$
to meet the conic $ax^2 + by^2 = cz^2$ in the point $(x_2, y_2, z_2)$. I call this a schoolboy
proof, because the only advanced result required is that the equation $lx + my = n$
has an integer solution if $l$ and $m$ are co-prime. A proof of the theorem could
have been found by a schoolboy. 
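Holzer's bounds make the search for a solution finite, which the following brute-force sketch exploits. It is not the descent argument described above, only an exhaustive search inside the box $x^2 \le bc$, $y^2 \le ca$, $z^2 \le ab$; the function name is an illustrative choice, and the inputs are assumed to be positive integers in normal form.

```python
from math import isqrt


def holzer_solution(a: int, b: int, c: int):
    """Search for a nonzero solution of a*x^2 + b*y^2 = c*z^2 with
    x^2 <= b*c, y^2 <= c*a, z^2 <= a*b and x, y, z >= 0.
    Returns the first solution found, or None if the box contains none."""
    for z in range(isqrt(a * b) + 1):
        for x in range(isqrt(b * c) + 1):
            for y in range(isqrt(c * a) + 1):
                if (x, y, z) != (0, 0, 0) and a * x * x + b * y * y == c * z * z:
                    return (x, y, z)
    return None


print(holzer_solution(2, 3, 5))
print(holzer_solution(1, 1, 3))
```

When the equation is solvable, Holzer's theorem guarantees that the box is not empty; when it is not solvable, as for $x^2 + y^2 = 3z^2$, the search returns nothing.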

We conclude by examining Hardy’s views about the utility or usefulness of 
mathematics. He seems to denigrate the usefulness of ‘real’ mathematics. In 
Section 21, he says, “The ‘real’ mathematics of the ‘real’ mathematicians, the 
mathematics of Fermat and Euler and Gauss and Abel and Riemann is almost 
wholly ‘useless.’ ” This statement is easily refuted. A ton of ore contains an 
almost infinitesimal amount of gold, yet its extraction proves worthwhile. So if 
only a microscopic part of pure mathematics proves useful, its production would 
be justified. Any number of instances of this come to mind, starting with the 
investigation of the properties of the conic sections by the Greeks and their 
application many years later to the orbits of the planets. Gauss’ investigations 
in number theory led him to the study of complex numbers. This is the beginning 
of abstract algebra, which has proved so useful for theoretical physics and ap- 
plied mathematics. Riemann’s work on differential geometry proved of inval- 
uable service to Einstein for his relativity theory. Fourier’s work on Fourier 
series has been most useful in physical investigations. Finally one of the most 
useful and striking applications of pure mathematics is to wireless telegraphy 
which had its origin in Maxwell’s solution of a differential equation. Many new 
disciplines are making use of more and more pure mathematics, e.g., the biological
sciences, economics, game theory, and communication theory, which
requires the solution of some difficult Diophantine equations. It has been truly 
said that advances in science are most rapid when their problems are expressed 
in mathematical form. These in time may lead to advances in pure mathematics. 

These remarks may serve as a reply to Hardy’s statement that the great 
bulk of higher mathematics is useless. 

It is suggested that one purpose mathematics may serve in war is that a 
mathematician may find in mathematics an incomparable anodyne. Bertrand 
Russell says that in mathematics, “one at least of our nobler impulses can best 
escape from the dreary exile of the actual world.” Hardy’s comment on this 
reveals his depressed spirits. “It is a pity,” he says, “that it should be necessary 
to make one very serious reservation—he must not be too old. Mathematics is 
not a contemplative but a creative subject; no one can draw much consolation 
from it when he has lost the power or desire to create; and that is apt to happen 
to mathematicians rather soon.” What does he mean when he says mathematics 
is not a contemplative subject? Many people can derive a great deal of pleasure 
from the contemplation of mathematics, e.g., from the beauty of its proofs, the 
importance of its results, and the history of its development. But alas, appar- 
ently not Hardy. 


A mathematician’s apology by G. H. Hardy with a foreword by C. P. Snow, University Press, 
Cambridge 1967 (18 s.). A paperback edition is available at 8s. 


WINNING STRATEGIES FOR THE IDEAL GAME 
C. W. HENSON, Duke University 


If $R$ is a commutative ring (with identity) then the ideal game on $R$ (or
simply the $R$-game) is a two-person game in which the players alternate in
choosing elements from $R$, with each choice being subject to certain conditions
on the ideal generated by the set of previously chosen elements. Specifically,
the player making the $n$th move must choose $a_n \in R$ so that

(i) $a_n$ is not in the ideal $(a_1, \ldots, a_{n-1})$ generated by the previously chosen
elements, and

(ii) $(a_1, \ldots, a_n)$ is a proper ideal. The player who is able to choose $a_n$ so
that $(a_1, \ldots, a_n)$ is a maximal ideal is the winner. (If $R$ is a field, so that no
first choice is possible, we say that the second player wins.)
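To make the rules concrete, here is a toy game-tree search for the rings $Z/nZ$, which are not the rings studied in this paper. It assumes the standard description of the ideals of $Z/nZ$ as $(g)$ with $g$ dividing $n$, so a position is just a divisor $g$, starting from $g = n$ (the zero ideal). A legal move passes to a proper divisor $d > 1$ of $g$, and the mover wins at once if $d$ is prime. All names are illustrative.

```python
from functools import lru_cache


def is_prime(d: int) -> bool:
    return d >= 2 and all(d % p for p in range(2, int(d ** 0.5) + 1))


@lru_cache(maxsize=None)
def mover_wins(g: int) -> bool:
    """True if the player to move from the ideal (g) of Z/nZ can force a win."""
    # Choosing a generator outside (g) that keeps the ideal proper replaces
    # g by a proper divisor d > 1; every such d can be reached by choosing d.
    moves = [d for d in range(2, g) if g % d == 0]
    return any(is_prime(d) or not mover_wins(d) for d in moves)


def first_player_wins(n: int) -> bool:
    # If n is prime, Z/nZ is a field: no first move exists and the
    # second player wins by the convention stated above.
    return mover_wins(n)


print([n for n in range(2, 20) if first_player_wins(n)])
```

In this family the first player wins immediately whenever $n$ is composite, because every maximal ideal is principal. The difficulty addressed in the paper arises only in rings where that fails.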

If R is Noetherian, so that every increasing chain of ideals has finite length, 
then every play of the R-game must have a winner. In such a case there is neces- 
sarily a winning strategy for one of the players. In specific cases, however, it is 
difficult to tell which player has the strategy or to give a description of it. In 
this paper we show that if R is the ring P[x] of polynomials over a PID (prin- 
cipal ideal domain) P, then it is the first player who has the winning strategy. 
In particular this is true when R is Z[x] or F[x, y], where Z is the ring of rational
integers and F an arbitrary field. The proof uses only elementary facts
from the theory of rings and is constructive, in the sense that an explicit descrip- 
tion of the strategy can be extracted from the argument. This result follows 
from the: 


MAIN THEOREM. Let $D$ be an integral domain and $a \ne 0$ an element of $D$ such
that $D/(a)$ is a PID but not a field. Then the first player has a winning strategy in
the $D$-game, beginning with the choice of $a^2$.


COROLLARY. Let $D$ be $P[x]$, where $P$ is a PID but not a field. If $a$ is either an
irreducible element of $P$ or of the form $x + b$ ($b \in P$) then the first player has a winning
strategy in the $D$-game, beginning with the choice of $a^2$.


Proof. In the first case $D/(a) \cong (P/(a))[x]$ and in the second $D/(a) \cong P$. In
either case the Main Theorem applies.

Note that the Main Theorem yields a first-player winning strategy in the
$P[x]$-game whenever $P$ is an integral domain with a principal maximal ideal.

For the remainder of this paper we fix $D$ and $a$ as in the Main Theorem, and
let $R$ be $D/(a^2)$. Also let $\sigma: D \to R$ and $\pi: R \to R/(\alpha)$ be the canonical
homomorphisms and set $\alpha = \sigma(a)$. Then $R/(\alpha) \cong D/(a)$, so that $R/(\alpha)$ is a PID but not a
field. Moreover, $R$ and $\alpha$ satisfy the property


(A) For any $\beta \in R$, $\beta\alpha = 0 \iff \beta \in (\alpha)$.


Indeed if $\beta\alpha = 0$ then $\sigma(b) = \beta$ and $ba \in (a^2)$ for some $b \in D$. Since $D$ is an integral
domain this implies $b \in (a)$, and hence $\beta \in (\alpha)$. The converse follows from the
fact that $\alpha^2 = \sigma(a^2) = 0$.

The rest of this paper is devoted to proving that the second player has a 
winning strategy in the R-game (Theorem 1) and hence to proving the Main 
Theorem. 


LEMMA 1. If $I$ is an ideal of $R$ and $b \in I$ satisfies $\pi(I) = (\pi(b))$ in $R/(\alpha)$, then
$I = (b) + I \cap (\alpha)$.


Proof. We must show $I \subseteq (b) + I \cap (\alpha)$. If $a \in I$ then $\pi(a) \in (\pi(b))$ so that
$a = cb + d\alpha$, for some $c, d \in R$. But $a - cb \in I$ so that $a \in (b) + I \cap (\alpha)$, as desired.


LEMMA 2. Let $I$ be an ideal of $R$. $I$ is prime $\iff$ $I = (\alpha)$ or $I$ is maximal. $I$ is
maximal $\iff$ $I = (\alpha, \beta)$ where $\pi(\beta)$ is irreducible in $R/(\alpha)$.


Proof. Since $\alpha^2 = 0$ every prime (thus every maximal) ideal contains $(\alpha)$.
In that case $I$ and $\pi(I)$ are associated ideals. Both statements then follow from
the fact that $R/(\alpha)$ is a PID.


LEMMA 3. Let $I$ be an ideal of $R$;

(i) if $I \subseteq (\alpha)$ then $I$ is principal,

(ii) there exist $b, c \in R$ so that $I = (b, c\alpha)$. In fact this is true whenever $b \in I$,
$\pi(I) = (\pi(b))$ and $I \cap (\alpha) = (c\alpha)$.
Proof. (i) Let $J$ be the ideal $I : (\alpha)$ so that $J = \{x \mid x\alpha \in I\}$. Then $I \cap (\alpha) = J\alpha$.
Choose $c \in J$ so that $\pi(J) = (\pi(c))$, and hence $J = (c) + J \cap (\alpha)$, by Lemma 1.
Then $I = I \cap (\alpha) = J\alpha = (c\alpha)$, so that $I$ is principal.

(ii) This follows from (i) using Lemma 1.

We shall call any representation of an ideal $I$ as in Lemma 3 (ii) a standard
representation of $I$.

REMARK. Since every ideal of $R$ is finitely generated, $R$ is Noetherian. Thus
each proper ideal $I$ has a normal decomposition into primary ideals (cf. [1]). If
$I \not\subseteq (\alpha)$ then the prime ideals containing $I$ are all maximal, by Lemma 2. Hence
if the normal decomposition of such an $I$ is $I = I_1 \cap \cdots \cap I_n$ (where each $I_j$ is
an $M_j$-primary ideal, $M_j$ a maximal ideal) then we have: (i) $M_1, \ldots, M_n$ are
the maximal ideals containing $I$, and (ii) each $I_j$ is the smallest $M_j$-primary
ideal containing $I$. Also a proper ideal $J$ is $M$-primary ($M$ a maximal ideal)
exactly when $J \supseteq M^n$ for some $n > 0$.

If $I$ is a proper ideal not contained in $(\alpha)$, $M$ is a maximal ideal and $I \subseteq M$
then we let $I_M$ be the $M$-primary component of $I$. It follows that for any $M$-primary
ideal $J$, $I + J$ is also $M$-primary and $I + J = I_M + J$.


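LEMMA 4. If $I = (b, c\alpha)$ in $R$ and $b \notin (\alpha)$ then $I \cap (\alpha) = (b\alpha, c\alpha)$.


Proof. We must show $I \cap (\alpha) \subseteq (b\alpha, c\alpha)$. If $a = a'\alpha$ is in $I \cap (\alpha)$ then $a = a'\alpha
= b'b + c'c\alpha$ for some $b', c' \in R$. Then $b' \in (\alpha)$ since $(\alpha)$ is a prime ideal. That is,
$b' = b''\alpha$ for some $b''$ and $a = b''(b\alpha) + c'(c\alpha)$ as desired.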


LEMMA 5. Let $a \in R$; $a$ is a unit $\iff$ $\pi(a)$ is a unit in $R/(\alpha)$.


Proof. Obviously $\pi(a)$ is a unit if $a$ is one. If $\pi(a)\pi(b) = 1$ in $R/(\alpha)$ then
$ab = 1 + c\alpha$ for some $c \in R$. But then $a[b(1 - c\alpha)] = 1 - (c\alpha)^2 = 1$, so $a$ is a unit.


LEMMA 6. Let $M = (\alpha, \beta)$ be a maximal ideal of $R$ and let $I$ be an $M$-primary
ideal. Then there exist $k \ge r \ge 0$, $k > 0$ and $a \in R$ so that $I = (\beta^k + a\alpha, \beta^r\alpha)$ is a
standard representation of $I$.


Proof. Since $I$ is $M$-primary there is an $n > 0$ so that $I \supseteq M^n$ and hence
$\pi(I) \supseteq (\pi(\beta)^n)$. Thus there is a $k > 0$ so that $\pi(I) = (\pi(\beta)^k)$. By Lemma 3 there
exist $a, c \in R$ so that $I = (\beta^k + a\alpha, c\alpha)$ is a standard representation of $I$. Thus
$\beta^k\alpha \in (c\alpha)$ and there exists $d \in R$ with $(\beta^k - dc)\alpha = 0$. This implies that $\pi(\beta)^k
\in (\pi(c))$, by property A, and hence there is a unit $\pi(u)$ in $R/(\alpha)$ and $k \ge r \ge 0$
such that $\pi(c) = \pi(u)\pi(\beta)^r$. Then $c\alpha = u\beta^r\alpha$ so that $(c\alpha) = (\beta^r\alpha)$ by Lemma 5,
completing the proof.

If $M = (\alpha, \beta)$ is a maximal ideal of $R$ and $n > 0$, then $M^n$ is generated by
the set of elements of the form $\beta^r\alpha^s$, where $r, s \ge 0$ and $r + s = n$. That is,
$M^n = (\beta^n, \beta^{n-1}\alpha)$. Thus if $I = (\beta^k + a\alpha)$ where $k > 0$ then $I$ is $M$-primary, since
it contains $(\beta^{2k}, \beta^{2k-1}\alpha)$, and by Lemma 6 every principal $M$-primary ideal can
be expressed in this form. But this implies that $M^n$ ($n > 0$) is not a principal
ideal: otherwise $M^n = (\beta^n + a\alpha)$ and $(\beta^n\alpha) = (\beta^{n-1}\alpha)$, using Lemma 4. This implies
that $\pi(\beta)^{n-1} \in (\pi(\beta)^n)$, by property A, contradicting the irreducibility of $\pi(\beta)$.




DEFINITION. An ideal $I$ of $R$ is special $\iff$ there is a maximal ideal $M$ so that
$I = M^n$, for some $n > 0$.

We shall show that the special ideals of $R$ are exactly those proper ideals $I$
of $R$ such that there is a winning strategy for the second player in the $R/I$-game.


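LEMMA 7. Let $M$ be a maximal ideal of $R$ and $I$ an $M$-primary ideal. Then there
is a $b \in R$ such that $(b)$ is $M$-primary and $I + (b)$ is special.


Proof. Let $M = (\alpha, \beta)$ and $I = (\beta^k + a\alpha, \beta^r\alpha)$ as in Lemma 6 ($k \ge r \ge 0$, $k > 0$).


Case 1. Assume $a \in (\alpha, \beta^r)$, and hence $I = (\beta^k, \beta^r\alpha)$. If $k > r$ then set $b = \beta^{r+1}$,
so that $I + (b) = (\beta^{r+1}, \beta^r\alpha)$. If $k = r$ we set $b = \beta^r + \beta^{r-1}\alpha$ and have $I + (b)
= (\beta^r, \beta^{r-1}\alpha)$.

Case 2. Assume $a \notin (\alpha, \beta^r)$ so we may let $s$ be the largest integer such that
$a \in (\alpha, \beta^s)$. Thus $0 \le s < r$, and we may write $a = a'\beta^s + a''\alpha$ where $a' \notin (\alpha, \beta)$. That
is, $a\alpha = a'\beta^s\alpha$. Set $b = \beta^{s+1}$ so that $I + (b) = (\beta^{s+1}, a'\beta^s\alpha)$. Since $a' \notin (\alpha, \beta)$ which is
maximal, we may write $1 = ca' + d\beta + e\alpha$. That is


$\beta^s\alpha = c(a'\beta^s\alpha) + d\alpha\beta^{s+1}$


and hence $I + (b) = (\beta^{s+1}, \beta^s\alpha)$. The proof is completed by noting that each $(b)$ is
$M$-primary and each $I + (b)$ is special.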


LEMMA 8. Let $(b)$ be a proper ideal of $R$ which is not contained in $(\alpha)$. Then the
primary components of $(b)$ are principal.


Proof. Let $M = (\alpha, \beta)$ be a maximal ideal containing $(b)$. There is a largest
integer $k$ such that $\pi(b) \in (\pi(\beta)^k)$, and $k > 0$. Thus we may write $b = a\beta^k + a'\alpha$
where $a \notin (\alpha, \beta)$. Hence there exist $c, d, e \in R$ with $1 = ca + d\beta^k + e\alpha$. This leads
to $b = (a + da'\alpha)(\beta^k + ca'\alpha)$, so we have expressed $b$ in the form $b_1(\beta^k + b_2\alpha)$ where
$b_1 \notin (\alpha, \beta)$. This implies that any $M$-primary ideal containing $(b)$ also contains
$(\beta^k + b_2\alpha)$, which is $M$-primary itself. That is, $(b)_M = (\beta^k + b_2\alpha)$ and the proof is
complete.
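The factorization of $b$ used in this proof can be checked by direct expansion. The display below is a worked verification added for the reader, using only $\alpha^2 = 0$ and $1 = ca + d\beta^k + e\alpha$ from the proof.

```latex
\begin{align*}
(a + da'\alpha)(\beta^k + ca'\alpha)
  &= a\beta^k + ca\,a'\alpha + d\beta^k a'\alpha + dc\,a'^2\alpha^2 \\
  &= a\beta^k + (ca + d\beta^k)\,a'\alpha && \text{since } \alpha^2 = 0 \\
  &= a\beta^k + (1 - e\alpha)\,a'\alpha \\
  &= a\beta^k + a'\alpha = b && \text{again since } \alpha^2 = 0 .
\end{align*}
```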


LEMMA 9. Let $M$ be a maximal ideal of $R$ and $n > k > 0$. Then there is no $b \in R$
such that $M^n + (b) = M^k$.


Proof. Otherwise, let $M = (\alpha, \beta)$ be maximal and assume $M^n + (b) = M^k$ where
$n > k > 0$. Passing to $R/(\alpha)$ we see that $b \notin (\alpha)$. Using Lemma 8 we let $(b)_M
= (\beta^r + a\alpha)$. Also $M^k = M^n + (b) = M^n + (b)_M$, so that $k = r$ and $M^k = (\beta^n, \beta^k + a\alpha)$.
Thus $M^k = (\beta^k, a\alpha)$. By Lemma 4 this implies that

$(\beta^{k-1}\alpha) = M^k \cap (\alpha) = (\beta^k\alpha, a\alpha)$


so we may write $a\alpha = a'\beta^{k-1}\alpha$. But this implies that $\beta^n = \beta^{n-k}(\beta^k + a\alpha) - a'\beta^{n-1}\alpha
\in (\beta^k + a\alpha)$ and hence that $M^k \subseteq (\beta^k + a\alpha)$. That is, $M^k = (\beta^k + a\alpha)$ which is a
contradiction.


LEMMA 10. Let $I$ be a proper ideal of $R$. Then there exists $b \in R$ such that $I + (b)$
is special.
Proof. If $I$ is not contained in $(\alpha)$ and $M$ is any maximal ideal containing $I$,
there is an $M$-primary ideal $(b)$ so that $I_M + (b)$ is special, by Lemma 7. Thus
$I + (b) = I_M + (b)$ and $b$ is as desired.

If $I = (\alpha)$ let $b$ be such that $(\alpha, b)$ is a maximal ideal. If $I \subseteq (\alpha)$ but $I \ne (\alpha)$,
then by Lemma 3 we may write $I = (c\alpha)$ where $(c)$ is a proper ideal and $c \notin (\alpha)$.
Let $M = (\alpha, \beta)$ be a maximal ideal containing $(c)$ so that $(c)_M = (\beta^k + a\alpha)$ for
some $k > 0$. Set $b = \beta^{k+1}$; by Lemma 4 $(I + (b)) \cap (\alpha) = (b\alpha, c\alpha)$. But $(b\alpha, c\alpha)
= ((b) + (c)_M)\alpha$ since $(b)$ is $M$-primary, so


$(I + (b)) \cap (\alpha) = (\beta^{k+1}, \beta^k + a\alpha)\alpha = (\beta^k\alpha).$


Hence $I + (b) = (\beta^{k+1}, \beta^k\alpha)$. In each case $I + (b)$ is special and therefore the proof
is complete.


THEOREM 1. The second player has a winning strategy in the $R$-game. In fact,
if $I$ is a proper ideal of $R$, then the second player has a winning strategy in the
$R/I$-game if $I$ is special, and the first player has a winning strategy in the $R/I$-game
if $I$ is not special.


Proof. Let $I$ be a special ideal. If $I \ne I + (a) \ne R$ then $I + (a)$ is not special,
by Lemma 9. Then there exists $b \in R$ so that $I + (a, b)$ is special, by Lemma 10.
Since the maximal ideals are special (and $R$ is Noetherian) it follows that the
second player has a winning strategy in the $R/I$-game whenever $I$ is special.
If $I$ is a proper ideal but is not special, then there is a $b \in R$ such that $I + (b)$ is
special; that is, the $R/I$-game has a first-player winning strategy in this case.

Finally, since no principal ideal of $R$ is special there must be a winning
strategy for the second player in the $R$-game.
SUMMATION OF THE SERIES $1^n + 2^n + \cdots + x^n$
USING ELEMENTARY CALCULUS


L. S. LEVY, University of Wisconsin 


It is well known that the sum of the series $1 + 2 + \cdots + x$ is $x(x+1)/2$, a
polynomial which we shall call $P_1(x)$; and it is almost as well known that
$1^2 + 2^2 + \cdots + x^2 = x(x+1)(2x+1)/6 = P_2(x)$. More generally, an old theorem (one
proof of which is suggested by problem 3, Section 4) states:


THEOREM 1. For each positive integer $n$ there is exactly one polynomial $P_n(x)$ in
$x$ such that


(1) $P_n(x) = 1^n + 2^n + \cdots + x^n$ whenever $x$ is a positive integer.
The degree of $P_n(x)$ is $n + 1$.


The first few of these polynomials are: 


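$P_1(x) = \frac{x^2}{2} + \frac{x}{2} = \frac{1}{2}x(x+1),$

$P_2(x) = \frac{x^3}{3} + \frac{x^2}{2} + \frac{x}{6} = \frac{1}{6}x(x+1)(2x+1),$

$P_3(x) = \frac{x^4}{4} + \frac{x^3}{2} + \frac{x^2}{4} = \frac{1}{4}x^2(x+1)^2,$

$P_4(x) = \frac{x^5}{5} + \frac{x^4}{2} + \frac{x^3}{3} - \frac{x}{30} = \frac{1}{30}x(x+1)(2x+1)(3x^2+3x-1),$

$P_5(x) = \frac{x^6}{6} + \frac{x^5}{2} + \frac{5x^4}{12} - \frac{x^2}{12} = \frac{1}{12}x^2(x+1)^2(2x^2+2x-1),$

$P_6(x) = \frac{x^7}{7} + \frac{x^6}{2} + \frac{x^5}{2} - \frac{x^3}{6} + \frac{x}{42} = \frac{1}{42}x(x+1)(2x+1)(3x^4+6x^3-3x+1),$

$P_7(x) = \frac{x^8}{8} + \frac{x^7}{2} + \frac{7x^6}{12} - \frac{7x^4}{24} + \frac{x^2}{12} = \frac{1}{24}x^2(x+1)^2(3x^4+6x^3-x^2-4x+2).$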


The most obvious question one can ask about these polynomials is how to 
find them. Some more subtle questions are suggested by the following observa- 
tions from the table: 

OBSERVATION 1. $P_n(x)$ has the following factors: $x(x+1)$ for all n; $x(x+1)(2x+1)$
for all even n; $x^2(x+1)^2$ for all odd n > 1.

OBSERVATION 2. The coefficient, in $P_n(x)$, of $x^{n+1}$ is always $1/(n+1)$ and that
of $x^n$ is always 1/2.

OBSERVATION 3. With the exception of the term $x^n/2$, all of the terms of
$P_n(x)$ have even degree, or else all have odd degree.

The key to all these questions will be an integration formula 


(2) $P_n(x) = n\int_0^x P_{n-1}(t)\,dt + C_n x$ for some constant $C_n$.

To see how (2) can be used, start with $P_1(x) = x^2/2 + x/2$. Then

$P_2(x) = 2\int_0^x (t^2/2 + t/2)\,dt + C_2 x = x^3/3 + x^2/2 + C_2 x.$


To find $C_2$, substitute the value x = 1 into this equation, using the fact that $P_2(1)$
is the sum of one term of the series $1^2+2^2+\cdots$, that is, $1 = P_2(1) = (1/3)+(1/2)+C_2$.
This shows that $C_2 = 1/6$ as indicated in the table. We will prove,
in section 3, that $C_n = 0$ whenever n is odd and >1. Thus from (2) we get

$P_3(x) = 3\int_0^x (t^3/3 + t^2/2 + t/6)\,dt = (x^4/4) + (x^3/2) + (x^2/4)$ and so on.


Note that the formula $1^3+2^3+\cdots+x^3 = (x^4/4) + (x^3/2) + (x^2/4)$ which we
obtained from the integration formula (2) is a statement about the sum of a 
series of integers. The main point of this paper is to show (without the complica- 
tions one encounters in analytic number theory) how properties of continuous 
functions can be used to obtain properties of the integers. 
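
The use of formula (2) described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`integrate_from_zero`, `evaluate`, `power_sum_polys`) are hypothetical, polynomials are stored as coefficient lists with exact fractions, and each $C_n$ is fixed by the condition $P_n(1) = 1$, as in the computation of $C_2$.

```python
from fractions import Fraction

def integrate_from_zero(coeffs):
    """Integrate a polynomial from 0 to x; coeffs[i] is the coefficient of x**i."""
    return [Fraction(0)] + [c / (i + 1) for i, c in enumerate(coeffs)]

def evaluate(coeffs, x):
    """Evaluate the polynomial with the given coefficient list at x."""
    return sum(c * Fraction(x) ** i for i, c in enumerate(coeffs))

def power_sum_polys(max_n):
    """Return {n: coefficients of P_n} for n = 1..max_n, built with formula (2)."""
    polys = {1: [Fraction(0), Fraction(1, 2), Fraction(1, 2)]}  # P_1 = x/2 + x^2/2
    for n in range(2, max_n + 1):
        # n times the integral of P_{n-1} from 0 to x
        p = [n * c for c in integrate_from_zero(polys[n - 1])]
        # P_n(1) is a one-term sum, so it equals 1; this determines C_n
        c_n = 1 - evaluate(p, 1)
        p[1] += c_n
        polys[n] = p
    return polys

polys = power_sum_polys(4)
print(polys[3])               # coefficients of P_3, lowest degree first
print(evaluate(polys[4], 3))  # 1^4 + 2^4 + 3^4
```

Exact rational arithmetic is used so that the coefficients can be compared directly with the table of polynomials.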


1. A Pitfall. Before applying calculus to our problem we illustrate a pitfall 
which we will have to avoid. Note that 


(3) $x^2 = x + x + \cdots + x$ (x terms)

whenever x is a positive integer, since the right-hand side is merely x times x. Differentiating
we get $2x = 1+1+\cdots+1$ (x terms) $= x$ so that, taking x = 1, we
get 2 = 1.


The fallacy is that, since (3) holds only when x is a positive integer, it makes 
no sense to differentiate it. What is needed for the standard differentiation 
formulas is equality of the functions involved throughout an interval. 


2. Computation of P,(x). Our object is now to prove: 


THEOREM 2. For each positive integer n there is a number $C_n$ (depending on n,
but not on x) such that


(4) $P_n(x) = n\int_0^x P_{n-1}(t)\,dt + C_n x.$


The proof will require: 


LEMMA. Let f(x) and g(x) be polynomials (in x) of degree $\le d$ which are equal
for more than d values of x. Then f(x) and g(x) are the same polynomial. In particular,
they are equal for all values of x.


Proof. Let $f(x) = a_d x^d + a_{d-1}x^{d-1} + \cdots + a_1 x + a_0$ and
$g(x) = b_d x^d + b_{d-1}x^{d-1} + \cdots + b_1 x + b_0$. Then


$h(x) = f(x) - g(x) = (a_d - b_d)x^d + (a_{d-1} - b_{d-1})x^{d-1} + \cdots + (a_1 - b_1)x + (a_0 - b_0)$


is a polynomial of degree $\le d$ such that the equation h(x) = 0 has more than d roots,
namely all values of x for which f(x) = g(x). But it is a well-known theorem that
an equation of the form $c_d x^d + c_{d-1}x^{d-1} + \cdots + c_1 x + c_0 = 0$ (each coefficient $c_i$
a real number) has at most d real roots unless $0 = c_d = c_{d-1} = \cdots = c_1 = c_0$. (See,
for example, E. Swokowski: Fundamentals of College Algebra, p. 231, Corollary
8.25.) We conclude that the coefficient $a_i - b_i$ of each power $x^i$ of x in h(x) equals
zero. In other words, each $a_i = b_i$, which is what we wanted to prove.


Proof of the theorem. By Theorem 1, $P_n(x) = 1^n + 2^n + \cdots + x^n$. Subtracting
from this $P_n(x-1) = 1^n + 2^n + \cdots + (x-1)^n$ we get

(5) $P_n(x) - P_n(x-1) = x^n$


whenever x is a positive integer. Writing n−1 in place of n in (5) and multiplying
both sides by n, we get $n[P_{n-1}(x) - P_{n-1}(x-1)] = nx^{n-1}$. Thus, differentiating
both sides of (5), we see


(6) $P_n'(x) - P_n'(x-1) = n[P_{n-1}(x) - P_{n-1}(x-1)].$


But wait! Are we again on our way to proving 2 = 1? Fortunately not this time:
Consider n fixed temporarily. Then $P_n(x)$ is a polynomial of degree n+1 in x
(Theorem 1). Thus both sides of (5) are polynomials in x of degree $\le n+1$ which
are equal for more than n+1 values of x. (They are equal for all positive integers
x.) The lemma then tells us that the left and right sides of (5) are the same polynomial,
written in two different ways. Thus we are justified in differentiating
(5) to obtain (6); and (6) will be true for all real numbers x (and all positive
integers n).
Writing x−1 for x in (6) we get


$P_n'(x-1) - P_n'(x-2) = n[P_{n-1}(x-1) - P_{n-1}(x-2)]$

which, when added to (6), gives $P_n'(x) - P_n'(x-2) = n[P_{n-1}(x) - P_{n-1}(x-2)]$.
Writing x−2 for x in (6) and adding the result to our last equation gives

$P_n'(x) - P_n'(x-3) = n[P_{n-1}(x) - P_{n-1}(x-3)].$

Proceeding in this way, we eventually obtain

(7) $P_n'(x) - P_n'(y) = n[P_{n-1}(x) - P_{n-1}(y)]$ whenever x − y = positive integer.


Note that, in (7), y does not have to be positive, nor does it have to be an integer. 
Now specialize (7) by taking x to be a positive integer. Then x —0 is a posi- 
tive integer, so we can choose y=0 in (7), getting 


$P_n'(x) = nP_{n-1}(x) + [P_n'(0) - nP_{n-1}(0)].$


Since we don't know the values of $P_n'(0)$ or $P_{n-1}(0)$, denote the expression in
square brackets by $C_n$ (to indicate that it depends on n but not on x). Then this
last equation takes the form


(8) $P_n'(x) = nP_{n-1}(x) + C_n$ (x and n positive integers).


Again consider n fixed. Since both sides of (8) are polynomials in x of degree
$\le n$ which are equal for all positive integers x, the lemma tells us that (8) holds
for all real numbers x. Therefore we can integrate both sides from 0 to x (after
replacing x by a dummy variable t), getting


$P_n(x) - P_n(0) = n\int_0^x P_{n-1}(t)\,dt + C_n x.$


This is precisely what we want, provided we can show $P_n(0) = 0$. But putting
x = 1 in (5) gives $1^n - P_n(0) = 1^n$; and this completes the proof of the theorem.


3. Factorization of $P_n(x)$. In order to find some of the factors of $P_n(x)$ we shall
need some information about its values when x is not a positive integer. To obtain
this we prove:


THEOREM 3. For every positive integer n and every real number x,


(9) $P_n(-x) = (-1)^{n+1} P_n(x-1).$


REMARK. Before proving the theorem we observe that it can be interpreted
geometrically: If n is odd, the graph of $y = P_n(x)$ is symmetric with respect to
reflection in the line $x = -\tfrac12$; while if n is even it is symmetric with respect to reflection
in the point $(-\tfrac12, 0)$. To see this, note that the number midway between −x
and x−1 is $\tfrac12[-x + (x-1)] = -\tfrac12$.


Proof of the Theorem. In equation (5), write −x+1 in place of x (which we
can do since (5) is true for all real numbers x) and interchange the order of terms
on the left-hand side:


(10) $-P_n(-x) + P_n(-x+1) = (-x+1)^n$ (for every real number x).


Note that $P_n(-x)$ cannot be interpreted as the sum of a series if −x < 0. It is
merely the value of a polynomial function. 


PROBLEM 1. Finish the proof of Theorem 3. (Let x be a positive integer in
(10); then successively write −x+1, −x+2, −x+3, ..., −2, −1 for −x in
(10) and add all the resulting equations.)


COROLLARY 1 (to Theorem 3). $C_n = 0$ whenever n is odd and >1.


Proof. When n is odd, Theorem 3 becomes $P_n(-x) = P_n(x-1)$. Therefore
Theorem 2 shows


(11) $P_n(x-1) = P_n(-x) = n\int_0^{-x} P_{n-1}(u)\,du - C_n x.$

Make the change of variable u = −t under the integral sign in (11):

$P_n(x-1) = -n\int_0^{x} P_{n-1}(-t)\,dt - C_n x.$

Now use Theorem 3 under the integral sign, with n−1 in place of n, noting that
$(-1)^{(n-1)+1} = (-1)^n = -1$ since n is odd. (Here we use the fact that n > 1; for
if n = 1, n−1 is not a positive integer, and hence Theorem 3 cannot be used.)


(12) $P_n(x-1) = n\int_0^{x} P_{n-1}(t-1)\,dt - C_n x.$


Now subtract (12) from $P_n(x) = n\int_0^x P_{n-1}(t)\,dt + C_n x$, recalling that
$P_n(x) - P_n(x-1) = x^n$ for all real numbers x (by (5)):


$x^n = n\int_0^x t^{n-1}\,dt + 2C_n x = x^n + 2C_n x$ (for all real x).

Finally, putting x = 1 we see that $C_n = 0$, as desired.


COROLLARY 2. Each polynomial $P_n(x)$ has the following factors:


(13) $x(x+1)$ for all n.
(14) $x(x+1)(2x+1)$ for all even n.
(15) $x^2(x+1)^2$ for all odd n > 1.


Proof. To see that x is a factor of $P_n(x)$, it is sufficient to show that $P_n(0) = 0$,
which we obtained in the last sentence of Section 2.

To see that x+1 is a factor, set x = 0 in Theorem 3, getting $0 = \pm P_n(-1)$
and proving (13).

Next, set x = 1/2 in Theorem 3: $P_n(-1/2) = (-1)^{n+1}P_n(-1/2)$. If n is even
we get $2P_n(-1/2) = 0$ so that x+(1/2) is a factor of $P_n(x)$. Clearing fractions
we see that 2x+1 is a factor of $P_n(x)$. Thus we have (14).

To obtain (15) we need a lemma which seems to have disappeared from re- 
cent calculus courses: 


LEMMA. Let f(x) be a polynomial and c a real number. Then $(x-c)^2$ is a factor
of f(x) $\Leftrightarrow$ f(c) = 0 and f'(c) = 0.


Proof. ($\Leftarrow$). Since f(c) = 0, x−c is a factor of f(x), say f(x) = (x−c)g(x). Then
f'(x) = g(x) + (x−c)g'(x). Substituting x = c we see that 0 = g(c) so that x−c is a
factor of g(x). Hence $(x-c)^2$ is a factor of f(x).

($\Rightarrow$). Write $f(x) = (x-c)^2 g(x)$ so that $f'(x) = 2(x-c)g(x) + (x-c)^2 g'(x)$. Letting
x = c in these equations completes the proof.


Now, differentiate Theorem 2: $P_n'(x) = nP_{n-1}(x) + C_n$. When n is odd, and
>1, $C_n = 0$ (Corollary 1), so that $P_n'(x) = nP_{n-1}(x)$. We already know that
$P_n(0) = P_n(-1) = 0$ for every positive integer n. Hence $P_n'(0) = P_n'(-1) = 0$ for
every odd positive integer n > 1. Thus, by the lemma, $x^2$ and $(x+1)^2$ are factors
of $P_n(x)$.


CAUTION. Note that $1^4+2^4+3^4 = 98$ which is not divisible by $x(x+1)(2x+1)
= 3\times(3+1)\times(2\times 3+1) = 84$, an apparent contradiction to (14) of Corollary
2. The reason for this apparent contradiction is that we must distinguish between
arithmetical and algebraic divisibility. Thus, the assertion (14) of Corollary 2
that $P_n(x)$ is divisible by $x(x+1)(2x+1)$ when n is even means that
for such an n there is a polynomial g(x) such that $P_n(x) = x(x+1)(2x+1)g(x)$.
In fact, from the table under Theorem 1,


$P_4(x) = x(x+1)(2x+1)\left(\frac{x^2}{10} + \frac{x}{10} - \frac{1}{30}\right).$

Because of the fractional coefficients in the last factor, we cannot conclude that
the number $P_4(x)$ is divisible by $x(x+1)(2x+1)$. However, we can conclude that
the number $30P_4(x)$ is divisible by the number $x(x+1)(2x+1)$.
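
The distinction between arithmetical and algebraic divisibility can be checked numerically. The following is a small sketch, not part of the original argument; the helper names `power_sum` and `divides_for_all` are hypothetical, and the check only covers a finite range of x.

```python
def power_sum(n, x):
    """Return 1^n + 2^n + ... + x^n for a positive integer x."""
    return sum(j ** n for j in range(1, x + 1))

def divides_for_all(multiplier, n, limit):
    """Check that x(x+1)(2x+1) divides multiplier * (1^n + ... + x^n) for x = 1..limit."""
    return all(
        (multiplier * power_sum(n, x)) % (x * (x + 1) * (2 * x + 1)) == 0
        for x in range(1, limit + 1)
    )

print(power_sum(4, 3) % 84)        # nonzero: 98 is not divisible by 84
print(divides_for_all(1, 4, 200))  # arithmetical divisibility fails for some x
print(divides_for_all(30, 4, 200)) # but it holds after multiplying by 30
```

The multiplier 30 is the one named in the text; the code does not attempt to find the smallest multiplier asked for in the problems.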

PROBLEM 2. Find a positive integer m such that, for all positive integers x,
$m(1^4+2^4+\cdots+x^4)$ is divisible by $1^2+2^2+\cdots+x^2$. (Better: find the smallest
such m.) Generalize the solution of problem 2 to a proof of the following
curious consequence of Corollary 2:


COROLLARY 3. For every even positive integer n there is a positive integer $M_n$
such that (for all x) $M_n(1^n+2^n+\cdots+x^n)$ is divisible by $1^2+2^2+\cdots+x^2$.


In closing, we remark that in order to understand Corollary 3 one only has 
to know about the positive integers. Yet its proof involves properties of differ- 
entiable functions, hence requires a knowledge of the entire real number system. 


4. Remarks and Problems. The integral formula for $P_n(x)$ (Theorem 2)
makes it easy to calculate a table of these polynomials. However, the proof of
this formula required the prior knowledge that the polynomials $P_n(x)$ exist for
every positive integer n (Theorem 1). Thus the solution of the following problem
is essential to complete the proof of Theorem 2.

PROBLEM 3. (a) Starting with $x^4 - (x-1)^4 = 4x^3 - 6x^2 + 4x - 1$, and writing
successively x−1, x−2, ..., 3, 2, 1 in place of x, derive the formula


$1^3+2^3+\cdots+x^3 = \tfrac14\left[x^4 + 6(1^2+2^2+\cdots+x^2) - 4(1+2+\cdots+x) + x\right]$


and use this, together with the formulas for $P_1(x)$ and $P_2(x)$ given in the table,
to find $P_3(x)$.
(b) Generalize problem (3a) to a proof of Theorem 1 (mathematical induction
will be necessary).

PROBLEM 4. (a) Is the coefficient of $x^{n+1}$ in $P_n(x)$ always 1/(n+1)?
(b) Is the coefficient of $x^n$ in $P_n(x)$ always 1/2?
(c) Is the coefficient of $x^{n-1}$ in $P_n(x)$ always n/12?
(d) Is it true of $P_n(x)$ that, except for the term of degree n, all
terms have even degree, or else all have odd degree?

PROBLEM 5. Show that $P_n'(17) - nP_{n-1}(17) = P_n'(23) - nP_{n-1}(23)$ for every
positive integer n.

PROBLEM 6. Show that, if $a_x = c_n x^n + c_{n-1}x^{n-1} + \cdots + c_1 x + c_0$, the $c_i$'s being
constants, then


$a_1 + a_2 + \cdots + a_x = c_n P_n(x) + c_{n-1}P_{n-1}(x) + \cdots + c_1 P_1(x) + c_0 x.$

Use this fact, and our table of $P_n(x)$, to find the sum of

$1^2 + 3^2 + 5^2 + \cdots + (2x-1)^2$

$(1\cdot 2\cdot 3) + (2\cdot 3\cdot 4) + (3\cdot 4\cdot 5) + \cdots + x(x+1)(x+2).$
The asterisks below indicate more challenging problems.

PROBLEM 7* (by William Self). Compute $\int_{-1}^{0} P_n(x)P_m(x)\,dx$ when m is even
and n is odd ($m, n \ge 2$). (Hint: the answer will be the same for all such m and n.)


PROBLEM 8.* Note that $P_2(x)$ and $P_3(x)$ can be factored completely into
linear factors with real number coefficients. Does $P_6(x)$ have any real linear
factors other than those listed in the table of $P_n(x)$?

PROBLEM 9.* For each n, the graph of the function $y = P_n(x)$ "interpolates"
the sequence of points $(1, 1^n)$, $(2, 1^n+2^n)$, $(3, 1^n+2^n+3^n)$, ... by connecting
them with a continuous curve. Is this interpolation genuine in the sense that,
when $x \ge 1$, the curve $y = P_n(x)$ is always concave upward? (Hint: First observe
that when n is even $P_n''(x) = n(n-1)P_{n-2}(x)$; and when n is odd $P_n'(x) = nP_{n-1}(x)$.)


We have not obtained much information about the numbers $C_n$ (n even).
For example: is $C_n$ always nonzero when n is even, do the numbers $C_2, C_4,
C_6, \ldots$ alternate in sign, do they approach a limit as $n \to \infty$? The answers to
these questions are not all what one would expect. Some additional computations
would show:


$C_2 = \frac{1}{6}$, $C_4 = -\frac{1}{30}$, $C_6 = \frac{1}{42}$, $C_8 = -\frac{1}{30}$, $C_{10} = \frac{5}{66}$,

$C_{12} = -\frac{691}{2730}$, $C_{14} = \frac{7}{6}$, $C_{16} = -\frac{3617}{510}$, $C_{18} = \frac{43867}{798}$, $C_{20} = -\frac{174611}{330}$.


(Source: G. Boole, Treatise on the Calculus of Finite Differences, p. 90 of the
third edition (S. T. Stechert and Co., N. Y., 1872). In Boole's notation, our
$C_n = (-1)^{(n+2)/2}B_{n-1}$ (n even) where $B_{n-1}$ is the (n−1)'st "Bernoulli" number.)
Boole shows, in fact, that


$\lim_{n\ \mathrm{even},\ n\to\infty} \dfrac{(-1)^{(n+2)/2}\,C_n}{2(n!)/(2\pi)^n} = 1$


so that, as $n \to \infty$ (n even), $C_n/n^2C_{n-2}$ approaches $-1/4\pi^2$ (same source, chap.
6, p. 109). This answers all the above questions and suggests:
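
The constants $C_n$ can also be generated directly from the integration formula of Theorem 2, which gives an independent check on the listed values. The sketch below is an illustration only; the function name `integration_constants` is hypothetical, and it assumes that $C_n$ is fixed by $P_n(1) = 1$ as in the introduction.

```python
from fractions import Fraction
import math

def integration_constants(max_n):
    """Return {n: C_n} for n = 2..max_n using P_n = n * integral(P_{n-1}) + C_n * x."""
    poly = [Fraction(0), Fraction(1, 2), Fraction(1, 2)]  # P_1; poly[i] multiplies x**i
    constants = {}
    for n in range(2, max_n + 1):
        # n times the integral from 0 to x of the previous polynomial
        poly = [Fraction(0)] + [n * c / (i + 1) for i, c in enumerate(poly)]
        c_n = 1 - sum(poly)   # sum(poly) is the value at x = 1, and P_n(1) = 1
        poly[1] += c_n
        constants[n] = c_n
    return constants

C = integration_constants(20)
for n in range(2, 21, 2):
    print(n, C[n])

# Ratio C_n / (n^2 C_{n-2}) for n = 20, next to the limiting value -1/(4 pi^2)
print(float(C[20] / (20 ** 2 * C[18])), -1 / (4 * math.pi ** 2))
```

At n = 20 the ratio is already within a few percent of the limiting value, and the signs of the even-indexed constants alternate.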

PROBLEM 10.** Can any of the above properties of $C_n$ be obtained by our
methods? (The double asterisk indicates that I don't know the answer.)

PROBLEM 11.** $h(x) = 3x^4+6x^3-3x+1$ is a factor of $P_6(x)$ (see the table of
$P_n(x)$). One factor of h'(x) is 2x+1, and $h'(x)/(2x+1) = 3(2x^2+2x-1)$ = a factor
of $P_5(x)$. Is this a special case of some general property of $P_n(x)$?

PROBLEM 12.** When $n \ge 6$, does $P_n(x)$ have any linear factors other than
those listed in Corollary 2 to Theorem 3? (Note: It is clear that $P_4(x)$ and $P_5(x)$
have additional linear factors; and problem 8 is the special case of this question
for n = 6. Part of the answer to problem 9 will be that $P_n(x) = 0$ has no real
roots $\ge 1$; hence by the symmetry of $P_n(x)$ (Theorem 3), it has no real roots
$\le -2$. Thus any linear factors must come from roots between −2 and +1.)


The author is indebted to Mr. Edward Shapiro of Lincoln High School, Brooklyn, N. Y., for 
his help in the preparation of this paper. 


POWERFUL NUMBERS 
S. W. GOLOMB, University of Southern California 


1. Introduction. We define a positive integer r to be a powerful number if $p^2$
divides r whenever the prime p divides r. If k(x) denotes the number of powerful
numbers $\le x$, we show that $k(x) \sim c\sqrt{x}$, with the constant $c = \zeta(3/2)/\zeta(3)$, as
$x \to \infty$. We also prove that there are infinitely many pairs of consecutive powerful
integers, such as 8, 9 and 288, 289. We conclude with some results and conjectures
concerning the gaps between powerful numbers.


2. The density of powerful numbers. Let 


(1) $F(s) = \prod_p \left(1 + p^{-2s} + p^{-3s} + p^{-4s} + \cdots\right) = \prod_p \left(1 + \dfrac{1}{p^s(p^s-1)}\right)$


where the products are extended over all primes p. It is evident that


(2) $F(s) = \sum_{r \in K} r^{-s},$

where K is the set of powerful numbers. Then, the sum of the reciprocals of the
powerful numbers,


(3) $F(1) = \sum_{r \in K} \dfrac{1}{r} = \prod_p \left(1 + \dfrac{1}{p(p-1)}\right),$

is seen to be convergent.

To estimate k(x), the number of powerful numbers up to x, we observe first
that $k(x) \ge [\sqrt{x}]$, since every perfect square is powerful. Next, we observe that
every powerful number r can be represented as a perfect square $n^2$ (including
the case n = 1) times a perfect cube $m^3$ (including m = 1), and that this representation
is unique if we require m to be square-free. That is, we set m equal to
the product of those primes having odd exponents in the canonical factorization
of r into powers of distinct primes, and the representation $r = n^2m^3$ is then unique.
Thus,


(4) $k(x) = \#\{n^2m^3 \le x,\ \mu(m) \ne 0\} = \sum_{m=1}^{\infty} \mu^2(m)\left[\left(\dfrac{x}{m^3}\right)^{1/2}\right] \sim c\,x^{1/2}$, as $x \to \infty$,

where


(5) $c = \sum_{m=1}^{\infty} \mu^2(m)\,m^{-3/2} < \zeta(3/2) < \infty.$


Explicitly,

(6) $c = \prod_p \left(1 + p^{-3/2}\right) = \prod_p \dfrac{1 - p^{-3}}{1 - p^{-3/2}} = \zeta(3/2)/\zeta(3),$
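where $\zeta(s)$ is the Riemann Zeta Function (cf. [1], p. 5). This evaluation of c
comes from setting s = 3/2 in the identity


(7) $\sum_{m=1}^{\infty} \dfrac{\mu^2(m)}{m^s} = \prod_p \left(1 + \dfrac{1}{p^s}\right) = \prod_p \dfrac{1 - p^{-2s}}{1 - p^{-s}} = \prod_p (1 - p^{-s})^{-1} \Big/ \prod_p (1 - p^{-2s})^{-1} = \zeta(s)/\zeta(2s)$


for all $\mathrm{Re}(s) > 1$, where $\zeta(s) = \sum_{n=1}^{\infty} n^{-s} = \prod_p (1 - p^{-s})^{-1}$ for $\mathrm{Re}(s) > 1$.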
For purposes of estimation, we have the inequalities 


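(8) $c\,x^{1/2} \ge k(x) \ge c\,x^{1/2} - 3x^{1/3}$ for $x \ge 1$,


because $c\,x^{1/2} = \sum_{m=1}^{\infty} \mu^2(m)(x/m^3)^{1/2} \ge \sum_{m=1}^{\infty} \mu^2(m)\left[(x/m^3)^{1/2}\right] = k(x)$, and


$c\,x^{1/2} - k(x) = \sum_{m=1}^{\infty} |\mu(m)| \left\{ \left(\dfrac{x}{m^3}\right)^{1/2} - \left[\left(\dfrac{x}{m^3}\right)^{1/2}\right] \right\} < \sum_{m=1}^{[x^{1/3}]-1} |\mu(m)| + 1 + \sum_{m=[x^{1/3}]+1}^{\infty} |\mu(m)| \left(\dfrac{x}{m^3}\right)^{1/2}$

$< \left([x^{1/3}] - 1\right) + \left(1 + x^{1/2}\int_{x^{1/3}}^{\infty} u^{-3/2}\,du\right) \le x^{1/3} + 2x^{1/2}\cdot x^{-1/6} = 3x^{1/3}.$


Numerically, c = 2.173...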
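
The count k(x) in (4) and the bounds in (8) can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, and the numerical values of $\zeta(3/2)$ and $\zeta(3)$ are standard constants typed in by hand rather than computed.

```python
import math

ZETA_3_2 = 2.6123753486854883  # zeta(3/2), hard-coded numerical value
ZETA_3 = 1.2020569031595943    # zeta(3), hard-coded numerical value
C = ZETA_3_2 / ZETA_3          # the constant c of (6)

def is_squarefree(m):
    """True if no square greater than 1 divides m."""
    d = 2
    while d * d <= m:
        if m % (d * d) == 0:
            return False
        d += 1
    return True

def count_powerful(x):
    """k(x): sum over squarefree m of floor(sqrt(x / m^3)), as in (4)."""
    total = 0
    m = 1
    while m ** 3 <= x:
        if is_squarefree(m):
            total += math.isqrt(x // m ** 3)
        m += 1
    return total

for x in (10, 100, 10 ** 4, 10 ** 6):
    k = count_powerful(x)
    print(x, k, C * math.sqrt(x), C * math.sqrt(x) - 3 * x ** (1 / 3))
```

For integer x, `math.isqrt(x // m**3)` equals the integer part of $(x/m^3)^{1/2}$, so the loop evaluates the sum in (4) exactly.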
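We have the further identities:


(9) $F(s) = \sum_{r \in K} \dfrac{1}{r^s} = \sum_{n=1}^{\infty} n^{-2s} \cdot \sum_{m=1}^{\infty} \mu^2(m)\,m^{-3s} = \sum_{n=1}^{\infty}\sum_{m=1}^{\infty} |\mu(m)|\,m^{-s}(mn)^{-2s} = \sum_{t=1}^{\infty} t^{-2s} \prod_{p \mid t}\left(1 + p^{-s}\right),$


where we used the substitution t = mn;


(10) $F(s) = \sum_{n=1}^{\infty} n^{-2s} \cdot \sum_{m=1}^{\infty} \mu^2(m)\,m^{-3s} = \zeta(2s)\zeta(3s)/\zeta(6s),$

and

(11) $F(1) = \zeta(2)\zeta(3)/\zeta(6) = \sum_{t=1}^{\infty} \psi(t)/t^2,$

where

(12) $\psi(t) = \prod_{p \mid t}\left(1 + \dfrac{1}{p}\right),$


by setting s = 1 in the previous identities (9) and (10).
Since $\zeta(2) = \pi^2/6$ and $\zeta(6) = \pi^6/945$, we observe


(13) $F(1) = \dfrac{315}{2\pi^4}\,\zeta(3).$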


3. Consecutive powerful numbers. Four consecutive integers cannot all be
powerful, since one of them is twice an odd number. No example of three consecutive
powerful numbers is known, unless one is willing to accept $-1, 0, 1$. If
such an example exists, it must be of the form


4k−1, 4k, 4k+1.


No case of 4k−1 and 4k+1 both being powerful is known. In fact, the only
known example of consecutive odd numbers 2k−1 and 2k+1 both being powerful
is 2k−1 = 25, 2k+1 = 27.

There are two infinite families of examples where two consecutive integers
are powerful which correspond to the solutions of the Pell equations $x^2 - 2y^2 = 1$
and $x^2 - 2y^2 = -1$. (Each of these equations is well known to have infinitely
many solutions [2].)

Let $x_1, y_1$ satisfy $x_1^2 - 2y_1^2 = \pm 1$. Then $8x_1^2y_1^2 = A$ and $(x_1^2 + 2y_1^2)^2 = B$ are consecutive
powerful numbers! In Table I, we see several examples of this type.


TABLE I. Consecutive powerful numbers from solutions of the Pell equations $x^2 - 2y^2 = \pm 1$.


   x              y                A                              B
   1              1                8 = 2^3                        9 = 3^2
   3              2                288 = 2^5 · 3^2                289 = 17^2
   7              5                9800 = 2^3 · 5^2 · 7^2         9801 = 3^4 · 11^2
   17             12               332,928 = 2^7 · 3^2 · 17^2     332,929 = 577^2
   sqrt(B_0)      sqrt(A_0/2)      4 A_0 B_0                      4 A_0 B_0 + 1
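
The rows of Table I can be regenerated from the recurrence for the Pell solutions. The sketch below is illustrative; the names `is_powerful` and `pell_pairs` are hypothetical, and the step $(x, y) \to (x + 2y,\ x + y)$ is simply multiplication of $x + y\sqrt{2}$ by $1 + \sqrt{2}$.

```python
def is_powerful(r):
    """True if p^2 divides r for every prime p dividing r (r >= 1)."""
    p = 2
    while p * p <= r:
        if r % p == 0:
            exponent = 0
            while r % p == 0:
                r //= p
                exponent += 1
            if exponent < 2:
                return False
        p += 1
    # Any factor left over is a prime that divided r exactly once.
    return r == 1

def pell_pairs(count):
    """Return (x, y, A, B) for the first `count` solutions of x^2 - 2y^2 = +-1."""
    x, y = 1, 1
    rows = []
    for _ in range(count):
        a = 8 * x * x * y * y
        b = (x * x + 2 * y * y) ** 2
        rows.append((x, y, a, b))
        x, y = x + 2 * y, x + y  # next solution: multiply by 1 + sqrt(2)
    return rows

for x, y, a, b in pell_pairs(6):
    print(x, y, a, b, is_powerful(a), is_powerful(b))
```

Trial division is adequate for the small values printed here; it would be slow for pairs much further along the sequence.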


If A and B = A+1 are consecutive powerful numbers, and if B is a perfect
square, $B = u^2$, then A = (u−1)(u+1). If u is even, then (u−1, u+1) = 1, and
both u−1 and u+1 are odd powerful numbers. As already remarked, the only
known instance of this phenomenon is u−1 = 25, u+1 = 27, leading to the isolated
example $A = 675 = 3^3 \cdot 5^2$, $B = 676 = 2^2 \cdot 13^2$. If u is odd, then (u−1)/2 and
(u+1)/2 are consecutive integers, with ((u−1)/2, (u+1)/2) = 1. For $u^2 - 1$ to
be powerful, (u−1)/2 and (u+1)/2 must be a powerful odd number and twice
a powerful number, in either order. The two Pell equations produce examples
in both orders. However, an example satisfying neither of these Pell equations
is also known, with $(u-1)/2 = 242 = 2 \cdot 11^2$ and $(u+1)/2 = 243 = 3^5$. This leads
to $A = 235{,}224 = 2^3 \cdot 3^5 \cdot 11^2$ and $B = 235{,}225 = 5^2 \cdot 97^2$.

Whenever A and B are consecutive powerful numbers, so too are A' = 4AB
and $B' = 4AB + 1 = (2A+1)^2$. The solution $x_0 = 1$, $y_0 = 1$, of $x^2 - 2y^2 = -1$ generates
all solutions of the Pell equations $x^2 - 2y^2 = \pm 1$, in the sense that
$x_n + y_n\sqrt{2} = (x_0 + y_0\sqrt{2})^n$ yields the complete set of solutions $(x_n, y_n)$ such that
$x_n^2 - 2y_n^2 = \pm 1$.

Note that the consecutive powerful numbers A = 675, B = 676, come from
the solution x = 26, y = 15, of the Pell equation $x^2 - 3y^2 = 1$, with $A = 3y^2$ and
$B = x^2$. Similarly, the example A = 235,224, B = 235,225 of consecutive powerful
numbers arises from the Pell equation $x^2 - 6y^2 = 1$ with x = 485, y = 198. More
generally, any solution $x_1, y_1$ of the Pell equation $x^2 - dy^2 = \pm 1$, with the extra
condition that $d \mid y_1$, leads to an infinite family of consecutive powerful numbers,
starting with $A_1 = x_1^2$, $B_1 = dy_1^2 = A_1 \mp 1$, and continuing with $A_n = x_n^2$, $B_n = dy_n^2$,
where $(x_n, y_n)$ are obtained from the computation $(x_1 + \sqrt{d}\,y_1)^n = x_n + \sqrt{d}\,y_n$.

Conversely, whenever we have two consecutive powerful integers, if one of
them is a perfect square $x^2$, we can write the other in the form $my^2$, with
m square-free, and we have a solution to the Pell equation $x^2 - my^2 = \pm 1$.

In all cases given thus far of consecutive powerful numbers, the larger number
is a perfect square. However, the Pell equation $x^2 - 5y^2 = -1$ with $5 \mid y$
leads to infinitely many powerful numbers $x^2 + 1 = 5y^2$, such as $x^2 = (682)^2
= 465{,}124$; $5y^2 = 5(305)^2 = 5^3 \cdot 61^2 = 465{,}125$.

One example has been found consisting of two consecutive powerful numbers
where neither is a perfect square: $A = 23^3 = 12{,}167$ and $B = 2^3 \cdot 3^2 \cdot 13^2
= 12{,}168$. It is not known how many such examples (without perfect squares)
exist altogether.


4. Gaps between powerful numbers. The set K of powerful numbers is
closed under multiplication. Since there are infinitely many pairs of powerful
numbers which differ by 1, there are infinitely many pairs of powerful numbers
differing by r, for any $r \in K$.

Every positive integer not of the form 2(2b+1) is a difference of two powerful
numbers in at least one way (specifically, as a difference of two perfect
squares). For numbers of the form 2(2b+1), $b \ge 0$, such representations may
also exist. Thus:


   2 = 3^3 − 5^2                                  30 = 83^2 − 19^3
   6 = ?                                          34 = ?
  10 = 13^3 − 3^7                                 38 = 37^2 − 11^3
  14 = ?                                          42 = ?
  18 = 19^2 − 7^3 = 3^2(3^3 − 5^2)                46 = 17^2 − 3^5
  22 = 7^2 − 3^3 = 47^2 − 3^7                     50 = 5^2(3^3 − 5^2)
  26 = 3^3 − 1^2 = 7^2 · 3^5 − 109^2              54 = 3^4 − 3^3 = 3^3(3^3 − 5^2) = 7^3 − 17^2.


If u and v are both powerful numbers, (u, v) = 1, and a = u − v, we say that
a has a proper representation as a difference of powerful numbers. We observe:


$2b + 1 = (b+1)^2 - b^2$
$8c = (2c+1)^2 - (2c-1)^2$


so that all odd numbers, as well as all multiples of 8, have proper representations.
Among the numbers 2(2b+1) on the range 2(4)54 for which representations
were found, there were proper representations included in every case
except 2(2b+1) = 50. Finally, for numbers 4(2b+1) on the range 4(8)100, we
observe the following proper representations:


   4 = 5^3 − 11^2                    60 = ?
  12 = 47^2 − 13^3                   68 = 3^3 · 5^4 − 7^5
  20 = ?                             76 = 5^3 − 7^2
  28 = ?                             84 = ?
  36 = ?                             92 = ?
  44 = 5^3 − 3^4 = 13^2 − 5^3       100 = 7^3 − 3^5.
  52 = ?

It is interesting that if u and v = u+4 are both powerful, then so too are
u' = uv and $v' = u' + 4 = (u+2)^2$. Thus, from the example $4 = 5^3 - 11^2$, an infinite
number of proper representations of 4 are obtainable. It would be interesting to
determine whether or not any numbers other than 1 and 4 have infinitely many
proper representations.

Among the powerful numbers which are not perfect squares, the smallest
difference known to occur infinitely often is 4. Specifically, the equation
$3x^2 - 2y^2 = 1$ has infinitely many solutions for which $3 \mid x$, such as x = 9, y = 11. For
any such solution, we have $12x^2 - 8y^2 = 4$, where $12x^2$ and $8y^2$ are both powerful,
and neither is a square. The only known instances where the difference between
nonsquare powerful numbers is less than 4 are: $2^7 - 5^3 = 3$, and (as previously
mentioned) $2^3 \cdot 3^2 \cdot 13^2 - 23^3 = 1$.

It has been conjectured that 6 cannot be represented in any way as a differ- 
ence between two powerful numbers. It is further conjectured that there are 
infinitely many numbers which cannot be so represented. 


This work was supported in part by the Air Force Office of Scientific Research under Grant 
AFOSR-68-1555. 


POLYNOMIALS WITH PREASSIGNED VALUES
AT THEIR BRANCHING POINTS


JAN MYCIELSKI, University of Colorado


It is the purpose of this paper to give a simple proof of the following 


THEOREM. Given a sequence of n complex numbers $a_1, \ldots, a_n$ there exists a
polynomial P(x) of degree n+1 and n complex numbers $b_1, \ldots, b_n$ such that the
highest coefficient of P is 1, P(0) = 0, $P(b_i) = a_i$ and $P'(b_i) = 0$ for $i = 1, \ldots, n$.
Moreover (*) if $\beta$ occurs k times in the sequence $b_1, \ldots, b_n$ then $(x - \beta)^{k+1}$ divides
$P(x) - P(\beta)$.


This theorem was announced by J. W. Andrushkiw [0] (in a different formulation)
and proved by R. Thom [8]. Before [8] proofs were published for the
case when $a_i$ are real and satisfy some additional conditions, see A. J. Kempner
[4], Chandler Davis [1], J. Mycielski and S. Paszkowski [6] and W. J. Kammerer
[3]. See also V. S. Videnskyj [10], [11], Chandler Davis [2] and R. Thom
[7] for related studies.

In [1], [6] and [3] it was shown that if $a_i$ are real and satisfy some additional
conditions then P is unique. In our case P is not unique but we can see
analysing the proof which follows that if the numbers $0, a_1, a_2, \ldots, a_n$ are all
different then the number N of such polynomials is a finite constant depending
only on n. I conjecture that $N = (n+1)^n$.

By the theorem of Tarski on the elementary equivalence of algebraic closed
fields of a given characteristic (see e.g. [5] or [9]) the theorem extends to all
algebraic closed fields of characteristic 0. Our proof uses the topology of the
field of complex numbers. Perhaps if a purely algebraic proof was found it
would yield some extensions to other fields e.g. of characteristic $\ne 0$.*

Proof of the Theorem. For any point $b = (b_1, \ldots, b_n)$ of the n-dimensional
complex space $C^n$ we put


(1) $P_b(x) = (n+1)\int_0^x \prod_{i=1}^{n} (t - b_i)\,dt$

and we define the vector-valued function

(2) $\varphi(b) = (P_b(b_1), \ldots, P_b(b_n)).$


* Note added on December 8, 1969. Recently R. E. MacRae found such a proof. Applying the
theorem of Bézout he generalizes the theorem proved in this paper and his generalization is valid
for algebraic closed fields of characteristic p if p does not divide n+1 and implies my conjecture
that $N = (n+1)^n$.

Of course it is enough to prove that $\varphi$ maps $C^n$ onto itself and that (*) holds.
First let us check (*). Indeed it follows directly from (1) that


$d^s(P_b(x) - P_b(\beta))/dx^s$


is divisible by $x - \beta$ for $s = 0, \ldots, k$, i.e., (*) is true.

It follows from (*) that

(3) if $\varphi(b) = (0, \ldots, 0)$ then $b_1 = b_2 = \cdots = b_n = 0$.

Indeed (*) implies then that $P_b(x) = x(x - b_1)\cdots(x - b_n)$ but this is inconsistent
with (1) unless $b_1 = b_2 = \cdots = b_n = 0$.

It is clear from (2) that $\varphi$ is homogeneous of degree n+1, i.e.,

(4) $\varphi(ab) = a^{n+1}\varphi(b)$ for $a \in C$.

Now we will prove that

(5) $\varphi$ is a local homeomorphism at each point b such that $0, b_1, \ldots, b_n$ are all
different.


This is done similarly as in [1] but somewhat simpler. Namely we will show
that the Jacobian determinant of $\varphi$ does not vanish at such points. Of course the
Jacobian matrix $(\partial P_b(b_r)/\partial b_s)$ is given by


(6) $\dfrac{\partial P_b(b_r)}{\partial b_s} = -(n+1)\int_0^{b_r} \prod_{i=1,\ i \ne s}^{n} (t - b_i)\,dt \qquad (r, s = 1, \ldots, n).$


Suppose to the contrary that $\det(\partial P_b(b_r)/\partial b_s) = 0$, i.e., that there are constants
$c_1, \ldots, c_n$ not all 0 such that


(7) $\sum_{s=1}^{n} c_s \dfrac{\partial P_b(b_r)}{\partial b_s} = 0$ for $r = 1, \ldots, n$.


Clearly,


$\sum_{s=1}^{n} c_s \prod_{i=1,\ i \ne s}^{n} (t - b_i)$


is a polynomial which does not vanish for $t = b_j$ if $c_j \ne 0$ and hence the function


$F(x) = -(n+1)\int_0^{x} \sum_{s=1}^{n} c_s \prod_{i=1,\ i \ne s}^{n} (t - b_i)\,dt$


is a polynomial of positive degree $\le n$. But (6) and (7) yield that F vanishes at
n+1 points $0, b_1, \ldots, b_n$ which is a contradiction. Thus (5) is proved.

Now let us replace $(b_1, \ldots, b_n)$ by a variable $[b_0, \ldots, b_n]$ running over
the complex n-dimensional projective space $P_C^n$ (i.e., $b_0, \ldots, b_n$ are complex
numbers not all 0 and $[b_0, \ldots, b_n]$ denotes the set of all sequences
$(cb_0, cb_1, \ldots, cb_n)$ with $c \ne 0$). And modify $\varphi$ to a map $\varphi^*: P_C^n \to P_C^n$ putting

$\varphi^*[b_0, \ldots, b_n] = [b_0^{n+1}, P_b(b_1), \ldots, P_b(b_n)].$


Clearly by (3) and (4) this is a correct definition of a continuous mapping $\varphi^*$
of $P_C^n$ into itself and $\varphi^*$ becomes an extension of $\varphi$ if we identify $(b_1, \ldots, b_n)$
with $[1, b_1, \ldots, b_n]$.

The further argument rests on the following topological Lemma. 


(L) Let f be a continuous mapping of a compact Hausdorff space X into a space
Y and $B \subset X$ be a nonempty set such that f(B) is open and the set Y − f(X − B) is
connected and dense in Y. Then f maps X onto Y.


Proof. X being compact its image is closed in Y, hence it will be enough to
check that f(X) is dense in Y. f(B) is open and the boundary of f(B) is contained
in f(X − B). Therefore the boundary of f(B) does not disconnect Y and hence
f(B) is dense in Y.

Now we apply (L) putting $f = \varphi^*$, $X = Y = P_C^n$, $B = \{[1, b_1, \ldots, b_n] : 0,
b_1, \ldots, b_n$ are all different$\}$. By (5) and since $\varphi^*(P_C^n - B) \subset P_C^n - B$ and B is
open connected and dense in $P_C^n$ all the suppositions of (L) are satisfied. Hence
$\varphi^*$ maps $P_C^n$ onto itself. Let $A = \{[1, b_1, \ldots, b_n] : (b_1, \ldots, b_n) \in C^n\}$, then
$\varphi^*(P_C^n - A) \subset P_C^n - A$. Hence $\varphi^*(A) = A$ and $\varphi(C^n) = C^n$.

REMARK. Some $a_i$ in the theorem may be equal but the corresponding $b_i$
need not be equal. Indeed formula (2) shows that $\varphi(1, 2, 3) = (-9, -8, -9)$.
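
The value quoted in the Remark can be reproduced by evaluating (1) and (2) directly. The sketch below is an illustration with hypothetical helper names (`poly_from_roots`, `P`, `phi`); it expands the product, integrates term by term, and uses exact fractions.

```python
from fractions import Fraction

def poly_from_roots(roots):
    """Coefficients of prod (t - b_i); entry i multiplies t**i."""
    coeffs = [Fraction(1)]
    for b in roots:
        new = [Fraction(0)] * (len(coeffs) + 1)
        for i, c in enumerate(coeffs):
            new[i + 1] += c   # contribution of t * c t^i
            new[i] -= b * c   # contribution of -b * c t^i
        coeffs = new
    return coeffs

def P(b, x):
    """P_b(x) = (n+1) * integral from 0 to x of prod (t - b_i) dt, as in (1)."""
    n = len(b)
    integrand = poly_from_roots(b)
    return (n + 1) * sum(
        c * Fraction(x) ** (i + 1) / (i + 1) for i, c in enumerate(integrand)
    )

def phi(b):
    """phi(b) = (P_b(b_1), ..., P_b(b_n)), as in (2)."""
    return tuple(P(b, bi) for bi in b)

print(phi((1, 2, 3)))
print(phi((2, 4, 6)))  # homogeneity (4): each entry is 2**4 times the one above
```

For b = (1, 2, 3) the polynomial is $x^4 - 8x^3 + 22x^2 - 24x$, which takes the same value at the distinct points 1 and 3.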


MORE BIRTHDAY SURPRISES
MORTON ABRAMSON, York University, Toronto, and W. O. J. MOSER, McGill University, Montreal


There does not seem to appear in the literature a solution to the following
generalization of the well-known birthday problem. Suppose a "year" has n
days. If $k \ge 1$ is given, and p people are chosen at random, what is the probability
$B_k(n, p)$ that the birthdays of every pair are at least k days apart? We obtain
the answer as a special case of a more general result. For other birthday problems
see [1], [2]. Note that the first and last days of the year are considered to
be consecutive, so that, for example, with n = 365, December 29 and January
3 are 5 days apart. Of course


$B_1(365, p) = (365)(364)\cdots(365 - p + 1)/(365)^p$


is the probability that (in an ordinary year) no two of the people have the same
birthday.
The birthdays of the p people are described by a p-permutation


(1) $b_1, b_2, \ldots, b_p$, $b_i \in \{1, 2, \ldots, n\}$, $b_i \ne b_j$ ($i \ne j$).

We seek the number of permutations (1) satisfying

(2) $k \le |b_i - b_j| \le r$, $1 \le i \ne j \le p$,

and this is p! times the number of p-choices

(3) $1 \le x_1 < x_2 < \cdots < x_p \le n$

satisfying

(4) $k \le x_{i+1} - x_i$, $i = 1, 2, \ldots, p-1$, $x_p - x_1 \le r$.

Corresponding to each p-choice (3) satisfying (4) there is a p-choice (obtained
by setting $y_i = x_i - (i-1)(k-1)$, $i = 1, 2, \ldots, p$)

(5) $1 \le y_1 < y_2 < \cdots < y_p \le n - (p-1)(k-1)$

satisfying

(6) $y_p \le r + y_1 - (p-1)(k-1)$


(obtained from $x_p - x_1 \le r$ upon replacing $x_p$ by $y_p + (p-1)(k-1)$ and $x_1$ by
$y_1$). We count these in the two cases: (I) $y_1 \ge n - r$; (II) $y_1 < n - r$. If (I) holds, (6)
is redundant and the p-choices are


$n - r \le y_1 < y_2 < \cdots < y_p \le n - (p-1)(k-1);$

there are

(7) $\dbinom{n - (p-1)(k-1) - n + r + 1}{p}$

of these. If (II) holds, the p-choices satisfy

$1 \le y_1 < n - r$, $y_1 < y_2 < \cdots < y_p \le r + y_1 - (p-1)(k-1);$

there are

(8) $(n - r - 1)\dbinom{r + y_1 - (p-1)(k-1) - y_1}{p-1}$

of these. Adding (7) and (8) we have the

(9) $\dbinom{r - (p-1)(k-1) + 1}{p} + (n - r - 1)\dbinom{r - (p-1)(k-1)}{p-1}$

p-choices (3) satisfying (4). (This proof has been included to make this note
self-contained; the result is contained in [3; formula (37)'] and [4; formula (19)
with l' large and l = n − r].) Hence,

(10) $\dfrac{(p-1)!\,\bigl(n - (p-1)(k - n + r)\bigr)}{n^p}\dbinom{r - (k-1)(p-1)}{p-1}$


is the probability that the birthdays (1) satisfy (2). With r = n − k in (10) we
have


$B_k(n, p) = \dfrac{(p-1)!}{n^{p-1}}\dbinom{n - p(k-1) - 1}{p-1}.$
The values of $1 - B_k(365, p)$ are listed for some values of p and k. For example,
$1 - B_2(365, p)$ is the probability that (in an ordinary year) some pair
have the same or adjacent birthdays. Letting s(k) denote the smallest p for
which $1 - B_k(365, p) \ge \tfrac12$ we see the surprisingly small number of people for
which, with probability $\ge \tfrac12$, some pair have birthdays less than k days apart.


Values of 1 − B_k(365, p):

    p      k = 1      k = 2      k = 3      k = 4
    5     0.02713    0.07971    0.13013    0.17844
    8     0.07433    0.20873    0.32604    0.42812
    9     0.09462    0.26042    0.39901    0.51433
   10     0.11694    0.31472    0.47209    0.59648
   11     0.14114    0.37056    0.54328    0.67210
   12     0.16702    0.42693    0.61090    0.73952
   13     0.19440    0.48287    0.67363    0.79778
   14     0.22310    0.53749    0.73053    0.84663
   21     0.44368    0.83603    0.95537    0.98890
   22     0.47569    0.86378    0.96774    0.99313
   23     0.50729    0.88791    0.97709    0.99586

Values of s(k):

    k      s(k)
    1      23
    2      14
    3      11
    4       9
    5       8
    6       8
    7       7
    8       7
    9       6
   10       6
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ON A PROBLEM OF J. C. MOORE 
ALEX HELLER, City University of New York 


J. C. Moore has proposed an interesting problem concerning the “con- 
tractibility” of continuous maps. 

If X and Y are topological spaces, a map f:X— Y is contractible if X and Y 
have contracting homotopies §::X—-X and y;: YY (i.e., homotopies of the 
identity with a constant map) such that fé;=7,f for all ¢. Thus a contractible 
map has contractible domain and range. Moore asks whether the converse is 
true. 

The following picture may suggest what is involved:


FIG. 1


The map is the projection indicated by the arrow. It is contractible: AB
and CB need only be shrunk simultaneously to B.

Moore further suggests that the map $f: X \to Y$ indicated by the following
picture:


FIG. 2


is not contractible.
This suggestion seems plausible: it is hard to see how the homotopy $\xi_t$
can deform the image of $X$ away from A and D while B and C are covered. But
while A and D are covered so also, by the intermediate value theorem, are B
and C. However, it seems difficult (and is for all I know impossible) to construct
a proof along these lines.

The result may, however, be obtained by the following argument. If $g: U \to V$
define

$$\tilde U = \{(u, u') \mid gu = gu'\} \subset U \times U \quad\text{and}\quad \tilde g(u, u') = gu = gu'.$$

If the contracting homotopies $\theta_t: U \to U$ and $\sigma_t: V \to V$ give a contraction of $g$
then $\tilde\theta_t(u, u') = (\theta_t u, \theta_t u')$ defines a contracting homotopy of $\tilde U$, and $\tilde g\tilde\theta_t = \sigma_t\tilde g$, so
that $\tilde g$ is again contractible. In particular, $\tilde U$ is contractible.


FIG. 3


Now let us apply this construction to the map $f$ of Figure 2. Since $X$ is an
interval, $\tilde X$ is a subspace of the square $X \times X$. One sees by inspection that it is
the subspace indicated by the heavy lines in Figure 3. But this is clearly not
contractible, hence $f$ is not contractible.


THE DYNAMICS OF AVALANCHES
D. R. ENGLISH, Rhodes University, Grahamstown, South Africa


(1) To simplify the problem, we assume that the mountainside where an
avalanche takes place is an inclined plane of slope $\alpha$ to the horizontal, and that
the snow is $b$ thick. We also assume that snow is incompressible, with constant
density $\rho$.


(2) An avalanche can happen in four different ways: 
(i) The snow slides down the slope as a rigid body. 

(ii) A hard lump of snow, ice, or rock rolls down the slope without gathering 
any snow. 

(iii) The snow slides down the slope, collecting itself as it moves. The action 
is similar to that of a carpet pushed along one edge, crumpling without rolling. 

(iv) The snow rolls down the slope, starting from the top. The action is 
similar to that of a carpet, wound up from one edge, without sliding. 


(3) The actual behaviour of avalanches, as observed by the writer in India,
is a varying combination of the above motions. Also, snow is compressible,
slopes vary, and air resistance acts. Therefore, an exact mathematical analysis
of avalanche motion is not feasible. It is of interest, however, to calculate the
acceleration in the above four cases, with the simplifying assumptions of paragraph
(1). We shall refer to them in short as:
(i) Sliding, (ii) Rolling, (iii) Crumpling, (iv) Winding.

It is also necessary to assume that in each case, where applicable, the action
takes place over a constant horizontal width $a$ of snow. This, from observation,
is approximately true.


(4) In what follows, $m$ is the mass of moving snow; $g$ is the acceleration of
gravity; $r$, for cases (ii) and (iv), is the radius of the rolling body; $x$ is the distance
moved by the body's center of mass, down the slope, in time $t$; $\mu$, for cases
(i) and (iii), is the coefficient of friction snow/slope; and $\theta$, for cases (ii) and (iv),
is the angle turned through by the body in time $t$.


(5) (i) Sliding. The acceleration is well known to be $g(\sin\alpha - \mu\cos\alpha)$.

(ii) Rolling. If the body is a uniform cylinder, axis horizontal, the acceleration
is well known to be $\tfrac23 g\sin\alpha$. Accelerations for bodies of other shapes, axisymmetrical
about a horizontal axis, can be calculated without difficulty (e.g.
for a sphere it is $\tfrac57 g\sin\alpha$). They vary between the extreme cases: (1) all mass
concentrated on the axis: $g\sin\alpha$, and (2) hollow cylinder: $\tfrac12 g\sin\alpha$. We may say
that for the average-shaped rock the acceleration is roughly $\tfrac23 g\sin\alpha$. Since we
have assumed incompressibility of the snow, there is no rolling friction.


FIG. 1


(iii) Crumpling (Fig. 1). At time $t$, $m = \rho abx$ and the force acting down the
slope is $mg(\sin\alpha - \mu\cos\alpha) = mgp$, say. (If the distance of the centre of mass
from the slope increases, the normal reaction will be $> mg\cos\alpha$. However the
effect is slight and will be ignored.)

So $mgp = (d/dt)(\text{momentum})$, i.e., $\rho abxgp = (d/dt)(\rho abx\dot x)$, giving $x\ddot x + \dot x^2 - pgx = 0$,
with solution (obtained by writing $\dot x^2 = q$, or by trial with $x = st^2$) to suit
starting conditions: $x = \tfrac16 pgt^2$, so that

$$\ddot x = \frac{g}{3}(\sin\alpha - \mu\cos\alpha).$$


FIG. 2


(iv) Winding (Fig. 2). For large $\theta$ we may treat the rolling body as a cylinder
and $r = b\theta/2\pi$, so that $m = \rho ab^2\theta^2/4\pi$.

About the horizontal line through the instantaneous centre P, the moment
of inertia is

$$I = \tfrac32 mr^2 = \frac{3\rho ab^4\theta^4}{32\pi^3}$$

and the acting torque about this line is $G$, where

$$G = mgr\sin\alpha = \frac{\rho ab^3 g\theta^3\sin\alpha}{8\pi^2}.$$

But $G = (d/dt)[I\dot\theta]$, so simplifying,

$$\left\{\frac{4\pi g\sin\alpha}{3b}\right\}\theta^3 = \frac{d}{dt}(\theta^4\dot\theta).$$

If $\{\ \} = \nu^2$, we get, dividing by $\theta^3$, $\theta\ddot\theta + 4\dot\theta^2 - \nu^2 = 0$ with solution (obtained as
before) to suit starting conditions:

$$\theta = \tfrac12\nu t,$$

so that $\dot x = r\dot\theta$, which on substitution reduces to

$$\dot x = \frac{gt}{6}\sin\alpha, \quad\text{giving}\quad \ddot x = \frac{g}{6}\sin\alpha.$$

(6) In the above calculations, $\mu$ is the coefficient of kinetic friction. As every
skier knows, this is small, in fact it is of a different order from that of the coefficient
of static friction. The effect is due to the Clausius-Clapeyron equation
as applied to ice under pressure. Indeed, the sport of skiing is made possible by
this phenomenon. Therefore, as a first approximation, we may ignore $\mu$ in our
results. If we put $g' = g\sin\alpha$, so that $g'$ is the acceleration of a particle on the
smooth mountain slope, our results for acceleration can be tabulated as follows:

          (i) Sliding   (ii) Rolling   (iii) Crumpling   (iv) Winding
Accn:        $g'$        $\tfrac23 g'$     $\tfrac13 g'$      $\tfrac16 g'$

and the accelerations decrease in the order given.
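The four results can be collected in a short numerical sketch. This is an illustration only: the function names, the default $g = 9.81$ m/s², and the finite-difference check are assumptions introduced here, and the rolling entry uses the uniform-cylinder value.

```python
import math


def avalanche_accelerations(alpha, mu=0.0, g=9.81):
    """Accelerations down a slope of angle alpha (radians) for the four idealized motions."""
    p = math.sin(alpha) - mu * math.cos(alpha)  # the factor called p in case (iii)
    g_slope = g * math.sin(alpha)               # g' in paragraph (6)
    return {
        "sliding": g * p,
        "rolling": 2.0 * g_slope / 3.0,  # uniform cylinder
        "crumpling": g * p / 3.0,
        "winding": g_slope / 6.0,
    }


def crumpling_residual(t, p, g=9.81, h=1e-3):
    """Residual of x*x'' + x'^2 - p*g*x = 0 for the trial solution x = p*g*t^2/6."""
    x = lambda s: p * g * s * s / 6.0
    xd = (x(t + h) - x(t - h)) / (2 * h)             # central first difference
    xdd = (x(t + h) - 2 * x(t) + x(t - h)) / (h * h)  # central second difference
    return x(t) * xdd + xd * xd - p * g * x(t)
```

With `mu=0` the dictionary values stand in the ratio 1 : 2/3 : 1/3 : 1/6, and `crumpling_residual` should vanish up to rounding, confirming that the quadratic trial solution satisfies the crumpling equation.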

(7) The fact that rocks and hard lumps of snow (rolling type) accelerate 
faster than crumpling or winding avalanches is borne out, to a certain extent, 
by observation: such objects appear to go ahead of the main avalanche, al- 
though, in the cloud of fine snow that surrounds an avalanche, it is difficult to 
see precisely what is going on. In any case, anyone watching an avalanche, if 
on a snow slope himself, is not in the right frame of mind to make detached 
scientific observations! 


THE SPECTRAL THEOREM FOR REAL SYMMETRIC MATRICES 


P. C. SHIELDS, Wayne State University 
The purpose of this note is to provide a proof of the


LEMMA. If $T$ is a symmetric linear operator on a finite dimensional real Hilbert
space, then $T$ has a real eigenvalue.


The spectral theorem for real, symmetric matrices is a simple consequence
of this lemma. One standard method for proving the lemma is to detour through
the analogous result for hermitian operators. A second method is to show that
the quadratic form $(Tx, Tx)$ has a nonnegative maximum $\lambda^2$ on the unit sphere
and then that either $\lambda$ or $-\lambda$ is an eigenvalue. The proof given here is quite
different and is a generalization of the following $2 \times 2$ proof:

The characteristic polynomial of

$$A = \begin{pmatrix} a & b \\ b & c \end{pmatrix} \quad\text{is}\quad \lambda^2 - (a + c)\lambda + ac - b^2.$$

The existence of a real eigenvalue follows because the discriminant is nonnegative:

$$(a + c)^2 - 4(ac - b^2) = (a - c)^2 + 4b^2.$$
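The $2 \times 2$ argument translates directly into a computation. The sketch below is illustrative only (the function name is an assumption); it applies the quadratic formula to the characteristic polynomial, using the rewritten discriminant so that the square root is always real.

```python
import math


def symmetric_2x2_eigenvalues(a, b, c):
    """Real eigenvalues of [[a, b], [b, c]] via the quadratic formula."""
    disc = (a - c) ** 2 + 4 * b ** 2  # equals (a + c)^2 - 4(ac - b^2), never negative
    root = math.sqrt(disc)
    return ((a + c - root) / 2, (a + c + root) / 2)
```

Because the discriminant is a sum of squares, `math.sqrt` never receives a negative argument for real inputs, which is the content of the $2 \times 2$ proof.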


Proceeding to the proof of the lemma, let $p$ be the minimum polynomial of
$T$, and suppose $p$ has an irreducible real quadratic factor, that is,

$$p(\lambda) = g(\lambda)h(\lambda), \quad\text{where}\quad g(\lambda) = \lambda^2 + b\lambda + c$$

with

(1)  $b^2 - 4c < 0$.

Put $M = \{x \mid g(T)x = 0\}$ and note that

(2)  $M$ is a nontrivial invariant subspace and $g$ is the minimum polynomial
for the restriction $T/M$ of $T$ to $M$.

Since $p$ doesn't divide $h$ there must be a vector $x$ such that $h(T)x \ne 0$. The
vector $h(T)x$ is then a nonzero vector in $M$. The invariance of $M$ is trivial. If $q$
is the minimum polynomial for $T/M$, then $q$ must divide $g$ for $g(T)x = 0$ for
$x \in M$. The irreducibility of $g$ gives $q = g$ and completes the proof of (2).

Choose $x \in M$ such that $Tx \ne 0$. We have

$$0 = g(T)x = T^2x + bTx + cx$$

so that $0 = (g(T)x, x) = (Tx, Tx) + b(Tx, x) + c(x, x)$ and hence

$$(|Tx|^2 + c|x|^2)^2 = b^2(Tx, x)^2.$$

The Schwarz inequality, combined with (1) and $Tx \ne 0$, implies

$$(|Tx|^2 + c|x|^2)^2 \le b^2|Tx|^2|x|^2 < 4c|Tx|^2|x|^2,$$

so we arrive at the false statement $(|Tx|^2 - c|x|^2)^2 < 0$.
This contradiction establishes that the minimum polynomial $p$ of a symmetric
operator $T$ cannot have irreducible quadratic factors and completes the
proof of the lemma. This proof is similar to that given in the author's Linear
Algebra, Addison-Wesley, 1963.


Query: The Perron-Frobenius Theorem has a simple $2 \times 2$ proof. Can one use
something like the above methods to obtain the general result?


A GENERALIZATION OF NANJUNDIAH’S IDENTITY 
M. T. L. BIZLEY, London, England


In 1958 Nanjundiah (this MONTHLY, page 354) published a combinatorial 
identity which, with a change of notation, may be written: 


® EM). .) =): 


the sum being over all integral values of 7 for which the expression summed does 
not vanish. 
The purpose of this note is to state and prove the two identities: 


m  LEGG MEG de) 
© (EQC MIG da). 


It will be seen that Nanjundiah’s identity (1) is a special case of (2), namely 
when d = 0; also of (3), namely when d =c. 
To prove (2) and (3), we begin with the obvious relation:

(4)  $(1 + x)^m(1 + y)^n[1 + x(1 + y)]^b = (1 + x)^m(1 + y)^n[(1 + x) + xy]^b$.

Expanding each square bracket as a binomial in the particular form shown,
and equating coefficients of $x^p y^q$ (for arbitrary $p$, $q$) on the two sides of (4), we
find:

(5)  $\sum_i \binom{b}{i}\binom{m}{p-i}\binom{n+i}{q} = \sum_i \binom{b}{i}\binom{m+b-i}{p-i}\binom{n}{q-i}$.

This is true for all positive integers $b, m, n, p, q$, with the usual convention that
$\binom{u}{v} = 0$ if $u < v$ or $v < 0$. As it stands, (5) is not interesting because each side is a
sum. However, by assigning suitable values to $p$ and $q$, we can reduce one side
of (5) to a single term, as we now show.

To derive (2), we put m=a; n=c; p=a—c; q=b—d. Then (5) becomes: 


© (29659-20622) 
(OED -G29CE29 


( —_ ) ( ) 


(62 JEQQ) 


EO) Mov) GMa) 


which is (2). Again, to derive (3), we return to (5) and put m=c; n=a; p=d; 
g=b-+c. Then (5) becomes: | 


» BQ I6E)-2OCT IG 


But since 


( c+i1 )( a ) =( a yOu 
c-dt+b/\c+i/ \o+ce-d a—c—i 


identically, the right side of (7) is 


Hence, 
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(4) Qe) Ga Oe) 
“(Gar a 5): 


=()(, - Moa) - (, bee J a 


which is (3). 
CONVERGENCE OF PICARD'S METHOD FOR $|\lambda| > |\lambda_1|$


J. W. BURGMEIER, University of Vermont,
AND M. R. SCOTT, Sandia Laboratories


In many textbooks [1, p. 142; 2, p. 263; 3, p. 281] it is stated that the radius
of convergence of Picard's method for a nonhomogeneous integral equation is
equal to the absolute value of the smallest characteristic value, while others [4]
simply state that the iteration method converges uniformly for $|\lambda| < |\lambda_1|$ without
mentioning what happens when $|\lambda| > |\lambda_1|$. (In order to distinguish between
eigenvalues and reciprocal eigenvalues, we suggest, as have others [5, 6], the
following: A characteristic value is a scalar $\lambda$ such that

$$\phi(x) = \lambda\int_0^1 K(x, y)\phi(y)\,dy$$

possesses a nontrivial solution $\phi$; i.e., $\lambda$ is the reciprocal of an eigenvalue.) We
shall prove that, under certain circumstances, the radius of convergence can be
much larger than $|\lambda_1|$.

Let us consider an integral equation of the form

(1)  $u(x) = f(x) + \lambda\int_0^1 K(x, y)u(y)\,dy$,

where $K(x, y)$ is a symmetric $L_2$-kernel on the rectangle $0 \le x, y \le 1$ and $f(x) \in L_2(0, 1)$.
We denote the eigenfunctions and characteristic values (counting
multiplicities) of the homogeneous equation (i.e., when $f(x) = 0$) by $\phi_n(x)$ and
$\lambda_n$, respectively. In addition, we let

(2)  $\psi(x; \lambda) = f(x) + \sum_{n=1}^{\infty}\lambda^n K^n f(x)$,

where $K^n f(x) = \int_0^1 K^{(n)}(x, y)f(y)\,dy$, denote the Neumann series for (1), which
is generated by the usual Picard successive approximation process with $f(x)$ as
the initial approximation.

THEOREM. Let $K(x, y)$ be such that

(3)  $K(x, y) = \sum_{j=1}^{\infty}\frac{\phi_j(x)\phi_j(y)}{\lambda_j}$

converges uniformly and absolutely on $0 \le x, y \le 1$. Let the function $f(x)$ be such that
$(f, \phi_j) = 0$, $j = 1, 2, \ldots, k-1$. Then the Neumann series (2) converges for all
$|\lambda| < |\lambda_k|$, and

(a) if $\lambda \notin \{\lambda_1, \lambda_2, \ldots, \lambda_{k-1}\}$, then $\psi(x; \lambda)$ is the unique solution to (1),

(b) if $\lambda = \lambda_j$, $1 \le j \le k-1$, then $\psi(x; \lambda)$ is the unique solution to (1) which is also
orthogonal to all of the eigenfunctions corresponding to $\lambda_j$.


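Proof: Since (3) converges absolutely and uniformly, so does

$$K^{(n)}(x, y) = \sum_{j=1}^{\infty}\frac{\phi_j(x)\phi_j(y)}{\lambda_j^n}, \quad n = 1, 2, \ldots.$$

Thus

$$K^n f(x) = \int_0^1 K^{(n)}(x, y)f(y)\,dy = \sum_{j=k}^{\infty}\frac{(f, \phi_j)}{\lambda_j^n}\phi_j(x),$$

where we have used the uniform convergence and $f \perp \phi_1, \phi_2, \ldots, \phi_{k-1}$. Hence

$$\sum_{n=1}^{\infty}\lambda^n K^n f(x) = \sum_{n=1}^{\infty}\sum_{j=k}^{\infty}\lambda^n\frac{(f, \phi_j)}{\lambda_j^n}\phi_j(x) = \sum_{j=k}^{\infty}(f, \phi_j)\phi_j(x)\sum_{n=1}^{\infty}\left(\frac{\lambda}{\lambda_j}\right)^n.$$

For $|\lambda| < |\lambda_k| \le |\lambda_j|$, $j \ge k$, we can sum the second series to obtain

(4)  $\psi(x; \lambda) - f(x) = \sum_{n=1}^{\infty}\lambda^n K^n f(x) = \lambda\sum_{j=k}^{\infty}\frac{(f, \phi_j)}{\lambda_j - \lambda}\phi_j(x)$.

From the uniform convergence and the condition $f \perp \phi_1, \phi_2, \ldots, \phi_{k-1}$, it is
straightforward to show that $\psi(x; \lambda)$ does satisfy (1). Also for

$$\lambda \notin \{\lambda_1, \lambda_2, \ldots, \lambda_{k-1}\},$$

the solution is unique [1, p. 116].

When $\lambda = \lambda_j$, $1 \le j \le k-1$, the conditions $f \perp \phi_1, \phi_2, \ldots, \phi_{k-1}$ insure that there
does exist a solution to (1). However, it is not unique. But, by requiring that the
solution $\psi$ be orthogonal to all of the eigenfunctions of $\lambda_j$, we do obtain a unique
solution [1, p. 116]. For $i = 1, 2, \ldots, k-1$, we have

$$(\psi, \phi_i) = (f, \phi_i) + \lambda\sum_{j=k}^{\infty}\frac{(f, \phi_j)}{\lambda_j - \lambda}(\phi_j, \phi_i) = 0.$$

Thus the theorem is proved.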
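The mechanism of the proof can be seen in a finite-dimensional sketch. The code below is an illustration under stated assumptions, not the authors' computation: it works with the coefficients of $f$ in an orthonormal eigenbasis, where $K$ multiplies the $j$-th coefficient by $1/\lambda_j$, and the names `neumann_partial_sum` and `closed_form` are hypothetical.

```python
def neumann_partial_sum(coeffs, char_values, lam, terms):
    """Eigenbasis coefficients of f + sum_{n=1}^{terms} lam^n K^n f.

    K scales the j-th coefficient by 1 / char_values[j], so the n-th term
    of the series scales it by (lam / char_values[j]) ** n.
    """
    return [c * sum((lam / lj) ** n for n in range(terms + 1))
            for c, lj in zip(coeffs, char_values)]


def closed_form(coeffs, char_values, lam):
    """Coefficients predicted by (4): c_j + lam * c_j / (lambda_j - lam)."""
    return [0.0 if c == 0 else c + lam * c / (lj - lam)
            for c, lj in zip(coeffs, char_values)]
```

With characteristic values 1 and 4 and $\lambda = 2$, a function with no component along the first eigenfunction gives partial sums that settle on the closed form, while any nonzero first component makes them blow up, in line with the hypothesis $(f, \phi_j) = 0$ for $j < k$.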
We now mention a few consequences of the above theorem. If $K$ is nondegenerate
(i.e., possessing an infinite number of eigenvalues) and if the eigenfunctions
form a complete orthonormal system in $L_2(0, 1)$, then there does not
exist an $f$ such that $(f, \phi_j) = 0$ for all $j$, except $f = 0$. As a consequence, the Picard
method cannot converge for all $\lambda$ in this case.

On the other hand, for degenerate kernels and nondegenerate kernels whose
eigenfunctions do not form a complete orthonormal system in $L_2(0, 1)$, there
exists a large class of functions $f$ which are orthogonal to all of the eigenfunctions
and, for such $f$, Picard's method converges for all $\lambda$. In fact, the convergence is
trivial, since any function which is orthogonal to all of the eigenfunctions of a
symmetric kernel is also orthogonal to the kernel itself [4, p. 106]. Thus $\psi = f$.

A portion of this work was supported by the U. S. Atomic Energy Commission.


RESEARCH PROBLEMS


EDITED BY VICTOR KLEE


In this Department the Monthly presents easily stated research problems dealing with notions
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied
by relevant references (if any are known to the author) and by a brief description of known
partial results. Manuscripts should be sent to Victor Klee, Department of Mathematics,
University of Washington, Seattle, WA 98105.


CAN EVERY TRIANGLE BE DIVIDED INTO n TRIANGLES SIMILAR TO IT? 


R. W. FREESE, ANN K. MILLER, Saint Louis University, and 
ZALMAN USISKIN, University of Chicago 


DEFINITION. Given any triangle ABC and natural number $n$, then $n$ is an
acceptable number for triangle ABC if there exists a decomposition of the triangle
into $n$ subtriangles, with each subtriangle similar to triangle ABC. Thus,
for every triangle, 1 is an acceptable number. Further, given any triangle ABC
with M the midpoint of side AB, N the midpoint of side BC, and P the midpoint
of AC, then the subdivision of ABC into subtriangles AMP, BMN, CNP, and
MNP yields four triangles each similar to the original one. Since any one
subtriangle may itself be subdivided in the same way, which adds three to the
count, for every triangle 1, 4, 7, ... are acceptable numbers.


FIG. 1


Again, let ABC be any triangle
with base BC. On side AB mark off a point M such that the length of the segment
MB is one-third of the length of side AB. Construct the line through M
parallel to side BC and let N be the point on side AC cut by the parallel line.
Next construct the following sequence of points: let $P_1$ be the point on side BC
cut by the line through M parallel to side AC, let $P_2$ be the point of intersection
of MN and the line through $P_1$ parallel to AB, etc. Then it can be shown that
$P_4$ coincides with N and that each resulting subtriangle is similar to the original
one. Therefore, 6, 9, 12, ... are acceptable numbers for every triangle. In an
analogous fashion, if the point M is chosen on side AB such that the length of
MB is one-fourth of the length of AB and if the points $P_1$ to $P_5$ are constructed
in a manner similar to the preceding method, then it can be shown that 8, and
hence 11, 14, 17, ... are acceptable for every triangle. Thus, we have shown
the following:


THEOREM. For any natural number $n$, $n \ne 2$, 3 or 5, there exists a decomposition
of any triangle into $n$ subtriangles similar to it.
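The counting behind the theorem is easy to check mechanically. The sketch below is an illustration only (the function name is an assumption): it starts from the three base constructions with 1, 6, and 8 subtriangles and repeatedly adds 3, which corresponds to cutting one subtriangle into four.

```python
def acceptable_by_constructions(limit):
    """Numbers up to `limit` reachable from 1, 6 and 8 by repeatedly adding 3."""
    reachable = set()
    for base in (1, 6, 8):
        reachable.update(range(base, limit + 1, 3))
    return reachable


# Numbers not produced by these constructions for general triangles.
missing = set(range(1, 101)) - acceptable_by_constructions(100)
```

The three progressions cover the residue classes 1, 0, and 2 modulo 3 from 1, 6, and 8 onward, so only 2, 3, and 5 are left out.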


FIG. 2


Consider now the special case of a right triangle. Let ABC be any such
triangle, with the angle at A the right angle. If AM is the altitude to base BC,
then the decomposition of triangle ABC into subtriangles ABM and ACM yields
two triangles similar to the original triangle. Therefore, for right triangles, every
natural number is acceptable. On the other hand, if a triangle has 2 as an acceptable
number, then since the decomposition must be accomplished by joining
a vertex to an opposite side, an inspection of the cases reveals that the triangle
must be a right triangle. Consider next the case $n = 3$. We first note that there
are only 4 essentially different ways of dividing a triangle into 3 triangles. (The
first three are the only ways if one side of a smaller triangle joins a vertex to the
opposite side; the latter is the only other possibility.) They are drawn here.


(a) (b) (c) (d) 


In cases (a), (b), and (c), $m\angle 1 + m\angle 2 = 180^\circ$. If the smaller triangles are
to be similar to $\triangle ABC$, then angles of $\triangle ABC$ must be congruent to angles 1
and 2. If angles 1 and 2 are congruent to different angles of $\triangle ABC$, then two
angles of $\triangle ABC$ add to $180^\circ$, which is impossible. So angles 1 and 2 must be congruent
to the same angle of $\triangle ABC$, so $\angle 1 \cong \angle 2$, in which case they are right
angles and $\triangle ABC$ is a right triangle.

In case (d), suppose angle C (without loss of generality) is the largest angle
of $\triangle ABC$. It is well known that $\angle 3 > \angle C$. Hence the triangle with $\angle 3$ in it
could not be similar to the original triangle.

Hence, only a right triangle can be divided into three triangles, all similar 
to each other and to it. 

These preliminaries lead naturally into the following questions: Does the 
set of acceptable numbers of a triangle determine its shape; e.g. if a triangle has 
5 as acceptable, must it be a right triangle? Conversely, given the shape of a 
triangle, what is the set of acceptable numbers for it? 


A PROBLEM OF ILLUMINATION
L. FEJES TÓTH, Hungarian Academy of Sciences


Along an infinite straight road we want to erect equal lamps with a given
average number per mile. How should the lamps be distributed so as to maximize
the illumination of the road at the worst illuminated point?

More precisely, let $f(x)$ be the illumination function defined for $x \ge 0$ and
giving the illumination of the road due to one lamp at a distance $x$ from its foot.
The function $f(x)$ is supposed to be continuous, nonnegative, nonincreasing and
such that

$$\int_0^{\infty} f(x)\,dx < \infty.$$

If $\cdots < x_{-2} < x_{-1} < x_0 < x_1 < x_2 < \cdots$ are the coordinates of the lamps then the
(upper) density $D$ of the lamps is defined by

$$D = \limsup_{R \to \infty} N_R/2R,$$

where $N_R$ is the number of the lamps in the interval $(-R, R)$. If $D$ is finite then
so is the illumination of the road $I(x) = \sum_i f(|x - x_i|)$ at any point $x$. We want
to find

$$M(f, D) = \sup\inf_x I(x)$$

for all possible arrangements of the lamps with the prescribed density $D$.
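The quantity $\inf_x I(x)$ for a given arrangement can be estimated numerically. The sketch below is illustrative only: the function names, the sampling grid, and in particular the triangular lamp profile $f(x) = \max(0, 1 - x)$ are assumptions chosen for simplicity and are not taken from the problem statement.

```python
def illumination(x, lamps, f):
    """I(x): sum of f(|x - x_i|) over the lamp coordinates x_i."""
    return sum(f(abs(x - xi)) for xi in lamps)


def worst_illumination(lamps, f, lo, hi, steps=1000):
    """Minimum of I(x) over an evenly spaced sample of [lo, hi]."""
    return min(illumination(lo + (hi - lo) * i / steps, lamps, f)
               for i in range(steps + 1))


def triangular(x):
    """Hypothetical lamp profile with finite reach 1."""
    return max(0.0, 1.0 - x)


# Unit-spaced lamps; one period [0, 1] far from the ends of the finite row.
lattice = list(range(-5, 6))
worst = worst_illumination(lattice, triangular, 0.0, 1.0)
```

For this profile the two nearest lamps contribute $(1 - x) + x$, so the unit lattice gives constant illumination 1. Comparing `worst_illumination` across arrangements of equal density is a direct, if crude, way to explore $M(f, D)$.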

This problem was raised by the author at the Colloquium on Discrete 
Geometry in Oberwolfach in 1962. L. Danzer soon observed that it is not always 
the point-lattice (equidistant distribution) which gives the solution of the prob- 
lem. The following example is due to A. Heppes [Egy egydimenzios probléma, 
Mat. Lapok, 14 (1963) 124-127]: Let f(x) be defined by 


1 — dz, 0Oxx“<1 
f(") = 43 — %, lSn<3 
0 <= x, 


“ee 


If the lamps are equally spaced with D=1 then min, /(x) =3/2. On the other 
hand, placing the lamps at the points +4, +2+4, +4+4,--- we obtain a 
constant illumination I(x) = ~>8. 

Since the determination of M(f, D) for any f and D, by a formula or any 
other sort of general instruction, seems to be extremely difficult, we give our 
problem the following definitive form: Prove or disprove the conjecture that for 
any f and D, M(f, D) can be attained by a distribution consisting of the union of 
a finite number of congruent point-lattices. 


CLASSROOM NOTES 


EDITED BY DAVID DRASIN


Manuscripts for this Department should be sent to David Drasin, Division of Mathematical
Sciences, Purdue University, Lafayette, IN 47907.


A NEW PROOF OF THE EXISTENCE OF THE FREE GROUP
N. C. MEYER, JR., The University of Oregon


The following is a new and simpler proof of the existence of a free group on 
an arbitrary set. 


DEFINITION. Let $A$ be a set. A free group on $A$ is a group $F$ and a function
$f: A \to F$, such that if $G$ is any group and $g: A \to G$ a function, then there is a unique
homomorphism $h: F \to G$ such that $g = hf$.

Intuitively, $F$ consists of the elements of $A$, formal symbols $a^{-1}$ where $a \in A$,
and all finite chains ("words") of $a$'s and $a^{-1}$'s. One defines multiplication to be
juxtaposition (defined below), then one "reduces" words by cancelling out any
occurrence of $aa^{-1}$ or $a^{-1}a$. The usual proofs that $F$ is a group (especially showing
associativity) involve a lengthy case by case argument. The unreduced words
form a semigroup (a set with an associative operation) upon which we wish to
define an equivalence relation, reduction; in the present proof, we avoid the
consideration of cases by defining the equivalence relation in a new way. First
we need two lemmas concerning semigroups.
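The intuitive picture of reduction can be made concrete. The sketch below shows the usual reduced-word approach that the note sets out to avoid, not the construction used in its proof. The encoding of a letter as a `(symbol, exponent)` pair with exponent +1 or -1, and the function names, are assumptions made for the example.

```python
def reduce_word(word):
    """Cancel adjacent pairs a a^-1 and a^-1 a until none remain."""
    stack = []
    for symbol, exponent in word:
        if stack and stack[-1] == (symbol, -exponent):
            stack.pop()  # the new letter cancels the last one kept
        else:
            stack.append((symbol, exponent))
    return tuple(stack)


def multiply(u, v):
    """Juxtapose two words, then reduce the result."""
    return reduce_word(tuple(u) + tuple(v))
```

A single left-to-right pass with a stack suffices because a cancellation can only expose a new cancellable pair at the top of the stack.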


LEMMA 1. Let $A$ be a nonempty set; let $X$ be the set of all finite chains
$(a_1 a_2 \cdots a_n)$, where each $a_i \in A$; define an operation on $X$ by:
$(a_1 \cdots a_n)(b_1 \cdots b_m) = (a_1 \cdots a_n b_1 \cdots b_m)$. This operation makes $X$ a semigroup (called
the free semigroup); define $j: A \to X$ by $j(a) = (a)$, the chain of length one. If $Y$ is any
semigroup and $k: A \to Y$ a function, then there is a unique homomorphism $f: X \to Y$
such that $k = fj$.


Proof. Given any $a_i$'s, $b_i$'s, and $c_i$'s in $A$,

$$[(a_1 \cdots a_n)(b_1 \cdots b_m)](c_1 \cdots c_p) = (a_1 \cdots a_n b_1 \cdots b_m)(c_1 \cdots c_p)$$
$$= (a_1 \cdots a_n b_1 \cdots b_m c_1 \cdots c_p)$$
$$= (a_1 \cdots a_n)[(b_1 \cdots b_m)(c_1 \cdots c_p)].$$

So $X$ is a semigroup. If $k: A \to Y$, define $f: X \to Y$ by $f((a_1 \cdots a_n)) = k(a_1) \cdots k(a_n)$.
Then $f$ is a homomorphism since

$$f((a_1 \cdots a_n)(b_1 \cdots b_m)) = [k(a_1) \cdots k(a_n)][k(b_1) \cdots k(b_m)],$$

and $fj = k$. If $gj = k$ also, then $g(a_1 \cdots a_n) = g((a_1) \cdots (a_n)) = k(a_1) \cdots k(a_n)
= f(a_1 \cdots a_n)$, so $f$ is unique.
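Lemma 1 can be mirrored in code by modelling chains as nonempty tuples. This is a sketch under assumptions: the names `juxtapose` and `extend` are hypothetical, and the target semigroup is supplied as a binary operation `op` that the caller promises is associative.

```python
from functools import reduce
import operator


def juxtapose(u, v):
    """The operation of the free semigroup: concatenate two chains."""
    return u + v


def extend(k, op):
    """Return f with f((a1, ..., an)) = k(a1) op ... op k(an)."""
    def f(chain):
        if not chain:
            raise ValueError("chains in the free semigroup are nonempty")
        return reduce(op, (k(a) for a in chain))
    return f


# Example: letters sent to positive integers under multiplication.
weights = {"a": 2, "b": 3, "c": 5}
f = extend(weights.__getitem__, operator.mul)
```

Here `f` agrees with `k` on chains of length one and turns juxtaposition into `op`, which is exactly the property $k = fj$ together with the homomorphism condition.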


LEMMA 2. If $Y$ is a semigroup and $*$ is an equivalence relation on $Y$ such that
$x * y$ and $z * w$ imply $xz * yw$, then $Y/{*}$ is a semigroup, and $p: Y \to Y/{*}$ is a homomorphism.
If $h: Y \to Z$ is a semigroup homomorphism such that $x * y$ implies $h(x) = h(y)$,
then there is a unique homomorphism $\bar h: Y/{*} \to Z$ such that $\bar h p = h$.


Proof. Let $\bar x = p(x) =$ the equivalence class of $x$. Define $\bar x\bar y = \overline{xy}$. If $x * y$ and
$z * w$ then $\overline{xz} = \overline{yw}$ since $xz * yw$, so this operation is well defined. Next
$(\bar x\bar y)\bar z = \overline{xy}\,\bar z = \overline{xyz} = \bar x(\bar y\bar z)$, so $Y/{*}$ is a semigroup. Now $p(xy) = \overline{xy} = \bar x\bar y = p(x)p(y)$, so $p$
is a homomorphism. If $h: Y \to Z$ such that $x * y$ implies $h(x) = h(y)$, define $\bar h$ by
$\bar h(\bar x) = h(x)$. $\bar h$ is well defined since $\bar x = \bar y$ implies $x * y$, which implies $h(x) = h(y)$.

$$\bar h(\bar x\bar y) = \bar h(\overline{xy}) = h(xy) = h(x)h(y) = \bar h(\bar x)\bar h(\bar y)$$

so $\bar h$ is a homomorphism. Finally if $f: Y/{*} \to Z$ with $fp = h$, then
$f(\bar x) = f(p(x)) = h(x) = \bar h(\bar x)$, so $f = \bar h$.


THEOREM. If $A$ is any nonempty set, there is a free group on $A$.

Proof. Let $A' = \{a' \mid a \text{ in } A\}$, where we assume $A \cap A' = \emptyset$ (to be strictly
correct replace $A$ by $\{(a, 1) \mid a \text{ in } A\}$, and $A'$ by $\{(a, -1) \mid a \text{ in } A\}$). Let $X$ be
the free semigroup on the set $A \cup A'$ as defined in Lemma 1, with $j: A \cup A' \to X$.
We write $j(a) = a$ and $j(a') = a'$.

Define the relation $*$ on $X$ by: $x * y$ iff $g(x) = g(y)$ for all semigroup homomorphisms
$g: X \to G$ satisfying:

(#)  $G$ is a group, and $g(a') = g(a)^{-1}$ for all $a$ in $A$.

$*$ is an equivalence relation since for all $g$ satisfying #, and all $x, y, z$ in $X$,
$g(x) = g(x)$;
$g(x) = g(y)$ implies $g(y) = g(x)$;
$g(x) = g(y)$ and $g(y) = g(z)$ imply $g(x) = g(z)$.

If $x * y$ and $z * w$, then for all $g$ satisfying #, $g(xz) = g(x)g(z) = g(y)g(w) = g(yw)$,
so $xz * yw$. Thus, by Lemma 2, $X/{*}$ is a semigroup and $p: X \to X/{*}$ is a homomorphism.
We shall show that $X/{*}$ is a group and that $pj|_A: A \to X/{*}$ is a free
group on $A$. Pick $a$ in $A$, let $x$ be in $X$, then for any $g$ satisfying #, $g(aa') = 1$, so
$g(xaa') = g(x)$. So $x * xaa'$ or $\bar x = \bar x(\overline{aa'})$, and similarly $\bar x = (\overline{aa'})\bar x$. Therefore $\overline{aa'}$ is
the identity of $X/{*}$. Now if $x$ is in $X$, $x = (b_1 \cdots b_n)$, where each $b_i$ is in $A \cup A'$.
For $a$ in $A$, denote $a'' = a$; let $y = (b_n' \cdots b_1')$. For $g$ satisfying #,

$$g(xy) = g(b_1) \cdots g(b_n)g(b_n') \cdots g(b_1') = g(b_1) \cdots g(b_n)g(b_n)^{-1} \cdots g(b_1)^{-1} = g(x)g(x)^{-1} = 1 = g(aa')$$

so $xy * aa'$, i.e., $\bar x\bar y = 1$ in $X/{*}$. Similarly, $\bar y\bar x = 1$, and so $X/{*}$ is a group. Note that
$p(a') = p(a)^{-1}$.

Now let $k: A \to G$, where $G$ is a group; extend $k$ to $k': A \cup A' \to G$ by
$k'(a') = k(a)^{-1}$ for $a$ in $A$. By Lemma 1, there is a unique homomorphism $h: X \to G$ such
that $hj = k'$. Note that

$$h(a') = h(j(a')) = k'(a') = k'(a)^{-1} = h(a)^{-1}$$

and so $h$ satisfies #, and therefore $x * y$ implies $h(x) = h(y)$. By Lemma 2 there is
a unique homomorphism $\bar h: X/{*} \to G$ so that $\bar h p = h$ (since $X/{*}$ and $G$ are groups,
$\bar h$ is a group homomorphism). Thus $\bar h(pj) = hj = k'$ and $\bar h(pj|_A) = k$. Furthermore
$\bar h$ is unique since if $f(pj|_A) = k$, then $(fp)j = k'$ since

$$fpj(a') = f(p(a)^{-1}) = (fpj(a))^{-1} = k(a)^{-1} = k'(a');$$

and so by Lemma 1, $fp = h$, and by Lemma 2, $f = \bar h$. Therefore, $pj|_A: A \to X/{*}$ is
a free group on the set $A$.


REMARK: The free product (coproduct) of a set $\{G_\alpha\}$ of groups can also be
obtained in this way. Let $j: \bigcup_\alpha G_\alpha \to X$ be the free semigroup on $\bigcup_\alpha G_\alpha$ and
$j_\alpha = j|_{G_\alpha}: G_\alpha \to X$. Define $x * y$ in $X$ iff $h(x) = h(y)$ for all homomorphisms
$h: X \to H$, where $H$ is a group and $hj_\alpha: G_\alpha \to H$ is a homomorphism for all $\alpha$. As above,
one checks that $X/{*}$ is a group, that $pj_\alpha: G_\alpha \to X/{*}$ is a monomorphism, and that
$(X/{*}, \{pj_\alpha\})$ satisfies the appropriate universal diagram.


IDEMPOTENCE IN CARDINAL ARITHMETIC
R. D. KOPPERMAN, City College of New York


We present a proof of idempotence in cardinal multiplication which is
shorter than existing proofs but uses more properties of ordinal arithmetic.

Suppose ordinals have been defined as well as cardinals (= initial ordinals)
and we have defined $<$, $\le$, $=$ for ordinals, $\sim$ (equipollence), $\preceq$, and $\prec$. We also
assume it known that for any two ordinals, $\alpha \le \beta$ or $\beta < \alpha$, and $\alpha \le \beta$ if and only
if $\alpha \subset \beta$ (from which we obtain the fact that for any two ordinals, $\alpha \preceq \beta$ or $\beta \preceq \alpha$),
and we use the Schröder-Bernstein Theorem.

Let X bea well-ordered set, Y. well-ordered for each xX. Then by Srex Y; 
we mean UsexY.X {x}, well-ordered by (y, x)<(y’, x’) iff x<x’ (in the given 
well-ordering of X) or x =x’ and y<y’ (in the given well-ordering of Y;). It is 
well known and easy to check that the above actually defines a well-ordering. If 
a is an ordinal and for y€a, 8, is an ordinal, then by > y<a3, we mean the 
ordinal order-isomorphic to S,éa8,. If w and B are ordinals, then a B= ) y<cBy 
where 8,=8 for each y <a. 

From the above definition it is clear that 


D, By © U By X {yr} 


<a <a 
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and therefore aB~a XP, a+a=a X2. Also clear is the fact that if @ is a limit 
ordinal, 


>, By = U > Bs. 


<a y¥<a b<y 
If A is well-ordered and {C, 
UCax U Gx {a} 


aGA aGA 


aca i a collection of sets, then 


by the map f(y) = (4, 0) for 6 the first element of A such that yCC;. Finally note 
that in the proof to follow we do not assume it known that all infinite cardinals 
are limit ordinals; if this fact were assumed, the second paragraph of the proof 
might be eliminated. 

THEOREM. If $\alpha$ is an infinite ordinal then $\alpha \sim \alpha^2$.


Proof: We have axaX1<aXa~a? so if our theorem is false there is an 
infinite ordinal, thus a least infinite ordinal a such that a<a?. Our a must bea 
cardinal, for if B<a, Ba, then 


atz= pz Bra. 
This @ cannot be a successor, for if 8-+1=a, then B<a,so: 
BxBtIixB+Br=BX2KBXBTHP HB 


thus a ~B, contradicting the fact that a is a cardinal. 
However if a is a cardinal and a limit ordinal, then 


aXa=UBXB<KUBXBX {8} ~ eta U ( Xe"). 
B<a B<a B<a y<a \ B<y 
Thus sincea~<a?, a-< > pcalS?, soa< ><a’, 1.e., for somey <a@ 
a = » B?, 


B<y 


so a€y’*, thus axy* for some y <a. However, this is clearly impossible since 
if y <w, y?><wx<a, otherwise 


V=)Y = 17 = 7 <a. 
This is a contradiction to a<a? and establishes our theorem. 


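COROLLARY. (a) For every infinite ordinal $\alpha$, $\alpha \sim \alpha + \alpha$.
(b) For every infinite ordinal $\alpha$, $\alpha \sim \alpha + 1$. Thus each infinite cardinal is a limit
ordinal.


Proof. If $\alpha$ is an infinite ordinal,

$$\alpha \preceq \alpha + 1 \preceq \alpha + \alpha \sim \alpha \times 2 \preceq \alpha \times \alpha \sim \alpha^2 \sim \alpha,$$

so $\alpha \sim \alpha + 1 \sim \alpha + \alpha$.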


I wish to acknowledge the help of Professor M. Zuckerman and an unknown referee.

should be sent to E. P. Starke, 1000 Kensington Ave., Plainfield NJ 07060. Proposers of

ELEMENTARY PROBLEMS


Solutions of Elementary Problems should be sent to Problems Group, Mathematics Department,
University of Maine, Orono, ME 04473. To facilitate their consideration, solutions of
Elementary Problems in this issue should be typed (with double spacing) and should be mailed
before January 31, 1971. Contributors (in the United States) who desire acknowledgment of
receipt of their solutions are asked to enclose self-addressed stamped postcards.


E 2253. Proposed by L. A. Gehamt1, Glastonbury, Connecticut 


Find the number $S_n$ of distinct $n$-tuples $(a_1, a_2, \ldots, a_n)$ such that $a_i$ is a
nonnegative integer, $i = 1, 2, \ldots, n$; $\sum_{i=1}^{n} a_i = n$; $\sum_{i=1}^{k} a_i \ge k$, $k = 1, 2, \ldots, n$.


E 2254. Proposed by Marvin Marcus, University of California at Santa Bar- 
bara 


Let $H$ be an $n \times n$ hermitian matrix and let $|H|$ be the matrix obtained from
$H$ by replacing each entry by its absolute value. Show that if $H \ge 0$ (i.e., $H$ is
positive semi-definite) and $n \le 3$, then $|H| \ge 0$. Show that for each $n \ge 4$ there
exists an $H \ge 0$ such that $|H|$ is indefinite.


E2255. Proposed by T. E. Mott, State University College, Buffalo, N. Y. 


Let $f(x_1,\ldots,x_n)$ be a real valued function defined on an open set $G\subset R^n$
and let $v_i=(\lambda_{i1},\ldots,\lambda_{in})$, $i=1,\ldots,n$, be linearly independent vectors in
$R^n$. If the function $f$ is continuous along that portion of every line passing
through $G$ and parallel to $v_i$, $i=1,\ldots,n$, and $f$ is monotonic along each of
these lines (the direction of monotonicity depending upon the choice of line),
then $f(x_1,\ldots,x_n)$ is continuous in $G$.


E2256. Proposed by H. Kestelman, University College, London, England 


Determine a real function $f$ on $R^n$ so that $|f(x)|\le\|x\|$ for all $x$, where
$\|x\| = (x_1^2+\cdots+x_n^2)^{1/2}$, and $\|\operatorname{grad} f(0)\| = n^{1/2}$.


E 2257. Proposed by I. Kaucký, Bratislava, Czechoslovakia
Prove the identity 


2n—1 __ —l &k& n 
cay ") > i _ 2n 2 I . 
k j 


k=1 


E 2258. Proposed by Arthur Marshall, Madison, Wisconsin 


For each natural number $k$, let $N_k$ be the $k$th number in the sequence which
consists only of primes and products of consecutive primes, taken in natural
order. Let the Möbius function be defined as usual: $\mu(1)=1$, and for $n>1$,
$\mu(n)=(-1)^r$, where $r$ is the number of primes dividing $n$. Prove


< uN) 
» Ne 


— Cc . 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Route Determinism 


E 1980 [1967, 438]. Proposed by R. E. Chandler, Duke University 


The street plan for a certain town is a generalization of the tic-tac-toe board
with n vertical streets and m horizontal streets (with no vertical street intersecting
the end of a horizontal street and vice versa). An automobile will start at
the end of one (unspecified) street and travel to the end of another. What is the 
minimum number of traffic counters (each to be placed in the middle of a block) 
necessary to determine the route of the automobile? The route is to have no 
cycles, and the counters record the time that the automobile passes so that the 
direction of the route is determined when two counters have been passed. 


Solution by H. Shank, Cornell University. More generally, consider a connected
graph $G$ of $v$ vertices and $e$ edges to which $k\ge 2$ additional “end edges”
are attached, making a new graph $G^*$. To avoid special cases, we assume (a) $G$
is inseparable (i.e., removing an edge from $G$ never disconnects $G$) and (b) the
end edges are attached at different vertices of $G$. (These conditions are satisfied
in the problem.)

Now identify the $k$ non-$G$ end vertices with a single vertex $n_0$. This changes
$G^*$ into a new graph $G_0$.

We seek a minimal set $S$ of branches such that (i) each loop (cycle) in $G_0$
contains an edge of $S$, and (ii) each loop in $G_0$ passing through $n_0$ contains at
least two edges of $S$. (The edges in $S$ contain the traffic counters; condition (ii)
means the counters always determine direction of a trip across town.)

Let $T$ be the complement of $S$, i.e., the set of branches of $G_0$ not in $S$. By
(i), $T$ contains no loop. By standard graph theory, $T$ is a forest of trees, so

$$|T| \le |\mathrm{vertices}(G_0)| - 1 = (v+1)-1 = v.$$

Hence

$$|S| \ge |\mathrm{edges}(G_0)| - v = (e+k)-v = e-v+k.$$

Suppose $|S| = e-v+k$. Then $T$ is a maximal tree of $G_0$, hence there exists a
loop through $n_0$ containing exactly one edge not in $T$. This contradicts (ii), so
we conclude that

$$|S| \ge (e-v+1)+k.$$

But there exists a set $S$ which satisfies (i) and (ii) and has $|S| = (e-v+1)+k$,
namely, the set $S$ consisting of the complement of any maximal tree in $G$ and
the $k$ end edges. Hence

$$\min|S| = (e-v+1)+k.$$

In the original problem, $v=mn$, $e=2mn-m-n$, and $k=2m+2n$, so

$$\min|S| = mn+m+n+1 = (m+1)(n+1).$$
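
The final substitution can be checked mechanically. The sketch below is only an illustration: the function names are hypothetical, and it assumes the grid counts stated in the solution (mn intersections, m(n-1)+n(m-1) interior blocks, 2m+2n street ends).

```python
def min_counters(v, e, k):
    """Minimum number of counters, (e - v + 1) + k, as derived in the solution."""
    return (e - v + 1) + k


def grid_parameters(m, n):
    """Vertices, interior edges, and end edges for m horizontal and n vertical streets."""
    v = m * n                          # one vertex per intersection
    e = m * (n - 1) + n * (m - 1)      # blocks between intersections: 2mn - m - n
    k = 2 * m + 2 * n                  # two free ends per street
    return v, e, k


if __name__ == "__main__":
    for m in range(2, 5):
        for n in range(2, 5):
            print(m, n, min_counters(*grid_parameters(m, n)), (m + 1) * (n + 1))
```

For the ordinary tic-tac-toe board (m = n = 2) both columns give 9.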


Also solved by E. F. Schmeichel. 
A more detailed solution will appear in Robert Kopp, Systems of roads with counters, this 
MONTHLY, probably December 1970. 


The Greatest Integer as a Double Sum 
E 2198 [1969, 1063]. Proposed by Michael Aissen, Fordham University 
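If $r>1$ is an integer and $x$ is real, define

$$f(x) = \sum_{k=0}^{\infty}\sum_{j=1}^{r-1}\left[\frac{x+jr^{k}}{r^{k+1}}\right],$$

where the brackets denote the greatest integer function. Show that

$$f(x) = \begin{cases} [x] & \text{if } x \ge 0, \\ [x+1] & \text{if } x<0. \end{cases}$$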


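Solution by Howard Thoyre and George Kung, Wisconsin State University at
Stevens Point. It is an easy exercise to show that if $a$ is a real number and $k$ is a
positive integer,

$$\sum_{t=0}^{k-1}\left[a+\frac{t}{k}\right] = [ka].$$

Applying this with $a=x/r^{k+1}$ and with $r$ in place of $k$, we obtain

$$f(x) = \sum_{k=0}^{\infty}\sum_{j=1}^{r-1}\left[\frac{x}{r^{k+1}}+\frac{j}{r}\right] = \sum_{k=0}^{\infty}\left(\left[\frac{x}{r^{k}}\right]-\left[\frac{x}{r^{k+1}}\right]\right).$$

Letting $S_n$ represent the $n$th partial sum, we have $S_n = [x]-[x/r^{n+1}]$. Since
$r>1$, there exists a positive integer $N$ such that for all $n>N$, $|x/r^{n+1}|<1$.
Therefore, for fixed $x$ and $n>N$, the sequence of partial sums is constant. For
$x\ge 0$ and $n>N$, the sequence is $[x]$ and for $x<0$, the sequence is $[x]+1
= [x+1]$.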


Hence $f(x)=[x]$ for $x\ge 0$ and $f(x)=[x+1]$ for $x<0$.
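
The identity can be spot-checked with exact rational arithmetic. The following is a sketch, not part of the published solution: it assumes the summand is $[(x+jr^k)/r^{k+1}]$, truncates the outer sum at a hypothetical cutoff `terms` (the terms vanish once $r^k>|x|$), and uses illustrative function names.

```python
from fractions import Fraction
from math import floor


def double_sum(x, r, terms=64):
    """Sum over k = 0..terms-1 and j = 1..r-1 of floor((x + j*r**k) / r**(k+1))."""
    x = Fraction(x)
    total = 0
    for k in range(terms):
        for j in range(1, r):
            total += floor((x + j * r**k) / r**(k + 1))
    return total


def closed_form(x):
    """[x] for x >= 0 and [x + 1] for x < 0."""
    x = Fraction(x)
    return floor(x) if x >= 0 else floor(x + 1)


if __name__ == "__main__":
    for x in (Fraction(7, 2), Fraction(-7, 2), 10, -10, 0):
        print(x, double_sum(x, 3), closed_form(x))
```

The cutoff of 64 terms is enough whenever $|x|<2^{63}$, since $r\ge 2$.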
Also solved by M. T. Bird, D. M. Bloom, L. Carlitz, G. C. Dodds, D. F. Eliezer, M. G. Greening
(Australia), Doug Hanto, Robert Heller, Dean Hickerson, Stephen Hoffman, James Inglis,
Kenneth Klinger, R. S. R. Rao (India), Simeon Reich (Israel), Henry Ricardo, Bart Rice, Steve 
Rohde, E. F. Schmeichel, David Spear, T. Tamura (Japan), E. W. Trost (Switzerland), E. R. Van 
Eschen, Charles Wexler, and the proposer. 


Integral Values for a Factorial Quotient 
E 2199 [1969, 1063]. Proposed by F. J. Papp, University of Delaware 
For given $n$, determine all $r$ and $s$ such that

$$\frac{(r+1)(n+s)!}{(s+1)(n+r)!}$$

is integral.


Solution by E. P. Del Norte, University of Texas at El Paso. For $n>1$ we have

$$R = \begin{cases} \dfrac{r+1}{(s+1)(n+s+1)(n+s+2)\cdots(r+n)}, & s<r,\\[2ex] 1, & s=r,\\[1ex] \dfrac{(r+1)\cdots s\cdot(s+2)\cdots(s+n)}{(r+2)\cdots(r+n)}, & r<s. \end{cases}$$

Thus $R$ is integral for $s=r$ and $s\ge r+n$. For $s<r$, $R$ is not integral since $0<R<1$.
Now let $r<s<r+n$ and set $s=r+k$:

$$R = \frac{(r+1)(r+n+1)(r+n+2)\cdots(r+n+k)}{r+k+1}.$$

But $r+k+1$ is a divisor of $(r+1)(r+n+1)\cdots(r+n+k)$, which equals

$$(r+k+1-k)\,(r+k+1+(n-k))\cdots(r+k+1+(n-1)),$$

if and only if $r+k+1$ divides $k(n-k)(n-k+1)\cdots(n-1)$. It follows that for
$r<s<r+n$ the pairs $(r, s)$ for which $R$ is integral are $(d-k-1, d-1)$ where
$0<k<n$, and $d$ divides $k(n-k)\cdots(n-1)$ and $d>k+1$.


For $n=1$, $R=s!/r!$ is integral whenever $s\ge r$, and not integral for $s<r$.
For $n=0$, $R=(r+1)s!/(s+1)r!$. If $s<r$,

$$R = \frac{r+1}{(s+1)\cdot(s+1)(s+2)\cdots r},$$

which is not integral since $(r, r+1)=1$. $R=1$ for $s=r$. Now take $s>r$:
$R=(r+1)^2(r+2)\cdots s/(s+1)$. Choose $k>0$ and let $s=r+k$. Proceeding as before
we have that $r+k+1$ divides

$$(r+1)^2(r+2)\cdots(r+k) = (r+k+1-k)^2\,(r+k+1-(k-1))\cdots(r+k+1-1)$$

if and only if $r+k+1$ divides $k^2(k-1)!$, and hence $R$ is integral for $(r, s)
= (d-k-1, d-1)$, where $d \mid k^2(k-1)!$ and $d>k+1$.
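
The case analysis for $n>1$ lends itself to a brute-force check. The sketch below assumes the quotient is $R=(r+1)(n+s)!/((s+1)(n+r)!)$ with $r,s\ge 0$; the function names and the search bound `limit` are illustrative only.

```python
from fractions import Fraction
from math import factorial, prod


def quotient(n, r, s):
    """R = (r+1)(n+s)! / ((s+1)(n+r)!) as an exact fraction."""
    return Fraction((r + 1) * factorial(n + s), (s + 1) * factorial(n + r))


def divisor_criterion(n, r, s):
    """For r < s < r+n with s = r+k: does r+k+1 divide k(n-k)(n-k+1)...(n-1)?"""
    k = s - r
    return (k * prod(range(n - k, n))) % (r + k + 1) == 0


def matches_solution(limit=10):
    """Compare direct integrality of R with the stated cases for 2 <= n < limit."""
    for n in range(2, limit):
        for r in range(limit):
            for s in range(r + n + 3):
                integral = quotient(n, r, s).denominator == 1
                if s < r:
                    expected = False
                elif s == r or s >= r + n:
                    expected = True
                else:
                    expected = divisor_criterion(n, r, s)
                if integral != expected:
                    return False
    return True


if __name__ == "__main__":
    print(matches_solution())
```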


Also solved by E. F. Schmeichel, and by Charles Wexler. Partial solution by Michael Gold- 
berg. 


Counting Unequal Partitions 
E 2200 [1969, 1063]. Proposed by J. M. Moser, Navy Electronics Laboratory, 
San Diego, Cal. 


Find the number of partitions of an integer $n$ into $m$ unequal parts such that
each part must be not larger than $2m$ nor smaller than $q$, where $2\le q\le m+1$.


Solution by E. F. Schmeichel, The College of Wooster. Set $q'=q-1$ and
$k=m-q'\ge 0$. Put $B=\binom{m+1}{2}$. Unless

$$\sum_{i=1}^{m}(q'+i) = mq'+B \le n \le m^2+B = \sum_{i=1}^{m}(m+i),$$

there is no representation for $n$ in the desired form. Otherwise, let $n=mq'+B+a$,
where $0\le a\le km$. It follows easily that the number of representations
of $n$ as a sum of $m$ unequal parts chosen from $q'+1, q'+2, \ldots, q'+(m+k)=2m$
will be precisely the same as the number of representations of $B+a$ as a sum
of $m$ unequal integers chosen from $1, 2, \ldots, m+k$. But

$$P_{m+k}(x, m) = x^{B}\,\frac{(1-x^{k+1})(1-x^{k+2})\cdots(1-x^{k+m})}{(1-x)(1-x^2)\cdots(1-x^m)}$$

enumerates the partitions of $n$ into $m$ unequal parts, none of which exceed
$m+k$. (See John Riordan, Introduction to Combinatorial Analysis, 1958, p. 153,
Exercise 7.) So

$$x^{-B}P_{m+k}(x, m) = \frac{(1-x^{k+1})(1-x^{k+2})\cdots(1-x^{k+m})}{(1-x)(1-x^2)\cdots(1-x^m)} = \sum_{a} P_a x^{a},$$

where $P_a$ denotes the number of partitions of $n=mq'+B+a$ into $m$ unequal
parts ranging between $q$ and $2m$.
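
The generating-function answer can be compared with direct enumeration for small cases. This sketch assumes the reading above (parts between $q$ and $2m$, $B=m(m+1)/2$, coefficient of $x^a$ in the Gaussian binomial coefficient with top $m+k$ and bottom $m$); the function names are hypothetical, and the coefficients are built with the standard $q$-Pascal recurrence rather than by expanding the quotient.

```python
from itertools import combinations


def count_by_enumeration(n, m, q):
    """Number of ways to write n as a sum of m distinct parts, each in q..2m."""
    return sum(1 for parts in combinations(range(q, 2 * m + 1), m) if sum(parts) == n)


def gaussian_binomial(top, bottom):
    """Coefficient list (constant term first) of the Gaussian binomial [top, bottom] in x."""
    if bottom < 0 or bottom > top:
        return [0]
    if bottom == 0 or bottom == top:
        return [1]
    # q-Pascal rule: [top, bottom] = [top-1, bottom-1] + x**bottom * [top-1, bottom]
    first = gaussian_binomial(top - 1, bottom - 1)
    second = [0] * bottom + gaussian_binomial(top - 1, bottom)
    size = max(len(first), len(second))
    first += [0] * (size - len(first))
    second += [0] * (size - len(second))
    return [u + w for u, w in zip(first, second)]


def count_by_generating_function(n, m, q):
    """Coefficient P_a with a = n - m*q' - B, or 0 when a is out of range."""
    q_prime = q - 1
    k = m - q_prime
    b = m * (m + 1) // 2
    a = n - m * q_prime - b
    if a < 0 or a > k * m:
        return 0
    return gaussian_binomial(m + k, m)[a]


if __name__ == "__main__":
    print(count_by_enumeration(12, 3, 2), count_by_generating_function(12, 3, 2))
```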


Also solved by Michael Goldberg, C. T. Haskell, and W. F. Rogers. Letting $p'(n, m; q, k)$
denote the number of partitions of $n$ into $m$ unequal parts, each part $\ge q$ and $\le k$, and letting
$v(n, m; r)$ denote the number of partitions of $n$ into at most $m$ parts, each part $\le r$, Haskell first
proves that $p'(n, m; q, k) = v(n - mq - m(m-1)/2,\ m;\ k-m-q+1)$.


Steiner-Artzt Coincidence 


E 2201 [1969, 1063]. Proposed by J. M. Quoniam, Saint-Etienne, France 


Given a triangle, find (with compass and straightedge) the points in which 
the inscribed Steiner ellipse of the triangle intersects the three Artzt parabolas of 
the triangle, and also construct the tangents to the curves at these points. (The 
inscribed Steiner ellipse of a triangle is the ellipse inscribed in the triangle and
having the centroid of the triangle for its center; an Artzt parabola is the parabola
tangent to two sides of the triangle at the endpoints of the third side.)


I. Solution by A. W. Walker, Toronto, Canada. The two points where the
Artzt (A) parabola meets the inscribed Steiner ellipse are distant $BC/(2\sqrt{3})$
from the centroid G of the triangle ABC on a line through G parallel to BC; the
tangents to the parabola at these points meet at the midpoint of AG, and the
tangents to the ellipse are parallel to AG. These results are readily verified when
ABC is equilateral, the parabola then having G as focus and the right bisector of
AG as directrix, and follow generally by affine projection.


II. Solution by Michael Goldberg, Washington, D.C. By orthogonal projection,
project the given triangle ABC into an equilateral triangle. If the lengths of
the sides of the given triangle are a, b, c, where $a\le b\le c$, then the length of the
side of the equilateral triangle is e, where e satisfies the equation

$$\sqrt{e^2-a^2} = \sqrt{e^2-b^2}+\sqrt{e^2-c^2},$$

or $3e^4-2(a^2+b^2+c^2)e^2+16K^2=0$, where K is the area of the given triangle.
Since e is obtainable from a, b, c by the taking of square roots, it is constructible
by Euclidean methods.
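
As a numerical illustration of the relation between e and the sides, the sketch below solves the quartic as a quadratic in $e^2$ and checks the radical equation. It assumes the larger root of the quadratic is the relevant one (so that $e\ge c$) and uses Heron's formula for $K^2$; the function name is hypothetical.

```python
import math


def equilateral_side(a, b, c):
    """Side e of the equilateral triangle, from 3e^4 - 2(a^2+b^2+c^2)e^2 + 16K^2 = 0."""
    s = (a + b + c) / 2
    k_squared = s * (s - a) * (s - b) * (s - c)      # Heron's formula for K^2
    p = a * a + b * b + c * c
    disc = max(p * p - 48 * k_squared, 0.0)          # guard against rounding below zero
    e_squared = (p + math.sqrt(disc)) / 3            # larger root (an assumption)
    return math.sqrt(e_squared)


if __name__ == "__main__":
    a, b, c = 3.0, 4.0, 5.0
    e = equilateral_side(a, b, c)
    lhs = math.sqrt(e * e - a * a)
    rhs = math.sqrt(e * e - b * b) + math.sqrt(e * e - c * c)
    print(e, lhs, rhs)
```

For the 3, 4, 5 triangle the two sides of the radical equation agree to rounding error, with e slightly larger than 5.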

For the equilateral triangle, the Steiner ellipse is the inscribed circle. It is 
easily verified that an Artzt parabola intersects this circle at the ends of a di- 
ameter that is parallel to a side. The tangent to the circle at such a point is 
perpendicular to a side. The tangent to the parabola makes an angle of 45° with 
a side. 
If the equilateral triangle is projected back into the given triangle then the
circle is projected into the Steiner ellipse of the given triangle, and the Artzt 
parabolas are projected into the Artzt parabolas of the given triangle. The tan- 
gents are projected into the desired tangents in the given triangle. The deter- 
mination of all the desired points and lines requires only Euclidean construc- 
tions. 


Also solved by James Buddenhagen, Charles Chouteau, Jordi Dou (Spain), M. G. Greening 
(Australia), and the proposer. 


The Transitive Span of a Set of Ordered Pairs of Reals
E 2202 [1969, 1137]. Proposed by Necdet Ucoluk, Clarion (Pa.) State College 


Since a binary relation on a set $A$ is a subset of $A\times A$, every subset of the
Euclidean plane $E^2$ is a binary relation on the reals. Define the transitive span of
a relation $\rho$ to be the smallest transitive relation containing $\rho$. (a) Determine the
transitive span of the relation $\rho$, consisting of all points interior to or on the
triangle with vertices (3, 5), (2, 7) and (6, 0). (b) What if the region is the set
bounded by the ellipse $(x-1)^2+4y^2=16$?


Solution (a) by Grattan Murphy, University of Maine. Let $A$ be a set and $S$
a relation in $A$. For any sets $P, Q\subset A\times A$, let $P\circ Q=\{(x, y): (x, a)\in P$ and
$(a, y)\in Q$ for some $a\in A\}$. Then define $S^1=S$ and $S^n=S^{n-1}\circ S$ for each $n>1$.
It is easy to see that $(x, y)\in S^n$ if and only if there exist $a_1, \ldots, a_{n-1}\in A$ such
that $(x, a_1), (a_1, a_2), \ldots, (a_{n-1}, y)\in S$, from which $S^p\circ S^q=S^{p+q}$. Now any
transitive relation containing $S$ must contain $S^n$ for every $n$ and, therefore, also
$\bigcup S^n$. But $\bigcup S^n$ is transitive, for if $(x, y)$ and $(y, z)\in\bigcup S^n$, then there exist $p$, $q$
such that $(x, y)\in S^p$ and $(y, z)\in S^q$, whence $(x, z)\in S^p\circ S^q=S^{p+q}\subset\bigcup S^n$.
Therefore $\bigcup S^n$ is the transitive span of $S$.
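
For a finite relation the same construction can be carried out directly. The sketch below is an illustration only (the function names are hypothetical): it accumulates the powers $S^n$ until a new power adds nothing, which suffices because $S^{n+1}\subset S\cup\cdots\cup S^n$ forces every later power into the same union.

```python
def compose(p, q):
    """P o Q = {(x, y) : (x, a) in P and (a, y) in Q for some a}."""
    return {(x, y) for (x, a) in p for (b, y) in q if a == b}


def transitive_span(s):
    """Union of the powers S, S o S, ... of a finite relation S."""
    span = set(s)
    power = set(s)
    while True:
        power = compose(power, s)      # next power S^(n+1) = S^n o S
        if power <= span:              # nothing new: all later powers stay inside span
            return span
        span |= power


if __name__ == "__main__":
    print(sorted(transitive_span({(1, 2), (2, 3), (3, 4)})))
```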

Let $S$ be a closed convex set in $E^2$ such that the lines $y=a$ and $x=a$ intersect
$S$ in intervals bounded by the points $(x_1, a)$, $(x_2, a)$, $(a, y_1)$ and $(a, y_2)$ with $x_1<x_2$,
$y_1<y_2$. Then the rectangular region $\{(x, y) \mid x_1\le x\le x_2,\ y_1\le y\le y_2\}$ is in $S\circ S$.
Let the boundary at $(x_1, a)$ and $(a, y_1)$ satisfy $x=f_1(y)$ and $x=f_2(y)$, respectively.
Then $f_2(y_1)=a$ and $x_1=f_1(a)=f_1(f_2(y_1))$. The lower left vertex of the rectangle is
$(f_1\circ f_2(y_1), y_1)$. The boundary points of $S\circ S$ must be vertices of a rectangle of this
form. Similarly the boundary points of $S^n$ are of the form $(f_1\circ f_2\circ\cdots\circ f_n(y), y)$
where $x=f_i(y)$ are appropriately chosen boundary curves of $S$.

In the case where $S$ is the triangle with vertices (2, 7), (3, 5), and (6, 0), the
boundary curves are $f(y)=x=6-4y/7$, $g(y)=x=(11-y)/2$, and $h(y)=x=6-3y/5$.
The right and left hand boundaries of $S^{2n}$ are $x=(fh)^n(y)$, $x=(fh)^{n-1}fg(y)$,
$x=(hf)^{n-1}gf(y)$ and $x=(hf)^n(y)$, while those of $S^{2n+1}$ are $x=f(hf)^{n-1}gf(y)$,
$x=f(hf)^n(y)$, $x=(hf)^nh(y)$ and $x=(hf)^ng(y)$ for $n=1, 2, \cdots$; the upper and
lower boundaries being $y=7$ and $y=0$ in each case. The slopes of the boundary
lines of $S^{2n}$ are all positive and tend to $\infty$ as $n\to\infty$ with the points (90/23, 90/23)
and (84/23, 84/23) being fixed points for right and left boundaries, respectively.
Likewise the slopes of the boundaries of $S^{2n+1}$ tend to $-\infty$ and contain the points
(90/23, 84/23) and (84/23, 90/23). The points on the open intervals on the
lines $x=84/23$ and $x=90/23$ between these points are not contained in any $S^n$.
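
The boundary lines and the two fixed points quoted above can be confirmed with exact fractions. This sketch only assumes the three lines through the given vertices; the names `f`, `g`, `h` follow the solution.

```python
from fractions import Fraction


def f(y):
    """Line through (2, 7) and (6, 0): x = 6 - 4y/7."""
    return 6 - Fraction(4, 7) * y


def g(y):
    """Line through (2, 7) and (3, 5): x = (11 - y)/2."""
    return (11 - Fraction(y)) / 2


def h(y):
    """Line through (3, 5) and (6, 0): x = 6 - 3y/5."""
    return 6 - Fraction(3, 5) * y


if __name__ == "__main__":
    p, q = Fraction(90, 23), Fraction(84, 23)
    print(f(h(p)) == p, h(f(q)) == q)   # fixed points of fh and hf
    print(h(p) == q, f(q) == p)         # h and f interchange the two values
```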


Solution (b) by E. F. Schmeichel, College of Wooster, Ohio. Let $E$ denote the
ellipse and its interior. Through the point $(1, 1)\in E$ draw horizontal and vertical
lines intersecting $E$ in segments $PP'$ and $QQ'$, respectively, and let $ABCD$ be
the rectangle having $Q$ and $Q'$ on its horizontal sides and $P$ and $P'$ on its vertical
sides. Note that rectangle $ABCD$ must be in the transitive span of $E$. Repeat the
above process with the point $(0, 0)\in E$ and the set $ABCD\cup E$ to obtain rectangle
$A'B'C'D'$, a transitive set and the desired transitive span of $E$. (Here
$A'=(-3, 2)$, $B'=(5, 2)$, $C'=(5, -2)$, and $D'=(-3, -2)$.)


Also solved (part (b)) by J. P. Comiskey. 


Consequences of a Lucas Congruence 


E 2205 [1969, 1138]. Proposed by S. M. Farber, D. W. Walkup, and R. J. B.
Wets, Boeing Scientific Research Laboratories


Suppose nonnegative integers $m$ and $n$ are given in their representations to
a prime base $p$, i.e.

$$m = (r_k\cdots r_1r_0)_p = \sum_{i=0}^{k} r_ip^i,\qquad 0\le r_i<p,$$

$$n = (s_k\cdots s_1s_0)_p = \sum_{i=0}^{k} s_ip^i,\qquad 0\le s_i<p.$$

Find a simple expression for the binomial coefficient $C(m, n)=\binom{m}{n}$ mod $p$. In
particular, find necessary and sufficient conditions for

(a) $\binom{m}{n}\equiv 0 \bmod p$, (b) $\binom{2m-1}{m-1}$ odd.


Solution by Robert Fray, Florida State University. A well-known result due
to E. Lucas says that $C(m, n)\equiv C(r_0, s_0)\,C(r_1, s_1)\,C(r_2, s_2)\cdots C(r_k, s_k) \pmod p$.
See L. E. Dickson, History of the Theory of Numbers, 1919, vol. 1, p. 271; also
N. J. Fine, Binomial coefficients modulo a prime, this MONTHLY, 54(1947), pp.
589-592.

(a) Since $0\le r_i, s_i<p$, the binomial coefficient $C(r_i, s_i)$ is divisible by $p$ if
and only if $s_i>r_i$. Hence $C(m, n)\equiv 0 \pmod p$ if and only if $s_i>r_i$ for some $i$,
where $0\le i\le k$.

(b) If we choose $p=2$, and let $m-1=\sum_{i=0}^{k} r_i2^i$, where $r_i$ is 0 or 1 for $0\le i<k$,
and $r_k=1$, then $2m-1=1+\sum_{i=1}^{k+1} r_{i-1}2^i$. It follows that

$$C(2m-1, m-1)\equiv C(1, r_0)\,C(r_0, r_1)\,C(r_1, r_2)\cdots C(r_{k-1}, r_k)\,C(r_k, 0) \pmod 2.$$

Hence $C(2m-1, m-1)$ is odd if and only if $1\ge r_0\ge r_1\ge\cdots\ge r_k=1$,
i.e. $m-1=1+2+2^2+2^3+\cdots+2^k=2^{k+1}-1$. Since $C(1, 0)$ is odd, it follows
that $C(2m-1, m-1)$ is odd if and only if $m$ is a nonnegative power of 2.

The Lucas congruence has been extended to the q-binomial coefficient. The
result in part (a) of this problem has been proved for the multinomial coefficient,
the q-binomial coefficient, and the q-multinomial coefficient. (See R. D. Fray,
Congruence properties of ordinary and q-binomial coefficients, Duke Math. J.,
vol. 34(1967), pp. 467-480.)
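
The Lucas congruence and the conclusion of part (b) can both be checked by computation. The sketch below is illustrative (the function name is hypothetical); it multiplies the digitwise binomial coefficients and compares the result with `math.comb`, which requires Python 3.8 or later.

```python
from math import comb


def binomial_mod_p(m, n, p):
    """C(m, n) mod p for a prime p, as the product of C(r_i, s_i) over base-p digits."""
    result = 1
    while m or n:
        m, r = divmod(m, p)            # r is the current base-p digit of m
        n, s = divmod(n, p)            # s is the current base-p digit of n
        result = result * comb(r, s) % p   # comb(r, s) is 0 when s > r
    return result


if __name__ == "__main__":
    odd_cases = [m for m in range(1, 70) if binomial_mod_p(2 * m - 1, m - 1, 2) == 1]
    print(odd_cases)
```

The printed list contains exactly the powers of 2 below 70, in agreement with part (b).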


Also solved by L. Carlitz, N. J. Fine, Michael Goldberg, Wells Johnson, Douglas Lind, 
D.C.B. Marsh, Simeon Reich (Israel), E. F. Schmeichel, Michael Shimshoni (Israel), Ira Ster- 
bakov, T. Tamura (Japan), Charles Wexler, and the proposers. 


ADVANCED PROBLEMS 


All solutions of Advanced Problems should be sent to J. Barlaz, Rutgers—The State University, 
New Brunswick, N.J. 08903. Solutions of Advanced Problems in this issue should be typed 
(with double spacing) on separate, signed sheets and should be mailed before January 31, 1971. 
Contributors (in the United States) who desire acknowledgment of receipt of their solutions are 
asked to enclose self-addressed stamped postcards. 


An asterisk (*) means neither the proposer nor the editors supplied a solution. 


5753.* Proposed by V. A. McAuley, Huntsville, Alabama 
Show that for all integers n> 2, 


__ | n(22n — 19) 1/2 
* Lin — 2)(22n — ml 


is an approximation to the positive real root of the equation 
(1 — 2)x" — nu? — 2 = 0,7 
with at most an error in the third decimal place. 


5754. Proposed by M. S. Klamkin, Ford Scientific Laboratory 


If $L(a, b)$ denotes the perimeter of an ellipse with semi-axes $a$ and $b$ ($a\ge b$),
show that 


L*(a, c) — 16a? = L(b, c) — 1667. 


5755. Proposed by G. G. Kiziak, University of Toronto 


It is well known that if a sequence of commuting normal operators $T_n$ on a
Hilbert space $H$ converges strongly to an operator $T$, then $T$ is normal. Show
that the assumption that the operators commute cannot be dropped, even if
the $T_n$ are unitary. (This contradicts an assertion in Nagy and Foias, Analyse
Harmonique des Opérateurs de l'espace de Hilbert, p. 107, paragraph 2.)
5756.* Proposed by J. H. Westbrooke, Evanston, Ill.


Let $q_1=13$, $q_2=17$, and $q_3, q_4, \cdots$ be the successive primes of the form
$4k+1$. Find the least upper bound of the sequence whose $n$th term is


5757. Proposed by R. A. Struble, North Carolina State University 


Does there exist a solution of the boundary value problem: 


dy ; 
dx + sin y = J [1+ y%(s)] sins ds, y(0) = y(1)? 
xX t 


[2 


If so, how many? 


5758. Proposed by W. O. Egerland, Research and Development Center, Aberdeen 
Proving Grounds, Md. 


A theorem of E. Landau states: If $f(z)$ is analytic and bounded by 1 in the
unit disk, $|z|<1$, and satisfies the conditions $f(0)=0$ and $f'(0)=a$, $0<a<1$,
then $f(z)$ is univalent in the disk $|z|<\rho=a^{-1}(1-(1-a^2)^{1/2})$. What is the
corresponding $\rho$ if, in addition, $f(z)\ne 0$ for $z\ne 0$?


SOLUTIONS OF ADVANCED PROBLEMS 


Ideals in an Associative Ring 
5675 [1969, 565]. Proposed by M. Slater, University of Bristol, England 


Let $R$ be an associative ring, $A$ an ideal of $R$, and $U$ a nonzero ideal of $A$.
Under suitable conditions $U$ contains a nonzero ideal of $R$ (cf. Jacobson, Structure
of Rings, p. 65). Construct an example in which the conclusion is false, and
in addition (1) $A$ is as well-behaved as possible; (2) $U^2\ne(0)$; (3) $U^2\ne(0)$ and
$A^2=A$.


Solution by the proposer. All our examples will be algebras over any field $F$.
We write $(x_1, x_2, \ldots, x_n)$ for the linear span of the indicated elements, and use
of the notation indicates that these elements are linearly independent.

(1) $AUA$ is an ideal of $R$ contained in $U$; hence is (0). If $A$ has no total right
divisors of zero, then $UA=(0)$, and $U\ne(0)$ consists of left total divisors of zero
in $A$. Thus, in any example $A$ must have (say) left total divisors of zero. We
give an example in which $A$ has a left unity $e$; then $A$ has the additional regularities
$A^2=A$ and $Aa=(0)$, where $a\in A$, implies $a=0$.

$R=(e, a, u, s)$; $A=(e, a, u)$; $U=(u)$. Basis products zero except $ee=e$, $ea=a$,
$eu=u$, $us=a$.

(2) We assume $R$ is an algebra over $F$. Then it can be shown that dim $U\ge 4$.
If dim $U=4$, say $U=(v, z, u, w)$. Then it can be shown that dim $A\ge 7$; say
$A\supseteq(a, b, c, v, z, u, w)$, and of course dim $R\ge 8$; say $s\in R$, $s\notin A$. By suitable
choices of the indicated elements we can present a partial multiplication table
for $R$ in the following form:


a b C d v z u w S p t 
a Y Y 0 Y Y Y 0 0 
b c+YV 0 z+YV xX 0 0 
C 0 0 0 0 0 0 0 0 
d O 0 xX xX 0 0 
v u Y 0 Y w 0 0 0 a 
Zz 0 0 0 0 0 0 0 0 C 
u 0 0 0 0 0 0 0 0 Y Y Y 
W 0 0 0 0 0 0 0 0 u Y Y 
Ss Q x 
p ct+tY e2+YV 
t xX 


Q=(c,u,w); X=@,u,w); Y= (u, wv). 


Capital letters in the table denote (unknown) elements in the corresponding 
subspace; p need not be independent of {s } UA;d and ¢ are arbitrary elements 
of A and R respectively. 

There actually exist examples of this type; two are as follows: 


(i) R = (a, b, ¢, 2, 3, u, w, s); A and B as above. 


Nonzero basis products: ba=c, bv=2, va =u, vd =wW, VS=A, 28S=C, WS=U, sv=|, 
SU=C, SW=Z. 


(ii) R = (a, b, c, 2, 3, u, w, 5s, p); A and B as in (i). 


Basis products as in (i) except that the last three listed are replaced by pu =c, 
pw=s. 

(3) We do not have a solution for (3), but we can prove that in an example 
with 42=A we must have dim U>4. (In the examples of (2) we have A?=(0).) 
If dim US4, then R has a sub-multiplication table as shown in (2). Suppose 
bEA*% Then b=) fig:i, say, with f;, g;GA. Then c+Y=ba=()>, fig,a 
= > fi(gia) = >> fig =0 from the table (¢;EQ). Since c€ Y, this is a contradic- 
tion. So, if dim Us4, b A’, and so A? A. 


Nonlinear Homogeneous Isometries 


5688 [1969, 835]. Proposed by P. R. Chernoff, University of California, 
Berkeley 


Let $X$ and $Y$ be normed vector spaces and $f: X\to Y$ a homogeneous isometry;
that is: $f(t\cdot v)=tf(v)$ and $\|f(v)-f(w)\|=\|v-w\|$ for all scalars $t$ and vectors $v$,
$w$. Must $f$ be linear? (Cf. Banach, p. 166, Theorem 2.)
Solution by J. Marsden, University of California, Berkeley. If $Y$ is two dimensional
or strictly convex or if $f$ is onto, then $f$ is linear; but in general $f$ need
not be linear.

We first give the example. Let $X=R^2$ and $Y=R^3$, each with the $L_\infty$ norm
(equalling the supremum of the coordinate norms). Consider the piecewise
linear homogeneous function $f: X\to Y$ defined by $(\pm 1, 0)\to(\pm 1, 0, 0)$, $(0, \pm 1)\to(0, \pm 1, 0)$
and $(1, \pm 1)\to(1, \pm 1, \pm 1)$. It is straightforward to check that $f$
is an isometry.

That f is linear if it is onto is the cited theorem of Banach. To show f is 
linear if Y is two dimensional, one is quickly reduced to the case in which X is 
two dimensional. But by invariance of domain, f is open so, by homogeneity, is 
onto, and therefore linear. 

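Finally, if $Y$ is strictly convex (i.e. equality in the triangle inequality holds
only for collinear points), the equalities

$$\left\|f(u)-f\!\left(\frac{u+v}{2}\right)\right\| = \left\|f\!\left(\frac{u+v}{2}\right)-f(v)\right\| = \frac{\|u-v\|}{2}, \qquad \|f(u)-f(v)\| = \|u-v\|$$

imply

$$f\!\left(\frac{u+v}{2}\right) = \frac{f(u)+f(v)}{2},$$

from which linearity is evident.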
Also solved by P. J. Owens (England), T. V. Wimer, and the proposer. 


Zero Divisors in an Associative Ring 
5690 [1969, 947]. Proposed by Irving Kaplansky, University of Chicago 


Let $R$ be an associative ring. Suppose $a\in R$ is nilpotent and $b\in R$ is a right
zero-divisor. Suppose $a$ and $b$ commute. Prove there exists $c\ne 0$ in $R$ such that
$ac=bc=0$.


Solution by Barbara Osofsky, Rutgers—The State University. Assume $d\ne 0$
such that $bd=0$. Then, since there exists $n$ such that $a^n=0$, there exists $k\ge 0$
with $a^kd\ne 0$, $a^{k+1}d=0$. Now $a\cdot a^kd=0$ and $ba^kd=a^kbd=0$: that is, $c=a^kd$
establishes the result.

If we interpret “$b$ is a right zero-divisor” to mean: there exists $d\ne 0$ such
that $db=0$, then the statement of the problem would be false.

Let $(R, +)$ be the additive group of $S[z]$, where $S=F[x, y]/(x^2)$, $F$ some
field. That is, $R=\{\sum_{i=0}^{n} a_i(x, y)z^i\}$, where $a_i$ is an at most linear polynomial
in $x$ and any degree in $y$. Define multiplication in $R$ by the distributive law and
the $a(x, y)$ polynomials multiply as in $S$, $za=az$ for $a\in F$, $zx=xz$, and $zy=0$.
Now $x$ and $y$ commute, $x^2=0$, and $zy=0$, $z\ne 0$, yet for all nonzero $e$ in $R$, $ye\ne 0$.


Also solved by G. A. Elliott, Harry Gonshor, Melvin Hendriksen, D. A. Higgs, M. L. Laplaza 
(Puerto Rico), Bruce Jensen & Ernest Sparks, C. O. M. Pewter (England), Peter Ross, G. F. 
Schumm, A. J. Silberger, James H. Smith, Seth Warner, A. K. Wayman, Jr., Qazi Zameeruddin 
(India), and the proposer. 


A Discrete Set with Uncountable Closure 


5691 [1969, 947]. Proposed by John Bond and Harold Reiter, Clemson Uni- 
versity 


Does there exist a discrete subset of the real line (usual topology) which has 
an uncountable closure? 


Solution by J. C. Morgan II, University of California, Berkeley. The first
example of such a set was given implicitly by I. Bendixson (Quelques Théorèmes
de la Théorie des Ensembles de Points, Acta Math., 2(1883) pp. 416-418). Namely,
consider any nowhere dense (nonempty) perfect set K. In each complementary
interval of K choose a decreasing sequence converging to the left endpoint and
an increasing sequence converging to the right endpoint. The set of all points of the
chosen sequences is a discrete set whose closure contains K.


Also solved by sixty other readers. 

Editorial Note. Many contributors pointed out that it suffices to select the midpoints of the 
complementary intervals of Cantor’s ternary set. Ter Morsche and van der Steen note—as implied 
above—that the closure may also have positive measure. 


Characteristic Roots of a Symmetric Matrix 
5692 [1969, 948]. Proposed by Olga Taussky, California Institute of Technology 


Let F be a real closed field and A a finite matrix with elements in F. Show 
explicitly that the characteristic roots of A’A (A’ the transposed matrix) are 
sums of squares in F. 


Solution by J. B. Skinner, University of Massachusetts. Since $F$ is real closed,
the extension $F(\sqrt{-1})$ is proper and algebraically closed. There is an $F$-automorphism
of $F(\sqrt{-1})$ given by $z=x+y\sqrt{-1}\to\bar z=x-y\sqrt{-1}$, and the space of
$n$-tuples over $F(\sqrt{-1})$ has an inner product given by $(x, y)=\sum_{i=1}^{n} x_i\bar y_i$. If $\lambda$ is
a characteristic root of $A'A$, then $\lambda\in F(\sqrt{-1})$; furthermore, since $A'A$ is
symmetric, it follows that $\lambda\in F$, hence $A'A$ has a characteristic vector $x$ with
coordinates in $F$. We may assume that $(x, x)=1$. Then $\lambda=\lambda(x, x)=(x, \lambda x)
=(x, A'Ax)$. But this last expression equals $(Ax, Ax)$, which is a sum of squares
in $F$.

Also solved by G. N. de Oliveira (Portugal), E. D. Dixon, R. A. Kopas, C. L. Sabharwal, R. C. 
Thompson, E. W. Trost (Switzerland), and the proposer. 


The proposer proves also that the characteristic roots are sums of squares in a formally real
field if we assume that F is a formally real field.
Metric in a Half-open Interval Topology


5694 [1969, 948]. Proposed by Jürg Rätz and Albert Wilansky, Lehigh University


As a subspace of R (the reals) with the half-open interval topology, (Kelley, 
General Topology, p. 59), Q (the rationals) is first countable, hence second count- 
able, hence metrizable. Exhibit an explicit metric for this topology. 


Solution by D. R. Anderson, Central Washington State College. Choose a
listing of the rationals, $q_1, \ldots, q_n, \ldots$. For each positive integer $n$, the function

$$d_n(x, y) = \begin{cases} |x-y| & \text{if } x, y<q_n \text{ or } x, y\ge q_n,\\ |x-y|+1 & \text{if } x<q_n\le y \text{ or } y<q_n\le x \end{cases}$$

is a metric for $R$ for which the open sets are the usual open sets and the half
open intervals of the form $[q_n, b)$. The function

$$d(x, y) = \sum_{n\ge 1} d_n(x, y)/n^2$$

is also seen to be a metric for $R$ for which the open sets include the usual open
sets and the half open intervals of the form $[q_n, b)$, $n=1, 2, \cdots$. In fact, these
are the only open sets. This metric may then be restricted to the rationals in
order to obtain the desired metric.

Also solved by Linda W. Brinn, E. P. Del Norte, R. J. Driscoll, G. J. Foschini, T. E. Gantner. 
Leonard Gillman, Gary Gruenhage, J. B. Lindner, O. P. Lossers (Netherlands), M. D. Mavinkurve 


(India), Ka Menehune, D. Hammond Smith (England), R. H. Sorgenfrey, T. Wimer, and the 
proposer. 


Boolean Lattices 
5696 [1969, 1074]. Proposed by P. J. Chase, Laurel, Maryland 
Show that a lattice $L$ is Boolean if and only if it admits a unary operation
$x\to x'$ such that $a\cap b\le c\le a\cup b$ implies $c\cap b'\le a\le c\cup b'$.


Solution by Michael McCoy, Kansas State Teachers College. First suppose
that $L$ is a Boolean lattice, and let complementation be the operation $x\to x'$
(well-defined since complements are unique in a Boolean lattice). Using the
distributive property and the isotonicity of the meet and join operations: if
$a\cap b\le c\le a\cup b$, then

$$\begin{aligned} c\cap b' &\le a\cup(c\cap b') = (a\cup c)\cap(a\cup b') \le (a\cup b)\cap(a\cup b')\\ &= a\cup(b\cap b') = a\cup 0 = a = a\cap I = a\cap(b\cup b')\\ &= (a\cap b)\cup(a\cap b') \le (a\cap c)\cup(a\cap b')\\ &= a\cap(c\cup b') \le c\cup b'. \end{aligned}$$

Thus the necessity of the condition follows.
To prove the sufficiency suppose that $a\cap b\le c\le a\cup b$ implies $c\cap b'\le a\le c\cup b'$.
If $b\in L$, then $b\cap a\le b\le b\cup a$ for all $a\in L$, and this implies $b\cap b'\le a\le b\cup b'$ for
all $a\in L$; hence $b\cap b'=0$ and $b\cup b'=I$, which shows that $L$ is complemented.
Moreover, if $a$ and $b$ are complementary, then $a\cap b\le c\le a\cup b$ for all $c\in L$, which
implies $b'\cap c\le a\le b'\cup c$ for all $c\in L$. But if $c=I$, then $b'=b'\cap I\le a$; and if $c=0$,
then $a\le b'\cup 0=b'$. Therefore $a=b'$, and $L$ is uniquely complemented. To show
distributivity, we use the well-known characterization: a lattice is distributive if
and only if relative complements are unique. Now $x\cap a=y\cap a$ and $x\cup a=y\cup a$
implies $y\cap a\le x\le y\cup a$ and $x\cap a\le y\le x\cup a$. By the hypothesis, $y\cap a\le x\le y\cup a$
implies $a'\cap x\le y\le a'\cup x$, which implies $x'\cap y\le a'\le x'\cup y$; and $x\cap a\le y\le x\cup a$
implies $x'\cap y\le a\le x'\cup y$. Together, these imply $x'\cap y=0$ and $x'\cup y=I$ which
implies $x=y$ since $L$ is uniquely complemented. Therefore $L$ is complemented
and distributive; hence, Boolean.
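
The necessity half can be illustrated on a concrete Boolean lattice. The sketch below is not part of the published solution: it takes the subsets of a small finite set, encoded as bit masks, with set complement as the unary operation, and tests the implication for every triple. The function name and the bit-mask encoding are assumptions made for the example.

```python
from itertools import product


def condition_holds(universe_bits=3):
    """Check: a∩b <= c <= a∪b implies c∩b' <= a <= c∪b' on all subsets of a finite set."""
    top = (1 << universe_bits) - 1

    def leq(x, y):
        return (x & y) == x            # x is a subset of y

    for a, b, c in product(range(top + 1), repeat=3):
        if leq(a & b, c) and leq(c, a | b):
            b_prime = top & ~b         # complement of b within the universe
            if not (leq(c & b_prime, a) and leq(a, c | b_prime)):
                return False
    return True


if __name__ == "__main__":
    print(condition_holds(3), condition_holds(4))
```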

Also solved by T. S. Blyth (Scotland), N. A. Borba, D. Z. Djokovic, G. A. Edgar, S. I. Gendler, 


Khee-meng Koh, Yataro Matsushima (Japan), V. V. Rama Rao (India), Simeon Reich (Israel), 
and the proposer. 


Dense Subsets in a Separable Space 


5698 [1969, 1074]. Proposed by D. L. Lutzer, University of Washington 


It is well known that a subspace S of a separable Hausdorff space X need 
not be separable. Is there an example of this in which S is dense? 


Solution by Dennis Henkel, Allen-Bradley Company, Milwaukee, Wisconsin. 
Let X be the real numbers, and define a new topology so that a set V is open if 
it is empty or if it is of the form $V=U-K$ where $U$ is open in the usual sense
and K is a set of irrational numbers that is at most countable. The rational 
numbers are dense in this space and the set S of irrational numbers is also 
dense. However, the complement of any countable subset of S is open and 
nonempty in the relative topology. 

Also solved by Harold Bennett, W. E. Bonnice (Turkey), Helen F. Cullen, T. E. Gantner, 
Leonard Gillman, K. D. Joshi, R. H. Lohman, Dan Marcus, T. M. Phillips, Jürg Rätz & Albert
Wilansky, George Reynolds, D. H. Smith (England), Gerald Wildeberg, T. V. Wimer, and the 
proposer. 


Derivation on a Ring of Continuous Functions 

5699 [1969, 1074]. Proposed by G. J. Foschini, Bell Telephone Laboratories. 

A derivation on a ring $S$ is an additive mapping $s\to s'$ of $S$ into itself satisfying
$(pq)'=pq'+p'q$. Let $C[0, 1]$ be the ring of continuous real functions on
$[0, 1]$ with the usual norm. Show that there exists a ring $R[0, 1]\subset C[0, 1]$ with
a derivation $r\to r^{[1]}$ such that the following subsets of $R[0, 1]$ are dense in
$C[0, 1]$:
(i) $\{r \mid r^{[1]}=0\}$,
(ii) $\{r \mid r^{[1]}=r\}$,
(iii) $\{r \mid r^{[n]}=(r^{[n-1]})^{[1]}>0,\ n=1, 2, \cdots\}$
and such that $R[0, 1]\supset C^{\infty}[0, 1]$ whereon the derivation coincides with the
usual derivative.


Solution by the proposer. Let $f\in C[0, 1]$ be differentiable almost
everywhere in such a way that there exists a $g\in C[0, 1]$ such that $g=df/dt$
almost everywhere. In such a case we define $g=f^{[1]}$. In the ring

$$R[0, 1] = \{r\in C[0, 1] \mid r^{[n]} \text{ exists},\ n=1, 2, \cdots\}$$

$r\to r^{[1]}$ is a derivation. This follows from the continuity of $r$ and $r^{[1]}$ and the
fact that $r^{[1]}=dr/dt$ almost everywhere.

To verify (i) we note that these constants are precisely the algebra of continuous
singular functions. Let $C(t)$ denote the Cantor function. Given any
pair $(x, y)$ of distinct points, a clock speed $\lambda$, $0<\lambda<1$, can be chosen so that
$C(\lambda t)$ separates $x$ and $y$. Thus by the Stone-Weierstrass theorem, the constants
are dense in $C[0, 1]$.

Let $h$ denote an arbitrary element in $C[0, 1]$. By (i) we can choose sequences
of constants $\{a_m\}$ and $\{b_m\}$ converging to $e^{-t}$ and $h$ respectively. Thus $a_mb_me^t
\to h$. Now $a_mb_m$ is constant and so $(a_mb_me^t)^{[1]}=a_mb_me^t$ is a fixed point and (ii)
has been demonstrated.

Finally $(e^{t/m}-1)+b_m\to h$ and $[(e^{t/m}-1)+b_m]^{[n]}=e^{t/m}/m^n>0$, $n=1, 2, \cdots$.
Therefore (iii) is satisfied.


Also solved by O. P. Lossers (Netherlands). 
PITFALLS IN COMPUTATION, OR WHY 
A MATH BOOK ISN’T ENOUGH 


GEORGE E. FORSYTHE, Computer Science Dept., Stanford University 


1. Introduction. Why does a student take mathematics in college or university?
I see two reasons: (i) To learn the structure of mathematics itself, be- 
cause he (or she) finds it interesting. (ii) To prepare to apply mathematics to 
the solution of problems he expects to encounter in his own field, whether it be 
engineering, physics, economics, or whatever. 

Surely (ii) motivates far more students than (i). Moreover, most solutions 
of major mathematical problems involve the use of automatic digital computers. 
Hence we may justifiably ask what mathematics courses have to say about 
carrying out mathematical work on a computer. This question motivates my 
paper. 

I am not in a mathematics department, and sometimes I moralize about 
them. If the reader prefers not to be lectured to, let him ignore the preaching 
and just pay attention to the numerical phenomena for their own sake. 

I want to acknowledge the help of Mr. Michael Malcolm in criticizing the 
manuscript and doing the computations with a special floating-decimal arith- 
metic simulator he wrote for Stanford’s hexa-decimal computer, an IBM 360/67. 


2. Nature of computers. An automatic digital computer is a general-pur- 
pose machine. The bits of information in its store can be used to represent any 
quantifiable objects—e.g., musical notes, letters of the alphabet, elements of a 
finite field, integers, rational numbers, parts of a graph, etc. Thus such a machine 
is a general abstract tool, and this generality makes computer science important, 
just as mathematics and natural language are important. 

In the use of computers to represent letters of the alphabet, elements of a 
finite field, integers, etc., there need be no error in the representation, nor in the 
processes that operate upon the quantities so represented. The problems in 
dealing with integers (to select one example) on computers are of the following 
type: Is there enough storage to contain all the integers we need to deal with? 
Do we know a process that is certain to accomplish our goal on the integers 
stored in the computer? Have we removed the logical errors (“bugs”) from the 
computer representation of this process? Is this the fastest possible process or, 
if not, does it operate quickly enough for us to get (and pay for) the answers 
we want? 


Prof. Forsythe received his PhD at Brown University under W. Feller and J. D. Tamarkin. 
He was an instructor at Stanford, worked in meteorology with the Air Force and at UCLA, and 
worked in numerical analysis at Boeing Airplane Co., the Institute for Numerical Analysis, and at 
UCLA. He has been at Stanford since 1957 in mathematics and in computing science. He spent 
1955-56 at the Courant Institute and 1966-67 at various computer centers in Europe, Asia, and 
Australia. 

He is known for his extensive writing on random variables, meteorology, and computer science. 
In 1969 he received an MAA Lester Ford Award. His books are: Dynamic Meteorology (with J. 
Holmboe and W. Gustin, Wiley, 1945), Bibliography of Russian Mathematics Books (Chelsea, 1956), 
Finite Difference Methods for Partial Differential Equations (with W. R. Wasow, Wiley, 1960), and 
Computer Solution of Linear Algebraic Systems (with C. B. Moler, Prentice-Hall, 1967). Editor. 

The above problems are not trivial; there are surely pitfalls in dealing with 
them; and it is questionable whether math books suffice for their treatment. 
But they are not the subject of this paper. This paper is concerned with the 
simulated solution on a digital computer of the problems of algebra and analysis 
dealing with real and complex numbers. Such problems occur everywhere in 
applied science—for example, whenever it is required to solve a differential 
equation or a system of algebraic equations. 

There are four properties of computers that are relevant to their use in the 
numerical solution of problems of algebra and analysis. These properties are 
causes of many pitfalls: 

(i) Computers use not the real number system, but instead a simulation of 
it called a “floating-point number system.” This introduces the problem of 
round-off. 

(ii) The speed of computer processing permits the solution of very large 
problems. And frequently (but not always) large problems have answers that 
are much more sensitive to perturbations of the data than small problems are. 

(iii) The speed of computer processing permits many more operations to be 
carried out for a reasonable price than were possible in the pre-computer era. 
As a result, the instability of many processes is conspicuously revealed. 

(iv) Normally the intermediate results of a computer computation are hid- 
den in the store of the machine, and never known to the programmer. Conse- 
quently the programmer must be able to detect errors in his process without see- 
ing the warning signals of possible error that occur in desk computation, where 
all intermediate results are in front of the problem solver. Or, conversely, he 
must be able to prove that his process cannot fail in any way. 


3. Floating-point number system. The badly named real number system is 
one of the triumphs of the human mind. It underlies the calculus and higher 
analysis to such a degree that we may forget how impossible it is to deal with 
real numbers in the real world of finite computers. But, however much the real 
number system simplifies analysis, practical computing must do without it. 

Of all the possible ways of simulating real numbers on computers, one class 
is most widely used today—the floating-point number system. Here a number 
base $\beta$ is selected, usually 2, 8, 10, or 16. A certain integer s is selected as the 
number of significant digits (to base $\beta$) in a computer number. An integer exponent 
e is associated with each nonzero computer number, and e must lie in 
a fixed range, say 


$$m \le e \le M.$$


Finally, there is a sign + or — for each nonzero floating-point number. 
Let $F = F(\beta, s, m, M)$ be the floating-point number system. Each nonzero 
$x \in F$ has the base-$\beta$ representation 
$$x = \pm\, .d_1 d_2 \cdots d_s \cdot \beta^{e},$$
where the integers $d_1, \ldots, d_s$ have the bounds 
$$1 \le d_1 \le \beta - 1, \qquad 0 \le d_i \le \beta - 1 \quad (i = 2, \ldots, s).$$
Finally, the number 0 belongs to F, and is represented by 
$$+.00 \cdots 0 \cdot \beta^{m}.$$
Actual computer number systems often differ in detail from the ideal one 
discussed here, but the differences are of only secondary relevance for the funda- 
mental problems of round-off. 


Typical floating-point systems in use correspond to the following values of 
the parameters: 


     β     s      m      M    Machine
     2    48   -975   1071    Control Data 6600
     2    27   -128    127    IBM 7090
    10     8    -50     49    IBM 650
     8    13    -51     77    Burroughs 5500
    16     6    -64     63    IBM System/360
    16    14    -64     63    IBM System/360


Any one computer may be able to store numbers in more than one system. 
For example, the IBM System/360 uses the last two base-16 floating-point 
systems for scientific work, and also a certain base-10 system for accounting 
purposes. 


[Figure: the 33-point floating-point set F plotted on the real line, with axis ticks from -4 to 4.]


F is not a continuum, nor even an infinite set. It has exactly 
$2(\beta - 1)\beta^{s-1}(M - m + 1) + 1$ numbers in it. These are not equally spaced throughout their 
range, but only between successive powers of $\beta$ and their negatives. The accompanying 
figure, reproduced from [3] by permission, shows the 33-point set 
F for the small illustrative system $\beta = 2$, $s = 3$, $m = -1$, $M = 2$. 
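
The count can be checked by enumerating F directly from the definition. The following Python sketch does so for the illustrative system; the function name `floating_point_set` is an assumption made here for illustration, and exact rational arithmetic is used so that the enumeration itself introduces no rounding.

```python
from fractions import Fraction

def floating_point_set(beta, s, m, M):
    """Return every member of F(beta, s, m, M) as an exact fraction, sorted."""
    values = {Fraction(0)}
    for e in range(m, M + 1):
        # Normalized mantissas .d1 d2 ... ds with d1 >= 1, written as k / beta**s.
        for k in range(beta ** (s - 1), beta ** s):
            v = Fraction(k, beta ** s) * Fraction(beta) ** e
            values.update((v, -v))
    return sorted(values)

F = floating_point_set(2, 3, -1, 2)
# The counting formula from the text, evaluated for beta=2, s=3, m=-1, M=2.
count_formula = 2 * (2 - 1) * 2 ** (3 - 1) * (2 - (-1) + 1) + 1
print(len(F), count_formula, F[-1])
```

The largest member of this toy set is 7/2 and the smallest positive member is 1/4, which is why the set is denser near zero than near its extremes.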

Because F is a finite set, there is no possibility of representing the continuum 
of real numbers in any detail. Indeed, real numbers in absolute value larger than 
the maximum member of F cannot be said to be represented at all. And, for 
many purposes, the same is true of nonzero real numbers smaller in magnitude 
than the smallest positive number in F. Moreover, each number in F has to 
represent a whole interval of real numbers. If x and y are two real numbers in 
the range of F, they will usually be represented by the same number in F whenever 
$|x - y|/|x| \le \tfrac{1}{2}\beta^{-s}$; it is not important to be more precise here. 

As a model of the real number system R, the set F has the arithmetic opera- 
tions defined on it, as carried out by the digital computer. Suppose x and y are 
floating-point numbers. Then the true sum x+y will frequently not be in F. 
(For example, in the 33-point system illustrated above let x=5/4 and y=3/8.) 
Thus the operation of addition, for example, must itself be simulated on the 
computer by an approximation called floating-point addition, whose result will 
be denoted by fl(x+y). Ideally, fl(x+y) should be that member of F which is 
closest to the true x+y (and either one, in case of a tie). In most computers this 
ideal is almost, but not quite, achieved. Thus in our toy 33-point set F we would 
expect that fl(5/4+3/8) would be either 3/2 or 7/4. The difference between 
fl(x+y) and x+y is called the rounding error in addition. 

The reason that 5/4+3/8 is not in the 33-point set F is related to the spacing 
of the members of F. On the other hand, a sum like 7/2+7/2 is not in F because 
7 is larger than the largest member of F. The attempt to form such a sum on most 
machines will cause a so-called overflow signal, and often the computation will 
be curtly terminated, for it is considered impossible to provide a useful approxi- 
mation to numbers beyond the range of F. 
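
A minimal sketch of idealized floating-point addition in the 33-point system, written in Python with exact fractions. The names `POSITIVE`, `F`, and `fl` are illustrative assumptions, and treating every value beyond the largest member as an overflow is a simplification of what real machines do.

```python
from fractions import Fraction

# Positive members of the toy system beta=2, s=3, m=-1, M=2: (k/8) * 2**e.
POSITIVE = sorted({Fraction(k, 8) * Fraction(2) ** e
                   for e in range(-1, 3) for k in range(4, 8)})
F = sorted([-v for v in POSITIVE] + [Fraction(0)] + POSITIVE)

def fl(x):
    """Round an exact value to a nearest member of F; raise on overflow."""
    if abs(x) > F[-1]:
        raise OverflowError(f"{x} is beyond the range of F")
    # On a tie, min() keeps the first (algebraically smaller) candidate.
    return min(F, key=lambda v: abs(v - x))

print(fl(Fraction(5, 4) + Fraction(3, 8)))   # true sum 13/8 is not in F
```

Calling `fl(Fraction(7, 2) + Fraction(7, 2))` raises `OverflowError`, mirroring the overflow signal described above.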

While quite a number of the sums x+y (for x, y in F) are themselves in F, 
it is quite rare for the true product x·y to belong to F, since it will always involve 
2s or 2s-1 significant digits. Thus the simulated multiplication operation, 
fl(x·y), involves rounding even more often than floating addition. Moreover, 
overflow is much more probable in a product. Finally, the phenomenon of 
underflow occurs in floating-point multiplication, when two nonzero numbers 
x, y have a nonzero product that is smaller in magnitude than the smallest nonzero 
number in F. (Underflow is also possible, though unusual, in addition.) 

The operations of floating-point addition and multiplication are commuta- 
tive, but not associative, and the distributive law fails for them also. Since these 
algebraic laws are fundamental to mathematical analysis, working with floating- 
point operations is very difficult for mathematicians. One of the greatest mathe- 
maticians of the century, John von Neumann, was able to collaborate in some 
large analyses with floating-point arithmetic (see [10]), but they were extremely 
ponderous. Even his genius failed to discover a method of avoiding nonassocia- 
tive analysis. Such a new method, called inverse error analysis, owes its origins 
to Cornelius Lanczos and Wallace Givens, and has been heavily exploited by 
J. H. Wilkinson. A detailed study of inverse error analysis is part of the subject 
of numerical analysis. We will mention it again in Section 5. 
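
The failure of associativity is easy to exhibit. The sketch below uses Python's `decimal` module to simulate a system with base 10 and three significant digits; the particular operands are chosen here for illustration and do not come from the paper.

```python
from decimal import Decimal, Context

ctx = Context(prec=3)  # base 10, three significant digits
a, b, c = Decimal("1.00"), Decimal("0.004"), Decimal("0.004")

left = ctx.add(ctx.add(a, b), c)    # fl(fl(a + b) + c): each small addend is lost
right = ctx.add(a, ctx.add(b, c))   # fl(a + fl(b + c)): the addends combine first
print(left, right)
```

The two groupings give different three-digit results, although the exact sums are of course equal.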


4. Two examples of round-off problems. One of the commonest functions 
in analysis is the exponential function $e^x$. Since it is so much used, it is essential 
to be able to have the value of $e^x$ readily available in a computer program, for 
any (not too large or small) floating-point number x. There is nowhere near 
enough storage to file a table of all values of $e^x$, so one must instead have an 
algorithm for recomputing $e^x$ whenever it is needed. (By an algorithm we mean 
a discrete process that is completely defined and guaranteed to terminate.) There 
are, in fact, a great many different methods such an algorithm could use, and 
most scientific computing systems include such an algorithm. But let us assume 
such an algorithm did not exist on your computer, and ask how you would pro- 
gram it. This is a realistic model of the situation for a more obscure transcen- 
dental function of analysis. 

Recall that, for any real (or complex) value of x, we can represent $e^x$ by the 
sum of the universally convergent infinite series 


$$e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots.$$


Since you learned mathematics because it is useful, you might expect to use the 
series to compute e*. Suppose—just for illustration—that your floating-point 
number system F is characterized by $\beta = 10$ and $s = 5$. Let us use the series for 
$x = -5.5$, as proposed by Stegun and Abramowitz [13]. Here are the numbers 
we get: 

$$\begin{aligned}
e^{-5.5} \approx{} & \phantom{+}1.0000 \\
 & -5.5000 \\
 & +15.125 \\
 & -27.730 \\
 & +38.129 \\
 & -41.942 \\
 & +38.446 \\
 & -30.208 \\
 & +20.768 \\
 & -12.692 \\
 & +6.9803 \\
 & -3.4902 \\
 & +1.5997 \\
 & \;\;\vdots \\
 ={} & +0.0026363
\end{aligned}$$

(The symbol “≈” means “equals approximately”.) The sum is terminated when 
the addition of further terms stops changing it, and this turns out to be after 
25 terms. Is this a satisfactory algorithm? It may seem so, but in fact 
$e^{-5.5} \approx 0.00408677$, so that the above series gets an answer that is in error by about 
36 percent! It is useless. 

What is wrong? Observe that there has been a lot of cancellation in forming 
the sum of this alternating series. Indeed, the four leading (i.e., most significant) 
digits of the eight terms that exceed 10 in modulus have all been lost. Professor 
D. H. Lehmer calls this phenomenon catastrophic cancellation, and it is fairly 
common in badly conceived computations. However, as Professor William 
Kahan has observed, this great cancellation is not the cause of the error in the 
answer—it merely reveals the error. The error had already been made in that 
the terms like 38.129, being limited to 5 decimal digits, can have only one digit 
that contributes to the precision of the final answer. It would be necessary for 
the term $(-5.5)^4/4!$ to be carried to 8 decimals (i.e., 9 leading digits) for it to 
include all 6 leading digits of the answer. Moreover, a tenth leading digit would 
be needed to make it likely that the fifth significant digit would be correct in 
the sum. The same is true of all terms over 10 in magnitude. 

While it is usually possible to carry extra digits in a computation, it is always 
costly in time and space. For this particular problem there is a much better cure, 
namely, compute the sum for x = 5.5 and then take the reciprocal of the answer: 


$$e^{-5.5} = \frac{1}{e^{5.5}} \approx \frac{1}{1 + 5.5 + 15.125 + \cdots} \approx 0.0040865, \text{ with our 5-decimal arithmetic.}$$


With this computation, the error is reduced to 0.007 percent. 
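
Both computations can be imitated with Python's `decimal` module set to five significant digits. This is only a sketch under stated assumptions: the function names are hypothetical, each term is formed from the previous one (which is not necessarily how the terms in the table were produced), and the rounding rule is the module's default, so the individual digits need not match the simulator used for the paper. The qualitative behaviour is what matters.

```python
import math
from decimal import Decimal, Context, localcontext

def exp_series(x, digits=5):
    """Sum 1 + x + x^2/2! + ... with every operation rounded to `digits` digits."""
    with localcontext(Context(prec=digits)):
        x = Decimal(x)
        total = term = Decimal(1)
        k = 0
        while True:
            k += 1
            term = term * x / k          # next term from the previous one
            new_total = total + term
            if new_total == total:       # further terms no longer change the sum
                return total
            total = new_total

def exp_via_reciprocal(x, digits=5):
    """For x < 0, sum the all-positive series for -x and take the reciprocal."""
    positive = exp_series(-Decimal(x), digits)
    with localcontext(Context(prec=digits)):
        return 1 / positive

true_value = math.exp(-5.5)
for name, value in (("direct", exp_series("-5.5")),
                    ("reciprocal", exp_via_reciprocal("-5.5"))):
    print(name, value, abs(float(value) - true_value) / true_value)
```

The direct alternating sum is wrong in its leading digit, while the reciprocal of the positive series agrees with the true value to nearly the full working precision.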

Note how much worse the problem would be if we wanted to compute $e^x$ for 
$x = -100$. 

Actual computer algorithms for calculating $e^x$ usually use a rational function 
of x, for x on an interval like $0 \le x \le 1$. For x outside this interval, well-known 
properties of the exponential function are used to obtain the answer from the 
rational approximation to $e^y$, where $y = x - [x]$. The creation of such algorithms 
for special functions is a branch of numerical analysis in which the general 
mathematician can hardly be an expert. On the other hand, it is part of the 
author’s contention that mathematics books ought to mention the fact that a 
Taylor’s series is often a very poor way to compute a function. 

I shall briefly state a second example. Recall from the calculus that 


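(1)  $\displaystyle \int_a^b \frac{dx}{x^{p}} = \left[\frac{x^{1-p}}{1-p}\right]_a^b = \frac{1}{1-p}\left(b^{1-p} - a^{1-p}\right) \qquad (p \neq 1).$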


Now using a floating-point system with $\beta = 10$ and $s = 6$, let us evaluate the 
above formula for $a = 1$, $b = 2$, and $p = 1.0001$. We have 


(2)  $\displaystyle I = \int_1^2 \frac{dx}{x^{1.0001}} = \frac{1 - 2^{-0.0001}}{0.0001}.$


If we use 6-place logarithms to evaluate $2^{-0.0001}$, we have 


$$\log_{10} 2 \approx 0.301030,$$
$$\log_{10} 2^{-0.0001} \approx -0.0000301030 = -1 + 0.999970,$$
whence, using our logarithm table again, 
$$2^{-0.0001} \approx 0.999930.$$


Thus, from (2), we get $I \approx 0.7$, an answer correct to only one digit. 

The precise meaning of the restriction to $\beta = 10$, $s = 6$ is not so clear in the 
evaluation of $2^{-0.0001}$ as it would have been in the previous example. However, 
the example does illustrate the fact that formula (1), which is precisely meaningful 
for real numbers as long as $p \neq 1$, is difficult to use with finite-precision 
arithmetic for p close to 1. Thus practical computation cannot admit the precise 
distinction between equality and inequality basic to pure mathematics. There 
are degrees of uncertainty caused by approximate equality. 
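
The loss of digits in formula (2) can be imitated as follows. This Python sketch rounds the power $2^{-0.0001}$ to six significant decimals before using it; it does not reproduce the logarithm-table route of the text, so the rounded power may differ in its last digit from the value quoted there. The reference value uses `math.expm1`, a choice made here for illustration and not a method proposed in the paper.

```python
import math
from decimal import Decimal, Context

ctx = Context(prec=6)                       # base 10, six significant digits
h = Decimal("0.0001")                       # p - 1

# 2**(-0.0001), rounded to six significant decimals before it is used.
power = ctx.create_decimal_from_float(2.0 ** -0.0001)
naive = ctx.divide(ctx.subtract(Decimal(1), power), h)

# Reference value in double precision: (1 - 2**-h)/h = -expm1(-h ln 2)/h.
reference = -math.expm1(-1e-4 * math.log(2.0)) / 1e-4
print(power, naive, reference)
```

Only the last one or two digits of the rounded power survive the subtraction from 1, so the quotient carries no more than that many meaningful digits.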


5. Solving quadratic equations. The two examples of Section 4 were taken 
from the calculus. But we don’t have to learn college mathematics to find al- 
gorithms. In ninth grade there is a famous algorithm for solving a quadratic 
equation, implicit in the following mathematical theorem: 


THEOREM. If a, b, c are real and $a \neq 0$, then the equation $ax^2 + bx + c = 0$ is 
satisfied by exactly two values of x, namely 


(3)  $x_1 = \dfrac{-b + \sqrt{b^2 - 4ac}}{2a}$

and

(4)  $x_2 = \dfrac{-b - \sqrt{b^2 - 4ac}}{2a}.$


Let us see how these formulas work when used in a straightforward manner 
to induce an algorithm for computing $x_1$ and $x_2$. This time we shall use a floating-point 
system with $\beta = 10$, $s = 8$, $m = -50$, $M = 50$; this has more precision than 
many widely used computing systems. 


CASE 1: $a = 1$, $b = -10^5$, $c = 1$. 


The true roots of the corresponding quadratic equation, correctly rounded 
to 11 significant decimals, are: 


    $x_1 \approx 99999.999990$ (true) 
    $x_2 \approx 0.000010000000001$ (true). 


If we use the expressions of the theorem, we compute 
    $x_1 \approx 100000.00$ (very good) 
    $x_2 \approx 0$ (100 percent wrong). 


(The reader is advised to be sure he sees how x2 becomes 0 in this floating-point 
computation.) 
Once again, in computing $x_2$ we have been a victim of catastrophic cancellation, 
which, as before, merely reveals the error we made in having chosen this 
way of computing $x_2$. There are various alternate ways of computing the roots 
of a quadratic equation that do not force such cancellation. One of them follows 
from the easily proved formulas, true if $abc \neq 0$: 


(5)  $x_1 = \dfrac{2c}{-b - \sqrt{b^2 - 4ac}},$

(6)  $x_2 = \dfrac{2c}{-b + \sqrt{b^2 - 4ac}}.$


Now, if b<0, there is cancellation in (4) and (5) but not in (3) and (6). And, if 
b>0, there is cancellation in (3) and (6), but not in (4) and (5). Special attention 
must be paid to cases where b or c is 0. 
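
The choice between the two pairs of formulas can be sketched in Python, with the `decimal` module simulating the eight-digit decimal system. The function names are hypothetical, and the sketch assumes real roots and nonzero a, b, c; it is far from the full algorithm whose requirements are set out in the paragraphs that follow.

```python
from decimal import Decimal, Context, localcontext

def roots_naive(a, b, c, digits=8):
    """Formulas (3) and (4), every operation rounded to `digits` digits."""
    with localcontext(Context(prec=digits)):
        a, b, c = Decimal(a), Decimal(b), Decimal(c)
        root = (b * b - 4 * a * c).sqrt()
        return (-b + root) / (2 * a), (-b - root) / (2 * a)

def roots_stable(a, b, c, digits=8):
    """Pick, by the sign of b, the pair of formulas that avoids cancellation."""
    with localcontext(Context(prec=digits)):
        a, b, c = Decimal(a), Decimal(b), Decimal(c)
        root = (b * b - 4 * a * c).sqrt()
        if b < 0:
            safe = -b + root                      # both parts positive
            return safe / (2 * a), 2 * c / safe   # formulas (3) and (6)
        safe = -b - root                          # both parts negative
        return 2 * c / safe, safe / (2 * a)       # formulas (5) and (4)

print(roots_naive("1", "-1E5", "1"))
print(roots_stable("1", "-1E5", "1"))
```

For Case 1 the naive pair returns a zero for the small root, while the selected pair recovers it to working precision.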

At this point I should like to propose the following criterion of performance 
of a computer algorithm for solving a quadratic equation. This is stated rather 
loosely here, but a careful statement will be found in [2]. 

We define a complex number z to be well within the range of F if either z=0 or 


$$\beta^{m+2} \le |\mathrm{Re}(z)| \le \beta^{M-2} \quad\text{and}\quad \beta^{m+2} \le |\mathrm{Im}(z)| \le \beta^{M-2}.$$


This means that the real and imaginary parts of z are safely within the magnitudes 
of numbers that can be closely approximated by a member of F. The 
arbitrary factor $\beta^2$ is included as a margin of safety. 

Suppose a, b, c are all numbers in F that are well within the range of F. Then 
they must be acceptable as input data to the quadratic equation algorithm. If 
a=b=c=0, the algorithm should terminate with a message signifying that all 
complex numbers satisfy the equation $ax^2 + bx + c = 0$. If a=b=0 and $c \neq 0$, then 
the algorithm should terminate with a message that no complex number satisfies 
the equation. 

Otherwise, let $z_1$ and $z_2$ be the exact roots of the equation, so numbered that 
$|z_1| \le |z_2|$. (If a=0, set $z_2 = \infty$.) Whenever $z_1$ is well within the range of F, the 
algorithm should determine a close approximation to $z_1$, in the sense of differing 
by not more than, say, $\beta + 1$ units in the least significant digit of the root. 

The same should be done for $z_2$. 

If either or both of the roots $z_i$ are not well within the range of F, then an 
appropriate message should be given and the root (if any) that is well within 
the range of F should be determined to within a close approximation. 

That concludes the loose specification of the desired performance of a quad- 
ratic equation solving algorithm. Let us return to a consideration of some typical 
equations, to see how the quadratic formulas work with them. 

CASE 2: $a = 6$, $b = 5$, $c = -4$. 

There is no difficulty in computing $x_1 \approx 0.50000000$ and $x_2 \approx -1.3333333$, or 
nearly these values, by whatever formula is used. 
CASE 3: $a = 6 \cdot 10^{30}$, $b = 5 \cdot 10^{30}$, $c = -4 \cdot 10^{30}$. 

Since the coefficients in Case 3 are those of Case 2, all multiplied by $10^{30}$, 
the roots are unchanged. However, application of any of the formulas (3)-(6) 
causes overflow to occur very soon, since $b^2 > 10^{50}$, out of the range of F. Probably 
this uniform large size of |a|, |b|, |c| could be detected before entering 
the algorithm, and all three numbers could be divided through by the factor 
$10^{30}$ to reduce the problem to Case 2. 

CASE 4: $a = 10^{-30}$, $b = -10^{30}$, $c = 10^{30}$. 

Here $z_1$ is near 1, while $z_2$ is near $10^{60}$. Thus our algorithm must determine 
$z_1$ very closely, even though $z_2$ is out of the range of F. Obviously any attempt to 
bring the coefficients to approximate equality in magnitude by simply dividing 
them all by the same number is doomed to failure, and might itself cause an 
overflow or underflow. This equation is, in fact, a severe test for a quadratic 
equation solver and even for the computing system in which the solver is run. 

The reader may think that a quadratic equation with one root out of the 
range of F and one root within the range of F is a contrived example of no practical 
use. If so, he is mistaken. In many iterative algorithms which solve a 
quadratic equation as a subroutine, the quadratics have a singular behavior in 
which $a \to 0$ as convergence occurs. One such example is Muller’s method [9] 
for finding zeros of general smooth functions of z. 

CASE 5: $a = 1.0000000$, $b = -4.0000000$, $c = 3.9999999$. 

Here the two roots are $z_1 \approx 1.999683772$, $z_2 \approx 2.000316228$. But applying the 
quadratic formulas (3), (4) gives 


$$z_1 = z_2 = 2.0000000,$$


with only the first four digits correct. These roots fail badly to meet my criteria, 
but the difficulty here is different from that in the other examples. The equation 
corresponding to Case 5 is the first of our equations in which a small relative 
change in a coefficient a, b, c induces a much larger relative change in the roots 
$z_1$, $z_2$. This is a form of instability in the equation itself, and not in the method 
of solving it. To see how unstable the problem is, the reader should show that 
the computed roots 2.0000000 are the exact roots of the equation 


$$0.999999992x^2 - 3.999999968x + 3.999999968 = 0,$$


in which the three coefficients differ, respectively, from the true a, b, c of Case 5 
by less than one unit in the last significant digit. In this sense one can say that 
2, 2 are pretty good roots for Case 5. 
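
The exercise left to the reader can be confirmed in exact rational arithmetic. In this Python sketch the variable names are illustrative; the coefficients are those printed above.

```python
from fractions import Fraction

given = [Fraction("1.0000000"), Fraction("-4.0000000"), Fraction("3.9999999")]
nearby = [Fraction("0.999999992"), Fraction("-3.999999968"), Fraction("3.999999968")]

a, b, c = nearby
discriminant = b * b - 4 * a * c      # zero means a double root
double_root = -b / (2 * a)
# Largest coefficient change; one unit in the last digit of the data is 1e-7.
largest_change = max(abs(p - q) for p, q in zip(given, nearby))
print(discriminant, double_root, float(largest_change))
```

The discriminant of the nearby equation vanishes exactly and its double root is exactly 2, while no coefficient has moved by as much as one unit in the seventh decimal.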

This last way of looking at rounding errors is called the inverse error approach 
and has been much exploited by J. H. Wilkinson. In general, it is characterized 
by asking how little a change in the data of a problem would be necessary to 
cause the computed answers to be the exact solution of the changed problem. 
The more intuitive way of looking at round off, the direct error approach, simply 
asks how wrong the answers are as solutions of the problem with its given data. 
While both methods are useful, the important feature of inverse error analysis is 


940 G. E. FORSYTHE [November 


that in many large matrix or polynomial problems, it can permit us easily to 
continue to use associative operations, and this is often very difficult with direct 
error analysis. 

Despite the elementary character of the quadratic equation, it is probably 
still true that not more than five computer algorithms exist anywhere that meet 
the author’s criteria for such an algorithm. Creating such an algorithm is not a 
very deep problem, but it does require attention to the goal and to the details 
of attaining the goal. It illustrates the sort of place that an undergraduate 
mathematics or computer science major can make a substantial contribution to 
computer libraries. 

I wish to acknowledge that the present section owes a great deal to lectures 
by Professor William Kahan of the University of California, Berkeley, given at 
Stanford in the Spring of 1966. 


6. Solving linear systems of equations. As the high school student moves 
from ninth grade on to tenth or eleventh, he will encounter the solution of sys- 
tems of linear algebraic equations by Gauss’ method of eliminating unknowns. 
With a little systematization, it becomes another algorithm for general use. I 
would like to examine it in the simple case of two equations in two unknowns, 
carried out on a computer with $\beta = 10$, $s = 3$. 

Let the equation system be one treated by Forsythe and Moler [3]: 


(7)  $\begin{cases} 0.000100x + 1.00y = 1.00 \\ 1.00x + 1.00y = 2.00. \end{cases}$


The true solution, rounded correctly to the number of decimals shown, is 
    $x = 1.00010$, $y = 0.99990$ (truly rounded). 


The Gauss elimination algorithm uses the first equation (if possible) to 
eliminate the first variable, x, from the second equation. Here this is done by 
multiplying the first equation by 10000 and then subtracting it from the second 
equation. When we work to three significant digits, the resulting system takes 
the form 

$\begin{cases} 0.000100x + 1.00y = 1.00 \quad \text{(the old first equation)} \\ -10000y = -10000. \end{cases}$

For just two equations, this completes the elimination of unknowns. Now 
commences the back solution. One solves the new second equation for y, finding 
that $y \approx 1.00$. This value is substituted into the first equation, which is then 
solved for x. One then finds $x \approx 0.00$. In summary, we have found 

    $y \approx 1.00$ 
    $x \approx 0.00$. 


Of course, this is awful! What went wrong? There was certainly no long accumu- 
lation of round-off errors, such as might be feared in a large problem. Nor is the 
original problem unstable of itself, as it would be if the lines represented by the 
two equations (7) were nearly parallel. 

There is one case in which it is impossible to eliminate x from the second 
equation—when the coefficient of x in the first equation is exactly 0. Were such 
an exact 0 to occur, the Gauss algorithm is preceded by interchanging the equa- 
tions. Now, once again, if an exact zero makes a mathematical algorithm im- 
possible, we should expect that a near zero will give a floating-point algorithm 
some kind of difficulty. That is a sort of philosophical principle behind what 
went wrong. And, in fact, the division by the nearly zero number 0.0001 intro- 
duced some numbers (10000) that simply swamped the much smaller, but essen- 
tial, data of the second equation. That is what went wrong. 

How could this be avoided? The answer is simple, in this case. If it is essential 
to interchange equations when a divisor is actually zero, one may suspect that 
it would be important, or at least safer, to interchange them when the coefficient 
of x in the first equation is much smaller in magnitude than the coefficient of x 
in the second equation. A careful round-off analysis given by J. H. Wilkinson 
[14] proves this to be the case, and good linear equation solvers will make the 
interchange whenever necessary to insure that the largest coefficient of x (in 
magnitude) is used as the divisor. Thus the elimination yields the system 


$\begin{cases} 1.00x + 1.00y = 2.00 \\ 1.00y = 1.00. \end{cases}$


After the back solution we find 


    $y \approx 1.00$ 
    $x \approx 1.00$, 
a very fine result. 
This algorithm, with its interchanges, can be extended to m equations in n 
unknowns, and is a basic algorithm found in most computing centers. 


The following example shows that there remains a bit more to the construc- 
tion of a good linear equation solver. Consider the system 


(8)  $\begin{cases} 10.0x + 100000y = 100000 \\ 1.00x + 1.00y = 2.00. \end{cases}$

If we follow the above elimination procedure, we see that interchanging the 
equations is not called for, since 10.0>1.00. Thus one multiplies the first equation 
by 0.100 and subtracts it from the second. One finds afterwards, still working 
with $\beta = 10$, $s = 3$, that 

$\begin{cases} 10.0x + 100000y = 100000 \\ -10000y = -10000. \end{cases}$


Back solving, one finds 
    $y \approx 1.00$ 
    $x \approx 0.00$! 


This is just as bad as before, for system (8) has the same solution as (7). 
Indeed, system (8) is easily seen to be identical with (7), except that the first 
equation has been multiplied through by 100000. 

So the advice to divide by the largest element in the column of coefficients 
of x is not satisfactory for an arbitrary system of equations. What seems to be 
wrong with the system (8) is that the first equation has coefficients that are too 
large for the problem. Before entering the Gaussian elimination algorithm with 
interchanges, it is necessary to scale the equations so that the coefficients are 
roughly of the same size in all equations. This concept of scaling is not com- 
pletely understood as yet, although in most practical problems we are able to do 
it well enough. 
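
The three-digit computations for systems (7) and (8) can be replayed with Python's `decimal` module. The sketch below is a two-equation toy, not a general solver: the function name and its `pivot` and `scale` switches are assumptions made here, and the scaling rule used (divide each equation by its largest coefficient) is only one simple choice among several.

```python
from decimal import Decimal, Context, localcontext

def solve_2x2(rows, digits=3, pivot=True, scale=False):
    """Gauss elimination on [[a11, a12, b1], [a21, a22, b2]] in `digits`-digit decimals."""
    with localcontext(Context(prec=digits)):
        r = [[Decimal(v) for v in row] for row in rows]
        if scale:
            # Divide each equation by its largest coefficient in magnitude.
            r = [[v / max(abs(row[0]), abs(row[1])) for v in row] for row in r]
        if pivot and abs(r[1][0]) > abs(r[0][0]):
            r[0], r[1] = r[1], r[0]          # use the larger x-coefficient as divisor
        mult = r[1][0] / r[0][0]
        a22 = r[1][1] - mult * r[0][1]
        b2 = r[1][2] - mult * r[0][2]
        y = b2 / a22                         # back solution
        x = (r[0][2] - r[0][1] * y) / r[0][0]
        return x, y

system7 = [["0.000100", "1.00", "1.00"], ["1.00", "1.00", "2.00"]]
system8 = [["10.0", "100000", "100000"], ["1.00", "1.00", "2.00"]]

print(solve_2x2(system7, pivot=False))       # no interchange
print(solve_2x2(system7))                    # with interchange
print(solve_2x2(system8))                    # interchange rule alone
print(solve_2x2(system8, scale=True))        # scale first, then interchange
```

Interchanging rescues system (7) but not system (8); scaling the equations first brings (8) back to the form of (7), after which the interchange works again.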

If you were faced with having to solve a nonsingular system of linear alge- 
braic equations of order 26, for example, you might wonder how to proceed. 
Some mathematics books express the solution by Cramer’s rule, in which each 
of the 26 components is the quotient of a different numerator determinant by a 
common denominator determinant. If you looked elsewhere, you might find 
that a determinant of order 26 is the sum of 26! terms, each of which is the 
product of 26 factors. If you decide to proceed in this manner, you are going to 
have to perform about 25-26! multiplications, not to mention a similar number 
of additions. On a fast contemporary machine, because of the time required to 
do preparatory computations, you would hardly perform more than 100,000 
multiplications per second. And so the multiplications alone would require about 
$10^{17}$ years, if all went well. The round-off error would usually be astronomical. 

In fact, the solution can be found otherwise in about $(1/3) \cdot 26^3 \approx 5859$ multiplications 
and a like number of additions, and should be entirely finished in 
well under half a second, with very little round-off error. So it can pay to know 
how to solve a problem. 
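
The orders of magnitude can be reproduced with a few lines of Python. The sketch assumes that Cramer's rule needs 27 determinants in all (26 numerators and one denominator), each costing $25 \cdot 26!$ multiplications, and a year of 365.25 days; these bookkeeping choices are made here for illustration.

```python
import math

n = 26
mults_per_det = (n - 1) * math.factorial(n)   # 26! terms, 25 multiplications each
cramer_mults = (n + 1) * mults_per_det        # 26 numerators plus one denominator
rate = 100_000                                # multiplications per second, as in the text
seconds_per_year = 365.25 * 24 * 3600

cramer_years = cramer_mults / rate / seconds_per_year
gauss_mults = n ** 3 / 3                      # elimination count quoted in the text
gauss_seconds = gauss_mults / rate
print(f"{cramer_years:.1e} years versus {gauss_seconds:.3f} seconds")
```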

I wish to leave you with the feeling that there is more to solving linear equa- 
tions than you may have thought. 


7. When do we have a good solution? Another example of a linear algebraic 
system has been furnished by Moler [8]: 


(9)  $\begin{cases} 0.780x + 0.563y - 0.217 = 0 \\ 0.913x + 0.659y - 0.254 = 0. \end{cases}$

Someone proposes two different approximate solutions to (9), namely 

    $(x_1, y_1) = (0.999, -1.001)$ 
and 
    $(x_2, y_2) = (0.341, -0.087)$. 
Which one is better? The usual check is to substitute them both into (9). We 
obtain 


$\begin{cases} 0.780x_1 + 0.563y_1 - 0.217 = -0.001343 \\ 0.913x_1 + 0.659y_1 - 0.254 = -0.001572 \end{cases}$

and 

$\begin{cases} 0.780x_2 + 0.563y_2 - 0.217 = -0.000001 \\ 0.913x_2 + 0.659y_2 - 0.254 = 0. \end{cases}$


It seems clear that $(x_2, y_2)$ is a better solution than $(x_1, y_1)$, since it makes the 
residuals far smaller. 

However, in fact the true solution is (1, -1), as the reader can verify easily. 
Hence $(x_1, y_1)$ is far closer to the true solution than $(x_2, y_2)$! 

A persistent person may ask again: which solution is really better? Clearly 
the answer must depend on one’s criterion of goodness: a small residual, closeness 
to the true solution, or perhaps something else. Surely one will want different 
criteria for different problems. The pitfall to be avoided here is the belief that 
all such criteria are necessarily satisfied, if one of them is. 
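
The two criteria can be compared directly. Here is a small sketch in exact rational arithmetic, so that no round-off of its own intrudes. The coefficients are those of (9); the first candidate, (0.999, -1.001), is the pair that reproduces the residual -0.001572 quoted above, and the function names are only illustrative.

```python
from fractions import Fraction as F

# System (9): a*x + b*y - c = 0 for each row (a, b, c).
rows = [(F("0.780"), F("0.563"), F("0.217")),
        (F("0.913"), F("0.659"), F("0.254"))]
true_solution = (F(1), F(-1))

def residuals(x, y):
    """Left-hand sides of (9) at (x, y), computed exactly."""
    return [a * x + b * y - c for a, b, c in rows]

def max_error(x, y):
    """Largest componentwise distance from the true solution."""
    return max(abs(x - true_solution[0]), abs(y - true_solution[1]))

candidate_1 = (F("0.999"), F("-1.001"))
candidate_2 = (F("0.341"), F("-0.087"))

for name, point in (("candidate 1", candidate_1), ("candidate 2", candidate_2)):
    r = [float(v) for v in residuals(*point)]
    print(f"{name}: residuals {r}, max error {float(max_error(*point))}")
```

The second candidate wins on residuals by three orders of magnitude and loses on distance to the true solution by about the same factor.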


8. Sensitivity of certain problems. We now show that certain computa- 
tional problems are surprisingly sensitive to changes in the data. This aspect of 
numerical analysis is independent of the floating-point number system. 

We first consider the zeros of polynomials in their dependence on the
coefficients. In Case 5 of Section 4 above, we noted that, while the polynomial
$x^2 - 4x + 4$ has the double zero 2, 2, the rounded roots of the polynomial equation


(10)    $x^2 - 4x + 3.9999999 = 0$


are 1.999683772 and 2.000316228. Thus the change of just one coefficient from 
4 to 3.9999999 causes both roots to move a distance of approximately .000316228. 
The displacement in the root is about 3162 times as great as the displacement in 
the coefficient. 

The instability just described is a common one, and results from the fact
that the square root of a small $\epsilon$ is far larger than $\epsilon$. For the
roots of (10) are the roots of


        $(x - 2)^2 = \epsilon, \qquad \epsilon = .0000001,$


and these are clearly $2 \pm \sqrt{\epsilon}$. For equations of higher degree, a still
more startling instability would have been possible.
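
As a quick check of these figures, the following sketch evaluates $2 \pm \sqrt{\epsilon}$ with Python's `decimal` module at 30 significant digits. The precision setting and the variable names are my own choices, not anything from the original computation.

```python
from decimal import Decimal, getcontext

getcontext().prec = 30                  # far more digits than the example needs

eps = Decimal("0.0000001")              # change in the constant coefficient
shift = eps.sqrt()                      # displacement of each root

lower, upper = 2 - shift, 2 + shift
amplification = shift / eps             # root displacement per unit coefficient change

print(round(lower, 9), round(upper, 9))
print(amplification)
```

The amplification factor is $1/\sqrt{\epsilon}$, so it grows without bound as the perturbation shrinks.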

However, it is not only for polynomials with nearly multiple zeros that insta- 
bility can be observed. The following example is due to Wilkinson [14]. Let 


        $p(x) = (x - 1)(x - 2) \cdots (x - 19)(x - 20)$
             $= x^{20} - 210x^{19} + \cdots.$


The zeros of $p(x)$ are 1, 2, ..., 19, 20, and are well separated. This example
evolved at a place where the floating-point number system had $\beta = 2$, $s = 30$.
To enter a typical coefficient into the computer, it was necessary to round it to
30 significant base-2 digits. Suppose that a change in the 30-th most significant
base-2 digit is made in only one of the twenty coefficients. In fact, suppose that
the coefficient of $x^{19}$ is changed from $-210$ to $-210 - 2^{-23}$. How much effect
does this small change have on the zeros of the polynomial?

To answer this, Wilkinson carefully computed (using $\beta = 2$, $s = 90$) the roots
of the equation $p(x) - 2^{-23}x^{19} = 0$. These are now listed, correctly rounded to
the number of digits shown:


         1.00000 0000        10.09526 6145 ± 0.64350 0904i
         2.00000 0000        11.79363 3881 ± 1.65232 9728i
         3.00000 0000        13.99235 8137 ± 2.51883 0070i
         4.00000 0000        16.73073 7466 ± 2.81262 4894i
         4.99999 9928        19.50243 9400 ± 1.94033 0347i
         6.00000 6944
         6.99969 7234
         8.00726 7603
         8.91725 0249
        20.84690 8101


Note that the small change in the coefficient —210 has caused ten of the 
zeros to become complex, and that two have moved more than 2.81 units off the 
real axis! Of course, to enter p(x) completely into the computer would require 
many more roundings, and actually computing the zeros could not fail to cause 
still more errors. The above table of zeros was produced by a very accurate com- 
putation, and does not suffer appreciably from round-off errors. The reason these 
zeros moved so far is not a round-off problem—it is a matter of sensitivity. 
Clearly zeros of polynomials of degree 20 with well-separated zeros can be much 
more sensitive to changes in the coefficients than you might have thought. 
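
One way to see where the sensitivity comes from is a first-order estimate, which is my addition and not Wilkinson's computation. If the coefficient of $x^{19}$ changes by $\delta$, the zero at $x = k$ moves by approximately $-\delta k^{19} / p'(k)$, where $p'(k) = \prod_{j \ne k} (k - j)$. The sketch below evaluates this exactly for $\delta = -2^{-23}$; the function name is hypothetical.

```python
from fractions import Fraction
from math import prod

delta = Fraction(-1, 2**23)     # change in the coefficient of x^19, as in the text

def first_order_shift(k):
    """Linearized displacement of the zero x = k of p(x) + delta * x**19."""
    dp = prod(k - j for j in range(1, 21) if j != k)    # p'(k), an exact integer
    return -delta * Fraction(k**19, dp)

shifts = {k: first_order_shift(k) for k in range(1, 21)}
worst = max(shifts, key=lambda k: abs(shifts[k]))

for k, s in shifts.items():
    print(f"zero {k:2d}: estimated shift {float(s):+.3e}")
print("most sensitive zero:", worst)
```

For the small zeros the estimate agrees with the table (the zero at 5 moves by about $-7.2 \cdot 10^{-8}$). For the zeros in the middle teens the linear estimate runs into the hundreds, which only says that the perturbation is far too large for a linear theory there; that is the regime in which the zeros collide and leave the real axis.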

To motivate a second example, let me quote a standard theorem of algebra:
In the ring of square matrices of fixed order n, if AX = I, where I is the identity
matrix of order n, then XA = I.

It follows from this theorem and continuity considerations that, if A is a
fixed matrix and X a variable one, and if $AX - I \to \Theta$, the zero matrix, then also
$XA - I \to \Theta$. Hence, if $AX - I$ is small in some sense, then $XA - I$ is also small.
However, as with polynomials, one's intuition may not be very good at guessing
how small these smallnesses are. Here is an example: Fix


        $A = \begin{bmatrix} 9999 & 9998 \\ 10000 & 9999 \end{bmatrix}.$

Let

        $X = \begin{bmatrix} 9999.9999 & -9997.0001 \\ -10001 & 9998 \end{bmatrix}.$


Then a computation without round-off shows that

        $AX - I = \begin{bmatrix} .0001 & .0001 \\ 0 & 0 \end{bmatrix}.$


From the last equality the reader may conclude that X is close, though not
equal, to the unique inverse $A^{-1}$. However, another calculation without
round-off shows that

        $XA - I = \begin{bmatrix} 19997.0001 & 19995.0003 \\ -19999 & -19997 \end{bmatrix}.$


Thus the quantities $AX - I$ and $XA - I$, which must vanish together, can be of
enormously differing magnitudes in a sensitive situation, even for matrices of
order 2.

The true inverse matrix is given by

        $A^{-1} = \begin{bmatrix} 9999 & -9998 \\ -10000 & 9999 \end{bmatrix},$


and this is hardly close to X.
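
Both products can be checked without round-off by working in rational arithmetic. This is a sketch of that check; the entries of A and X are as I read them from the example ($A$ with rows 9999, 9998 and 10000, 9999; $X$ with rows 9999.9999, -9997.0001 and -10001, 9998), and the helper names are mine.

```python
from fractions import Fraction as F

def matmul(P, Q):
    """Product of two 2 x 2 matrices given as nested lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def minus_identity(M):
    """Return M - I for a 2 x 2 matrix M."""
    return [[M[i][j] - (1 if i == j else 0) for j in range(2)] for i in range(2)]

A = [[F(9999), F(9998)], [F(10000), F(9999)]]
X = [[F("9999.9999"), F("-9997.0001")], [F(-10001), F(9998)]]

AX_I = minus_identity(matmul(A, X))
XA_I = minus_identity(matmul(X, A))

print("AX - I =", [[float(v) for v in row] for row in AX_I])
print("XA - I =", [[float(v) for v in row] for row in XA_I])
```

The right residual $AX - I$ has entries of size $10^{-4}$, while the left residual $XA - I$ has entries of size $2 \cdot 10^{4}$.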


9. A least-squares problem of Hilbert. The following least-squares problem 
was discussed by the great mathematician David Hilbert [6], and leads to some 
interesting matrices. Fix $n \ge 1$. Let $f(t)$ be given and continuous for
$0 \le t \le 1$. We wish to approximate $f(t)$ as well as we can by a polynomial
$x_1 + x_2 t + x_3 t^2 + \cdots + x_n t^{n-1}$ of degree $n - 1$. To be more precise,
we wish to determine $x_1, x_2, \ldots, x_n$ so that


        $\Phi(x) = \int_0^1 [f(t) - x_1 - x_2 t - x_3 t^2 - \cdots - x_n t^{n-1}]^2 \, dt$


is as small as possible. It is not difficult to show that the minimizing vector of
coefficients x exists, is unique, and can be determined by solving the system of
n simultaneous equations


(11)    $\partial \Phi / \partial x_i = 0 \qquad (i = 1, 2, \ldots, n).$


If you carry out the algebra, you find that (11) is equivalent to the system
of n linear algebraic equations


(12)    $Ax = b,$

where

(13)    $a_{ij} = \int_0^1 t^{i-1} t^{j-1} \, dt = \frac{1}{i + j - 1} \qquad (i, j = 1, 2, \ldots, n)$

and

(14)    $b_i = \int_0^1 t^{i-1} f(t) \, dt \qquad (i = 1, 2, \ldots, n).$


The matrix A of coefficients in (12) is now called the Hilbert matrix (of order
n), and is denoted by $H_n$:


        $H_n = \begin{bmatrix}
            1 & 1/2 & 1/3 & \cdots & 1/n \\
            1/2 & 1/3 & 1/4 & \cdots & 1/(n+1) \\
            \vdots & \vdots & \vdots & & \vdots \\
            1/n & 1/(n+1) & 1/(n+2) & \cdots & 1/(2n-1)
        \end{bmatrix}.$


The equations (12) with matrix $A = H_n$ are called the normal equations for this
least-squares problem. It appears that all one has to do is to find and use a
quadrature rule for approximating the $b_i$ in (14), and then solve the system (12).
This is certainly the standard advice in books on practical statistics.

However, what is observed is that for n bigger than 8 or 9 (the threshold
depends on the system used), programs for solving linear equations in ordinary
floating-point precision are simply unable to solve (12). Moreover, for problems
that can be solved (say n = 6), there are enormous differences in the solution
vectors x for apparently identical problems on slightly different machines. Why
all this trouble?

Let me try to explain the sensitivity of the problem first. Let $T_n = H_n^{-1}$. Then
it can be proved that


              [    36      -630      3360      -7560       7560      -2772 ]
              [  -630     14700    -88200     211680    -220500      83160 ]
        T_6 = [  3360    -88200    564480   -1411200    1512000    -582120 ]
              [ -7560    211680  -1411200    3628800   -3969000    1552320 ]
              [  7560   -220500   1512000   -3969000    4410000   -1746360 ]
              [ -2772     83160   -582120    1552320   -1746360     698544 ]


This means that a change of $10^{-6}$ in just the one element $b_5$ will produce changes
in the solution vector x of

        $(.00756, -.2205, 1.512, -3.969, 4.41, -1.74636)^T.$


Such changes are unavoidable in a system with $\beta = 10$ and $s = 7$. This means that
some of the coefficients of the best fitting polynomial of degree 5 will have
unavoidable uncertainties of the order of 4 units. This may give some explanation
of the instability in the answers. More details are in Section 19 of [3].
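
The inverse $T_6$ and the response to a change in $b_5$ can be reproduced exactly. The following is only a sketch: it uses plain Gauss-Jordan elimination over Python's `Fraction` type, which is my choice for an exact check and is not a method recommended for floating-point work.

```python
from fractions import Fraction

def hilbert(n):
    """Hilbert matrix of order n, with entries 1 / (i + j - 1) for i, j = 1..n."""
    return [[Fraction(1, i + j + 1) for j in range(n)] for i in range(n)]

def inverse(M):
    """Invert a nonsingular matrix by Gauss-Jordan elimination, exactly."""
    n = len(M)
    aug = [row[:] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(M)]
    for col in range(n):
        pivot_row = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot_row] = aug[pivot_row], aug[col]
        pivot = aug[col][col]
        aug[col] = [v / pivot for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

T6 = inverse(hilbert(6))
t6 = max(abs(v) for row in T6 for v in row)

# Change in x caused by a change of 1e-6 in b_5 alone: 1e-6 times column 5 of T6.
delta_x = [row[4] * Fraction(1, 10**6) for row in T6]

print("largest element of T6:", t6)
print("change in x:", [float(v) for v in delta_x])
```

The largest entry of $T_6$, 4410000, is the one that multiplies the change in $b_5$ to give the 4.41 in the fifth component.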

Here are approximate values of $t_n$, the maximum elements in $T_n$, for $n \le 10$:


         n     t_n                        n     t_n

         2     $1.20 \cdot 10^{1}$         6     $4.41 \cdot 10^{6}$
         3     $1.92 \cdot 10^{2}$         7     $1.33 \cdot 10^{8}$
         4     $6.48 \cdot 10^{3}$         8     $4.25 \cdot 10^{9}$
         5     $1.79 \cdot 10^{5}$         9     $1.22 \cdot 10^{11}$
                                          10     $3.48 \cdot 10^{12}$


It cannot be demonstrated here, but if $t_n \gg \beta^s$, you just cannot solve the
system $H_n x = b$ with s-digit arithmetic in base $\beta$.

The conclusion of this example is that one should not follow a statistics book
blindly here. It is much better to arrange things so that matrices of Hilbert
type do not arise, even approximately. And when they do, one must be sure to
use enough precision that $t_n \ll \beta^s$. There are other ways of attacking
least-squares problems which are less sensitive to the data.


10. Instability in solving ordinary differential equations. The standard
initial-value problem for a single ordinary differential equation $dy/dx = f(x, y)$
is to determine $y(x)$ as accurately as possible for $x > 0$, given $y(0)$. In one very
common class of methods (the multistep methods) of solving this problem
approximately, one picks a fixed interval $h > 0$, and determines $y_n$ to approximate
$y(nh)$ for $n = 1, 2, \ldots$. One highly recommended multistep method in
desk-computing days was the Milne-Simpson method. Here one let $y_0 = y(0)$, the given
initial value, and determined $y_1$ by some method not mentioned here. Let
$y_n' = f(nh, y_n)$. The basic idea was to determine $y_{n+1}$ from $y_{n-1}$ and $y_n$
$(n = 1, 2, \ldots)$ by the integral


(15)    $y_{n+1} = y_{n-1} + \int_{(n-1)h}^{(n+1)h} f(x, y(x)) \, dx.$


Since the integral in (15) cannot usually be evaluated exactly, Milne's actual
idea was to approximate it by Simpson's formula, and so let


(16)    $y_{n+1} = y_{n-1} + \frac{h}{3} (y_{n-1}' + 4y_n' + y_{n+1}').$


At the time we seek to find $y_{n+1}$ from (16) we know $y_{n-1}$ and $y_n$, and hence
$y_{n-1}'$ and $y_n'$; but $y_{n+1}'$ is not known. For general f, Milne [7] determined
the solution of (16) by an iterative process that is irrelevant to the present
discussion. Let us merely assume that $y_{n+1}$ has been found so that (16) holds,
where $y_{n+1}' = f((n+1)h, y_{n+1})$, and that this has been done for
$n = 1, 2, \ldots$, as far as we wish to go. This method was highly recommended by
Milne for solution of ordinary differential equations at a desk calculator, and it
seemed to work very well indeed. Most problems were probably solved within 30 steps
or less.

As soon as automatic digital computers arrived on the scene, users of the 
Milne-Simpson method started to find extraordinary behavior in certain prob- 
lems. To illustrate what happened, let us take the very simple test problem 


        $dy/dx = f(x, y) = -y$, with $y(0) = 1$.


The true solution, of course, is $y = e^{-x}$.

Take $h = 0.1$, and carry out the Milne-Simpson process with $y_0 = 1$ and
$y_1 = 0.90483742$, an 8-decimal correctly rounded value of $e^{-0.1}$. This is not
something you can do in your head, and so I will give you the results, as computed
on a system with $\beta = 10$, $s = 8$:


           x       y computed              e^{-x}

          .2       .81873069               .81873075
          .3       .74081817               .74081822
         8.0       .00033519912            .00033546263
         8.1       .00030380960            .00030353914
        13.2       .00000036689301         .0000018506012
        13.3       .0000032084360          .0000016744932
        13.4      -.000000070769248        .0000015151441


We see that by x = 8.0 a noticeable oscillation has set in, whereby successive
values of $y_n$ alternate in being too low and too high. By x = 13.4 this oscillation
has grown so violent that it has (for the first time) actually thrown the sign of
$y_n$ negative, which is unforgivable in anything simulating a real exponential
function!

The Milne-Simpson method is very accurate, in that the Simpson formula 
is an accurate approximation to the above integral. What can be the matter? 

Since $f(x, y) = -y$, we can explicitly write down the formula (16) in the form


        $y_{n+1} = y_{n-1} - \frac{h}{3} (y_{n-1} + 4y_n + y_{n+1}).$

Thus the computed $\{y_n\}$ satisfy the 3-term recurrence relation

(17)    $\left(1 + \frac{h}{3}\right) y_{n+1} + \frac{4h}{3} y_n - \left(1 - \frac{h}{3}\right) y_{n-1} = 0.$

We know that the general solution of (17) takes the form


(18)    $y_n = A_1 \lambda_1^n + A_2 \lambda_2^n,$

where $\lambda_1$, $\lambda_2$ are the roots of


(19)    $\left(1 + \frac{h}{3}\right) \lambda^2 + \frac{4h}{3} \lambda - \left(1 - \frac{h}{3}\right) = 0.$

Some algebra and elementary analysis show that

        $\lambda_1 = 1 - h + O(h^2),$ as $h \to 0$,

        $\lambda_2 = -\left(1 + \frac{h}{3}\right) + O(h^2),$ as $h \to 0$.


Putting these values of $\lambda_1$, $\lambda_2$ into (18), and using the relation
$nh = x$, we find that, for small h,


        $y_n \approx A_1 (1 - h)^n + (-1)^n A_2 \left(1 + \frac{h}{3}\right)^n$

            $= A_1 (1 - h)^{x/h} + (-1)^n A_2 \left(1 + \frac{h}{3}\right)^{(3/h)(x/3)}$

            $\approx A_1 e^{-x} + (-1)^n A_2 e^{x/3}.$


The first term is the desired solution if $A_1 = 1$, and the second is an unwelcome
extra solution of the difference equation (17) of the Milne-Simpson method. Now
the initial conditions might have been chosen exactly so that $A_1 = 1$ and $A_2 = 0$.
(They were roughly of this nature.) Had they been so chosen, and if the solution
could have proceeded without round-off error, the unwanted term in $A_2$ would
never have appeared. But, in fact, a small amount of this solution was admitted
by the initial condition, and some more of it crept in as the result of round-off.
Then, after enough steps, the size of $e^{x/3}$ caused the unwanted term to dominate
the solution, with its oscillating sign.
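
The growth of the unwanted term is easy to watch on a modern machine. The sketch below steps the recurrence (17) for $dy/dx = -y$ in ordinary double precision, starting from $y_0 = 1$ and the 8-decimal value of $y_1$ used above. It is not a reconstruction of the original 8-digit decimal run, so individual digits will differ from the table; the function name is mine.

```python
import math

def milne_simpson_decay(h, y1, steps):
    """Step recurrence (17) for dy/dx = -y from y0 = 1 and the given y1.

    Returns the list [y_0, y_1, ..., y_steps].
    """
    ys = [1.0, y1]
    for _ in range(steps - 1):
        # (1 + h/3) y_{n+1} = (1 - h/3) y_{n-1} - (4h/3) y_n
        y_next = ((1 - h / 3) * ys[-2] - (4 * h / 3) * ys[-1]) / (1 + h / 3)
        ys.append(y_next)
    return ys

h = 0.1
ys = milne_simpson_decay(h, 0.90483742, 200)
first_negative = next(n for n, y in enumerate(ys) if y < 0)

for n in (2, 3, 80, 81):
    print(f"x = {n * h:5.1f}   computed {ys[n]:.11e}   exact {math.exp(-n * h):.11e}")
print(f"first negative value at x = {first_negative * h:.1f}")
```

Since double precision contributes almost no round-off here, nearly all of the unwanted component enters through the rounded starting value $y_1$ and the mismatch between $\lambda_1$ and $e^{-h}$. The sign change nevertheless shows up in the same neighbourhood as in the table.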

This disaster never occurred in desk computation, so far as we know, be- 
cause at a desk one just doesn’t carry out enough steps. Professor Milne has 
just told me, however, that he did occasionally observe harmless oscillations in 
the low-order digits. 

The moral of this example is that not only are math books not enough, but
even old numerical analysis books are not enough to keep you out of some
pitfalls!


11. Instability in solving a partial differential equation. The following is
a simple problem for the heat equation. Suppose a homogeneous insulated rod
of length 1 is kept at temperature 0 at one end, and at temperature 1 at the other
end. If the entire rod is initially at temperature 0, how does it warm up?

Let $u = u(x, t)$ denote the temperature at time t at that part of the rod that
is x units from the cold end. Then, if the units are chosen to make the
conductivity 1, the temperature u satisfies the differential equation

(20)    $\frac{\partial^2 u}{\partial x^2} = \frac{\partial u}{\partial t} \qquad (0 < x < 1;\ t > 0),$


with end and initial conditions

        $u(0, t) = 0 \qquad (t > 0),$

(21)    $u(1, t) = 1 \qquad (t > 0),$

        $u(x, 0) = 0 \qquad (0 < x < 1).$


This problem can perhaps best be solved by separation of variables and
Fourier series. But let us apply the method of finite differences, which might in
any case be needed for a more difficult problem. To do this, we divide the length
of the rod into equal intervals, each of length h. And we divide the time interval
$[0, \infty)$ into equal intervals of length k. Instead of trying to determine $u(x, t)$
for all x and t, we shall limit ourselves to computing $u(x, t)$ on the discrete net
of points of type $(mh, nk)$, for integers m, n. The heat equation (20) can then
be simulated by a number of finite-difference equations, of which we pick one:


(22)    $\frac{u(x - h, t) - 2u(x, t) + u(x + h, t)}{h^2} = \frac{u(x, t + k) - u(x, t)}{k}.$


Equation (22) can be used to determine $u(x, t)$ for all net points in the
infinite strip of the problem, as follows: Solve (22) for $u(x, t + k)$ in terms of
$u(x - h, t)$, $u(x, t)$, $u(x + h, t)$. Thus compute $u(x, k)$ for
$x = h, 2h, \ldots, (n - 1)h$ in terms of the given initial conditions on the line
$t = 0$. The given end conditions give $u(0, k)$ and $u(1, k)$. With this set of values
of u at all points of the net with $t = k$, we can continue and compute all values on
the net for $t = 2k$, etc. The computation is very attractive, because each new value
of $u(x, t + k)$ is determined explicitly from (22); there is no need to solve a
system of simultaneous equations.

How does the solution behave? To try a case, we pick $h = 0.1$ and $k = 0.01$.
Thus the rod is represented by 9 interior points and two endpoints, and we get
a solution at time steps 0.01 apart. Just to show the behavior of the solution of
(22), we give the value of the temperature $u(0.5, t)$ at the midpoint of the rod,
computed with $\beta = 10$, $s = 8$, for selected times:


           t        u(0.5, t) computed for k = 0.01

          0         0
          0.05      1
          0.06      -4
          0.07      16
          0.15      132276
          0.20      -28157050
          ...
          0.99      $+1.0196022 \cdot 10^{44}$
          1.00      $-2.9590007 \cdot 10^{44}$.



The values in the table are ridiculous, of course. It is a classical example of
instability. Common sense and mathematics both tell us that the real temperature
can never get outside the range of $0 \le u(x, t) \le 1$. Our difference-equation
problem is a disastrous model of the continuous problem, even though both
difference expressions in (22) are reasonable models of the derivatives in (20).

This terrible pitfall has been known for at least 20 years, and yet new prob- 
lem solvers keep on rediscovering it. 

It is interesting to note that if one selects a time step only half as long, the
computation proceeds very nicely. Here is the corresponding table of values of
$u(0.5, t)$ for a computation ($\beta = 10$, $s = 8$) with $h = 0.1$, $k = 0.005$:


           t        u(0.5, t) computed for k = 0.005

          0         0
          0.05      .10937500
          0.06      .14599609
          0.07      .17956543
          0.15      .35637261
          0.20      .41304382
          1.00      .49997173.


The values of the midpoint temperature appear to be converging to 0.5, as 
they obviously should in the physical problem. 

What is the reason for the great difference in behavior between $k = 0.005$
and $k = 0.01$? The matter can be analyzed in many ways, and here is one simple
approach. Let $\lambda = k/h^2$. Then, from (22),


(23)    $u(x, t + k) = \lambda u(x - h, t) + (1 - 2\lambda) u(x, t) + \lambda u(x + h, t).$


Hence, if $0 < \lambda \le 1/2$, the formula (23) represents $u(x, t + k)$ as a weighted
average with nonnegative weights of $u(x - h, t)$, $u(x, t)$, and $u(x + h, t)$. Hence
$u(x, t + k)$ will always be between the maximum and minimum values of $u(x, t)$.
But if $\lambda > 1/2$, the weights alternate in sign and thus permit a solution in which


        $|u(x, t + k)| = \lambda |u(x - h, t)| + (2\lambda - 1) |u(x, t)| + \lambda |u(x + h, t)|.$


Here the sum of the weights is $4\lambda - 1 > 1$. This permits an exponential growth
of a solution with an alternating sign pattern.

Thus the condition $0 < \lambda = k/h^2 \le 1/2$ is essential to keep the solution
bounded. A deeper discussion found, for example, in Forsythe and Wasow [4]
proves that the solution of (22) converges to the solution of (20) uniformly for
all $(x, t)$ with $0 \le x \le 1$, $0 < t \le T < \infty$, as $h \to 0$, $k \to 0$ in such a way
that $k/h^2 \le 1/2$.

The proof of convergence and an analysis of the stability of (22) can be
carried out by means of Fourier analysis. The stability can be examined in more
detail by studying the eigenvalues and eigenvectors of the linear transformation
(23) that maps each line of solutions onto the next line.

Note that in our two tables we had $\lambda = 1$ and $\lambda = 1/2$, respectively.
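
The whole experiment fits in a few lines. This sketch applies (23) on a net with 10 space intervals. It is parametrized directly by $\lambda$ rather than by h and k, so that $\lambda = 1$ and $\lambda = 1/2$ are represented exactly in binary floating point. The function name is mine, and the hot end is assumed to be held at 1 from $t = 0$ on.

```python
def midpoint_history(m, lam, steps):
    """Explicit scheme (23) on m space intervals; returns u(0.5, t) after each step.

    lam is the ratio k / h**2. The rod starts at temperature 0, with the ends
    held at 0 and 1.
    """
    u = [0.0] * m + [1.0]
    mid = m // 2
    history = [u[mid]]
    for _ in range(steps):
        new = u[:]                       # the end values u[0] and u[m] stay fixed
        for i in range(1, m):
            new[i] = lam * u[i - 1] + (1 - 2 * lam) * u[i] + lam * u[i + 1]
        u = new
        history.append(u[mid])
    return history

unstable = midpoint_history(10, 1.0, 100)    # h = 0.1, k = 0.01
stable = midpoint_history(10, 0.5, 200)      # h = 0.1, k = 0.005

print("lambda = 1:   t = 0.05, 0.06, 0.07 ->", unstable[5:8])
print("lambda = 1/2: t = 0.05 ->", stable[10], "  t = 1.00 ->", stable[200])
```

With $\lambda = 1$ the midpoint values after 5, 6, and 7 steps are 1, -4, 16, as in the first table. With $\lambda = 1/2$ every value stays between 0 and 1 and the midpoint creeps up toward 0.5.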


12. Round-off errors in polynomial deflation. Our final example, due to
Wilkinson [14], shows a more subtle effect of round-off error that arises in the
course of finding polynomial zeros. The quartic polynomial


        $P_4(x) = x^4 - 6.7980x^3 + 2.9948x^2 - 0.043686x + 0.000089248$

has zeros that, correctly rounded, are as follows:


        0.0024532, 0.012576, 0.45732, 6.32565.


I. Suppose first that we somehow compute the zero 0.0024532, and then
deflate $P_4$ to a cubic by dividing $P_4(x)$ by $x - 0.0024532$, using $\beta = 10$,
$s = 5$. If we do, the resulting cubic has zeros


        0.012576, 0.457315, 6.32561,


so that the main error introduced by this deflation is a change of the largest
zero by 4 units in its last place.

II. Suppose, on the other hand, that we first compute the zero 6.3256, and
then deflate $P_4$ to a cubic by dividing $P_4(x)$ by $x - 6.3256$, again using 5-place
decimal arithmetic. If so, the resulting cubic has the zeros


        $0.0026261 \pm 0.064339i$, 0.467148.


We have perturbed two of the remaining zeros beyond recognition, and have
changed the second significant digit of the third!

Thus it appears to matter a great deal which zero of $P_4$ we locate first. For
the present case we can get a feeling for what is happening by examining the
process of division of $P_4(x)$ by the linear factors. We use detached coefficients:

First, the division by $x - 0.0024532$:


        1  - 6.7980     + 2.9948        - 0.043686       + 0.000089248
           - 0.0024532  + 0.0166707206  - 0.00730587492  + 0.000089247416


        1  - 6.7955     + 2.9781        - 0.036380.

Thus the cubic that results from the first deflation is

        $P_3(x) = x^3 - 6.7955x^2 + 2.9781x - 0.036380.$


Moreover, a careful examination of the division shows that $P_3(x)$ is exactly
(i.e., without round-off) equal to the quotient of


        $\bar{P}_4(x) = x^4 - 6.7979532x^3 + 2.9947707206x^2 - 0.04368587492x + 0.000089247416$


by $x - 0.0024532$. Hence the zeros of $P_3$ are exactly the zeros of $\bar{P}_4$ except for
0.0024532. Note that all the coefficients of $P_4$ and $\bar{P}_4$ are quite close, so it is
reasonable to expect that the zeros of $P_4$ and $\bar{P}_4$ should be close (as they are).
Now we show the deflation by $x - 6.3256$:


        1  - 6.7980  + 2.9948      - 0.043686    + 0.000089248
           - 6.3256  + 2.98821344  - 0.04174896  + 0.0122526872


        1  - 0.4724  + 0.0066      - 0.001937.


Thus the result of this deflation is a cubic $P_3(x) = x^3 - 0.4724x^2 + 0.0066x
- 0.001937$. Again, $P_3(x)$ is exactly the quotient of


        $\bar{P}_4(x) = x^4 - 6.7980x^3 + 2.99481344x^2 - 0.04368596x + 0.0122526872$


by $x - 6.3256$. Note that $P_4$ and $\bar{P}_4$ differ very much in their constant terms.
Hence the product of the roots of $\bar{P}_4$ must be very different from that for $P_4$.
This is an explanation for the great shift of the zeros of $P_3$.

Further analysis shows that the shift in zeros during this kind of deflation 
is generally small when deflation is made with zeros of small modulus, and is 
generally large when deflation is based on zeros of large modulus. Thus it is 
better to get zeros of small modulus first in using a polynomial solver with defla- 
tion in the above manner. 

Of course, any zero of a deflated polynomial can be refined by use of the origi- 
nal polynomial, and that is normally done. But, zeros that change as much as 
those above are difficult to refine, since the refinement process may converge to 
the wrong zero. 
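
Both deflations can be replayed with Python's `decimal` module set to five significant digits. This is a sketch under two assumptions of mine: every product and every sum is rounded to five digits, and ties are rounded to even. The original computation does not state its rounding rule, and the function name is hypothetical.

```python
from decimal import Context, Decimal, ROUND_HALF_EVEN

FIVE_DIGITS = Context(prec=5, rounding=ROUND_HALF_EVEN)

def deflate(coeffs, zero, ctx=FIVE_DIGITS):
    """Divide a polynomial (leading coefficient first) by (x - zero).

    Each product and each sum is rounded to ctx.prec significant digits.
    Returns the quotient coefficients; the remainder is discarded.
    """
    zero = Decimal(zero)
    quotient = [Decimal(coeffs[0])]
    for c in coeffs[1:-1]:
        product = ctx.multiply(zero, quotient[-1])
        quotient.append(ctx.add(Decimal(c), product))
    return quotient

P4 = ["1", "-6.7980", "2.9948", "-0.043686", "0.000089248"]

small_first = deflate(P4, "0.0024532")
large_first = deflate(P4, "6.3256")

print("deflating by the smallest zero:", [str(c) for c in small_first])
print("deflating by the largest zero: ", [str(c) for c in large_first])
```

In the second case the coefficient 0.0066 is what remains of 2.9948 after a five-digit product of nearly the same size has been subtracted from it. Only two significant digits survive, and the constant term of the cubic inherits the damage.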


13. Conclusions. Around ten years ago, when I last read a number of them, 
most mathematics books that dealt with numerical methods at all were from 
ten to fifty years out of date. In the past ten years, many excellent new methods 
have been devised for most of the elementary problems—methods that are well 
adapted to automatic computers, and work well. Let me cite a few examples of 
important algorithms hardly known ten years ago: 

1. For getting eigenvalues of stored square matrices, there is an excellent 
method that starts with the transformation of Householder (1958), and follows 
it with the QR-algorithm of Francis (1961-62) and Kublanovskaja (1961). It is 
the method of choice for most problems. For references, see Wilkinson [15]. 

2. For solving ordinary differential equations, special methods have been 
developed by Gear [5], Osborne [11], and many others which can deal with so- 
called stiff equations. (Roughly speaking, a stiff equation is one whose solutions 
contain very rapidly decaying transients which contribute nothing to the long- 
term solution, but which interfere drastically with most numerical methods of 
solving the equation.) 

3. For evaluating the definite integral of a smooth function of one real vari- 
able, the method of Romberg (see vol. 2 of Ralston and Wilf [12]) has proved 
to be very useful. 

4. For minimizing a smooth real-valued function of n real variables, a variant
by Fletcher and Powell [1] of a method of Davidon is far superior to anything
used in the 1950's. And there are still more recent methods.

Many other examples could be given. Indeed, the 1960’s have proved almost 
explosive in the number of newly invented algorithms that have supplanted 
those known earlier. Of the methods known years ago for common numerical 
problems, only Gauss’ systematic elimination method for solving linear alge- 
braic equation systems with dense, stored matrices remains supreme today, and 
even it must be augmented with scaling and pivoting decisions, as we noted in 
Section 6 above. Newton’s method for solving a nonlinear system of equations 
is still much used today, though it has strong competition from newer methods. 

Because of my knowledge of mathematics texts ten years ago, and my knowl- 
edge of the explosive increase in numerical methods in the 1960’s, I am confident 
that today’s mathematics books cannot be trusted to include important knowl- 
edge about computer methods. As we noted in Section 10 above, you can’t trust 
early numerical analysis textbooks either. 

On the other hand, there are experts in numerical analysis. They have socie- 
ties in which methods are presented and discussed. The Society for Industrial 
and Applied Mathematics (SIAM) and the Special Interest Group on Numerical 
Mathematics (SIGNUM) of the Association for Computing Machinery (ACM) 
are the most active in this country. There are a number of journals with impor- 
tant information. For a start, you might consult the keyword-in-context index 
of Computing Reviews, the review journal published by ACM, as well as the 
algorithms in the Communications of the ACM and in Numerische Mathematik.
Modern monographs and textbooks in numerical analysis are slowly appearing, 
and the beginner might profitably consult Ralston and Wilf [12], especially 
volume 2. 

It might be noted as a digression that, just as mathematics departments 
mainly ignore modern numerical analysis, so also the newly created computer 
science departments often give the subject little attention, since they are so 
busy with a variety of important nonnumerical fields. Thus numerical analysts 
remain a small corps of specialists whose greatest appreciation probably comes 
from the users of mathematical programs. 

Students of mathematics are well equipped to read about numerical methods. 
Why should they repeat the classical blunders of generations past? Why aren’t 
they informed of the existence of good numerical methods, and roughly where 
to find them? 

Remembering that most students take mathematics in order to apply it on 
computers, I ask why mathematics courses shouldn't reflect a true awareness of 
how computing is done? Why shouldn’t students demand in their mathematics 
courses a greater awareness of the points of contact of pure mathematics and 
its practice on a computer? 

Of course, a mathematics instructor can shrug his shoulders and say that 
actual computing problems don’t interest him, and suggest that his students 
contact a numerical analyst sometimes. If the instructor actually says this out 
loud, it at least has the virtue that the students may realize immediately that
the mathematics is not applicable directly instead of having to discover it for 
themselves. It still sounds irresponsible to me. After all, society has been sup- 
porting mathematicians pretty well for the past 25 years—not because mathe- 
matics is a beautiful art form, which it is—but because mathematics is useful, 
which it also is. But this would seem to imply that a mathematician should 
convey some awareness of the main ways in which his subject is used. 

On the other hand, a mathematics course cannot really include very much 
numerical analysis. Wilkinson’s treatise [15] on computing eigenvalues is 700 
pages long, and can hardly be summarized in every course on linear algebra! As 
a practical matter, then, the mathematics instructor’s main responsibility is to 
be aware of the main features of practical computing in the areas of his mathe- 
matics courses, and mention occasional points of contact, while giving his stu- 
dents pertinent references to important algorithmic materials in other books. 

If one just ignores the relations between mathematics and its important
applications, I fear that an instructor is running the risk of being exposed by
some technological chapter of the Students for a Democratic Society for not being
relevant, and that is a very nasty accusation nowadays. Why risk it?
(NR 044 211), the National Science Foundation (GJ 798), and the Atomic Energy Commission
purpose of the United States Government.

ON LINEAR INEQUALITIES IN COMPLEX SPACE
R. N. KAUL, University of Delhi 


1. Introduction. In [1] Eisenberg has given a characterization of the set of all
supports of a convex function. This extends the famous Farkas theorem [2, 3]
and is useful in mathematical programming. The generalization of Farkas'
theorem for a complex space has been obtained by Levinson [4]. Its extension,
which is the object of the present note, is contained in the theorem that follows.

Let A be a complex $m \times n$ matrix. The conjugate transpose of A will be
denoted by $A^*$. If u and v are two complex vectors with the same number of
components, then


        $u^* v = \overline{v^* u} = \sum_i \bar{u}_i v_i.$


Let $a^{(j)}$ be the columns of the matrix A and $\xi$ an $n \times 1$ complex vector. Then we
write


        $A\xi = \sum_{j=1}^{n} a^{(j)} \xi_j = t.$


If $\alpha$ is a real vector such that $0 \le \alpha_j \le \pi/2$ for $j = 1, 2, \ldots, n$, then by

        $|\arg t| \le \alpha$, resp., $|\arg t| \le (\pi/2) - \alpha$,


we mean that


        $|\arg t_j| \le \alpha_j$, resp., $|\arg t_j| \le (\pi/2) - \alpha_j$.
We now state the theorem: 


THEOREM. Let A be an $m \times n$ matrix with complex entries, C an $n \times n$ hermitian
positive semi-definite matrix, and b a complex $n \times 1$ vector. Then


        $(z^* C z)^{1/2} + \mathrm{Re}(b^* z) \ge 0$

for all $n \times 1$ complex vectors z satisfying


(1)     $|\arg Az| \le \alpha$

if and only if there exists an $m \times 1$ vector $\xi$ and an $n \times 1$ vector $\eta$ such that

(2)     $A^* \xi = b + C\eta,$

(3)     $|\arg \xi| \le (\pi/2) - \alpha,$

and $\eta$ satisfies (1) and

(4)     $\eta^* C \eta \le 1.$


The above theorem is used as the principal tool for establishing the duality
in nonlinear programming in the complex space [6]. For proof of the theorem we
need the following lemmas.


LEMMA 1. If z, w are any complex vectors of the same dimensions and C is a
hermitian positive semi-definite matrix of the appropriate order, then


        $\mathrm{Re}(z^* C w) \le (z^* C z)^{1/2} (w^* C w)^{1/2}.$

Proof. Apply the Cauchy-Schwarz inequality to the semidefinite inner product
$[z, w] = z^* C w$:

        $\mathrm{Re}[z, w] \le |[z, w]| \le [z, z]^{1/2} [w, w]^{1/2}.$

LEMMA 2. The set of all vectors $w = A^* \xi - C\eta$, where

        $|\arg \xi| \le (\pi/2) - \alpha, \quad |\arg A\eta| \le \alpha, \quad \eta^* C \eta \le 1,$

is closed.


Proof. Suppose that u is in the closure of the set. There exist sequences $\{\xi_k\}$,
$\{w_k\}$, and $\{\eta_k\}$ such that


(5)     $|\arg \xi_k| \le (\pi/2) - \alpha,$

(6)     $|\arg A\eta_k| \le \alpha,$

(7)     $\eta_k^* C \eta_k \le 1,$

(8)     $A^* \xi_k - C\eta_k = w_k$


and $\{w_k\}$ converges to u.
Let us assume temporarily that the sequence $\{\eta_k\}$ has a limit point $\eta$. It
follows from (6) that


(9)     $|\arg A\eta| \le \alpha$

and the inequality (7) yields

(10)    $\eta^* C \eta \le 1.$

Now for each complex vector z satisfying


(11)    $|\arg Az| \le \alpha$




it follows from (5) that

(12)    $\mathrm{Re}(\xi_k^* A z) \ge 0.$

As $w_k \to u$ and $\eta_k \to \eta$, therefore (8) and (12) yield

(13)    $\mathrm{Re}\{(u^* + \eta^* C) z\} \ge 0.$


Using Levinson's generalization of Farkas' theorem for a complex space [4], we
see from (12) and (13) that there exists a $\xi$ such that


        $A^* \xi = u + C\eta, \quad |\arg \xi| \le (\pi/2) - \alpha,$


which shows that u belongs to the set of vectors w.

Now we shall show that the sequence $\{\eta_k\}$ can always be chosen so that it
has a limit point $\eta$. Let $\{\delta_k\}$ be a sequence with elements $\delta_k$ in the complex
n-space and H = {R | R is a nonnegative integer and there exists a $k_0$ such that for
all $k \ge k_0$, $\delta_k$ has at most R components with real part non-zero}. Then H has a
least element; call this r. We now define the (pseudo) norm of $\{\delta_k\}$ to be the
least element of H, say

        $|\{\delta_k\}| = r.$


Since

(14)    $|\{A\eta_k\}| + |\{\eta_k\}| \le m + n$


it is always possible to choose $\{\eta_k\}$ and $\{\xi_k\}$ satisfying (5), (6) and (7) such
that (14) is minimal. We shall show that the sequence $\{\eta_k\}$ indeed has a limit
point in that case.

Assume that the sequence $\{\eta_k\}$ has no limit point, so that

$$|\eta_k| = (\eta_k^*\eta_k)^{1/2} \to \infty.$$

We may assume that $|\eta_k| > 0$ for all $k$. Let $z_k = \eta_k|\eta_k|^{-1}$. Then $\{z_k\}$ is bounded; hence there is a complex vector $z$ such that some subsequence of $\{z_k\}$ converges to $z$, where $|z_k| = 1$, $|\arg Az_k| \le \alpha$, and $z_k^* C z_k \le 1/|\eta_k|^2$ for all $k$. Thus it follows that

$$|\arg Az| \le \alpha \tag{15}$$

and

$$z^*Cz = 0. \tag{16}$$


As a consequence of (16), Lemma 1 yields

$$\operatorname{Re}(w^* Cz) = 0 \tag{17}$$

for each complex vector $w$. Also the complex vector $z$ is the limit of $z_k$. If $z$ has a component whose real part is non-zero, then all but finitely many $\eta_k$ must have the same component with real part non-zero. Hence if $\lambda_k$ is any sequence of real numbers, then

$$|\{\eta_k - \lambda_k z\}| \le |\{\eta_k\}|.$$


Assume first that the inequality (15) is true in the form $\operatorname{Re}(Az) \ne 0$. Let $A_j$, $j = 1, 2, \cdots, m$, denote the $j$th row of $A$, and let

$$\lambda_k = \min \frac{\operatorname{Re}(A_j\eta_k)}{\operatorname{Re}(A_j z)}, \quad j = 1, 2, \cdots, m, \text{ and } \operatorname{Re}(A_j z) > 0.$$

Clearly

$$\operatorname{Re}\{A(\eta_k - \lambda_k z)\} \ge 0.$$

On the other hand the relation (17) implies that

$$(\eta_k - \lambda_k z)^* C(\eta_k - \lambda_k z) = \eta_k^* C\eta_k \le 1.$$

This demonstrates the existence of the sequence $\{\eta_k - \lambda_k z\}$ satisfying (6) and (7). Again the manner in which $\lambda_k$ is selected shows that for sufficiently large $k$, eventually each $A(\eta_k - \lambda_k z)$ has at least one more component whose real part is zero than $A\eta_k$, thus contradicting the minimality assumption in (14). Hence $\operatorname{Re}(Az) = 0$.

Since $z \ne 0$, we can define $\lambda_k$ so that for sufficiently large $k$, $\eta_k - \lambda_k z$ has at least one more component with real part zero than $\eta_k$. Therefore

$$|\{\eta_k - \lambda_k z\}| < |\{\eta_k\}|, \quad \text{and} \quad |\{A(\eta_k - \lambda_k z)\}| = |\{A\eta_k\}|.$$

Hence

$$|\{\eta_k - \lambda_k z\}| + |\{A(\eta_k - \lambda_k z)\}| < |\{\eta_k\}| + |\{A\eta_k\}|,$$

again contradicting the minimality of (14). This completes the proof.
Proof of the Theorem. To prove sufficiency, assume that there exist vectors $\eta$ and $\xi$ satisfying (1) and (3) respectively such that

$$\eta^*C\eta \le 1, \quad A^*\xi = b + C\eta.$$

From (2) follows the relation

$$z^* A^*\xi = z^*b + z^*C\eta. \tag{18}$$

But $|\arg\{(Az)_j^*\xi_j\}| \le |\arg (Az)_j| + |\arg \xi_j| \le \pi/2$ using (1) and (3). Thus $\operatorname{Re}((Az)^*\xi) \ge 0$. Hence (18) yields

$$\operatorname{Re}(z^*b + z^*C\eta) \ge 0. \tag{19}$$

This last inequality implies by (4) and Lemma 1 that

$$\operatorname{Re}(b^*z) + (z^*Cz)^{1/2} \ge 0.$$

To prove the necessity, we assume that for each vector $z$ such that $|\arg Az| \le \alpha$ it is true that

$$\operatorname{Re}(b^*z) + (z^*Cz)^{1/2} \ge 0.$$




We now suppose that

$$A^*\xi - C\eta = w \ne b \tag{20}$$

for any $\xi$ and $\eta$ satisfying (1), (3), and (4). In particular we can take $\eta = 0$ and $\xi = \eta = 0$, obtaining respectively

$$A^*\xi = w \ne b \tag{21}$$

and

$$0 = w \ne b. \tag{22}$$

Our supposition implies [5] that there exist $\eta_0$ and a scalar $k$ such that

$$\operatorname{Re}(w^*\eta_0) \ge k > \operatorname{Re}(b^*\eta_0) \tag{23}$$

for all $w$ defined by (20). In particular the inequalities in (23) hold for all $w$ defined by (21) and (22). From (21) and (23) it follows that $\operatorname{Re}(\xi^*A\eta_0) \ge k$ for all $\xi$ satisfying (3), which establishes that $\eta_0$ must obey the condition (1). Hence the given hypothesis together with the inequality (23) shows that

$$k > \operatorname{Re}(b^*\eta_0) \ge -(\eta_0^* C\eta_0)^{1/2}. \tag{24}$$

But

$$k \le 0, \tag{25}$$

which follows from the inequality (23) on substitution of $w = 0$ from (22). Thus the relation $(\eta_0^* C\eta_0)^{1/2} > 0$ is obtained from (24) and (25). Finally we remark that we can set $\xi = 0$ and $\eta = (\eta_0^* C\eta_0)^{-1/2}\eta_0$ to obtain $w = -(\eta_0^* C\eta_0)^{-1/2}C\eta_0$. With this value of $w$, we obtain from (23) and (24)

$$(\eta_0^* C\eta_0)^{1/2} < (\eta_0^* C\eta_0)^{1/2},$$

which is a contradiction. This completes the proof of the theorem.
The author is deeply indebted to Professor Bertrand Mond for several valuable suggestions.
MORE GENERALIZATIONS OF A THEOREM OF PAPPUS
L. E. PURSELL, University of Missouri-Rolla 


Let $C_p$, a plane curve of class C™, move through Euclidean 3-space in such a way that it is always perpendicular to the path $C_s$ of its centroid (of arc length), $C_s$ being a curve of class C® such that its curvature is positive everywhere. Let $C_p \times C_s$ denote the surface generated. In their paper, Generalizations of the theorems of Pappus, this MONTHLY, 76 (1969), 355-366 (hereafter referred to as [G]), A. W. and Gary Goodman (hereafter referred to collectively as "Goodman") assume that $C_p$ is fixed to the moving $(T, N, B)$ frame and show that if $C_s$ is a plane curve, then the Pappus formula

$$\|C_p \times C_s\| = |C_p| \cdot |C_s|$$

holds (here $|\cdot|$ denotes arc length and $\|\cdot\|$ denotes area). For the special case in which $C_p$ is a circle, Goodman's result reduces to the Theorem of Pappus referred to in our title. But for the special case in which $C_p$ is a line segment and $C_s$ is a circle, Goodman found that if $C_p$ makes a uniform spin around its own midpoint of total amount $\theta$ as it moves around $C_s$, then

$$\|C_p \times C_s\| > |C_p| \cdot |C_s|.$$

They also show, for the special case in which $C_p$ is a line segment which always has the direction of the unit normal $N$ of $C_s$, that if $C_s$ is a twisted curve (i.e. its torsion is not identically zero), then again $\|C_p \times C_s\| > |C_p| \cdot |C_s|$.

In this paper we explore further the case in which $C_s$ is a twisted curve, and the effect of $C_p$ spinning around $C_s$. Using a general formula derived by Goodman, we show:


THEOREM 1. If the curvature of $C_s$ is not too great (the appropriate upper bound is given below) and if $C_p$ is not a circle and is allowed to spin about $C_s$ in such a way that $\|C_p \times C_s\|$ is minimum, then (regardless of whether $C_s$ is plane or twisted) there is a unique (modulo a constant) spin function $\theta$ which gives this minimum area and this minimum area is given by the Pappus rule

$$\|C_p \times C_s\| = |C_p| \cdot |C_s|. \tag{1}$$


If $C_p$ is a circle, then, of course, spinning $C_p$ has no effect on the surface generated and this surface would be a tube surface whose area would be given by the Pappus rule as is well known (cf. [C, p. 275, ex. 7]).

To prove Theorem 1, consider Goodman's equation [G, eq. 49]:

$$\|C_p \times C_s\| = \iint \left[(\tau + \theta')^2 (ff' + gg')^2 + (1 + \kappa f\cos\theta - \kappa g\sin\theta)^2\right]^{1/2} dt\,ds, \tag{2}$$

$0 \le s \le |C_s|$, $0 \le t \le |C_p|$. Here as in [G], $s$ and $t$ denote arc length along $C_s$ and $C_p$, respectively, and $\tau = \tau(s)$ and $\kappa = \kappa(s)$ are the torsion and curvature of $C_s$. The curve $C_p$ is given by $X = f(t)$, $Y = g(t)$, where $X$ and $Y$ are rectangular coordinates in the plane of $C_p$ with the centroid of $C_p$ at $(X, Y) = (0, 0)$. The spin function $\theta = \theta(s)$ is given by the equations

$$(1, 0)_{XY} = -\cos\theta\,N - \sin\theta\,B, \qquad (0, 1)_{XY} = \sin\theta\,N - \cos\theta\,B, \tag{3}$$

where the unit vectors $(1, 0)_{XY}$ and $(0, 1)_{XY}$ are given in $X, Y$-coordinates and $N$ and $B$ are the unit normal and binormal vectors of $C_s$. The angle $\theta$ is the angle from $-B$ to $(0, 1)_{XY}$ (cf. [G, fig. 1 and eq. (33)]).

Since the integrand on the right of (2) involves a sum of squares, one obtains from standard integral inequalities:

$$\|C_p \times C_s\| \ge \iint |1 + \kappa f\cos\theta - \kappa g\sin\theta|\,dt\,ds, \tag{4}$$

$0 \le s \le |C_s|$, $0 \le t \le |C_p|$.

If $C_p$ were a circle, then $(ff' + gg')(t) = 0$ for all $t$ and the equality in (4) would hold. We are assuming $C_p$ is not a circle, i.e. $(ff' + gg')(t)$ is not identically zero. Since $ff' + gg'$ is independent of $s$ and $\tau + \theta'$ is independent of $t$, the equality in (4) holds, i.e. $\|C_p \times C_s\|$ is minimum, if and only if $(\tau + \theta')(s) = 0$.

We also assume

$$\kappa(s) \le [g(t)\sin\theta(s) - f(t)\cos\theta(s)]^{-1} \tag{5}$$

for all $(s, t)$, $0 \le s \le |C_s|$, $0 \le t \le |C_p|$. (This is the upper bound on $\kappa$ referred to above.) To interpret (5) geometrically, observe that (5) is equivalent to:

$$(\text{radius of curvature}) = [\kappa(s)]^{-1} \ge g(t)\sin\theta(s) - f(t)\cos\theta(s).$$

(Recall that we assume $\kappa(s) > 0$ for all $s$.) Now if we denote the position vector from the centroid of $C_p$ to a point $(X, Y) = (f(t), g(t))$ on $C_p$ by $P(t)$ (here we are not following the notation of [G]; $P(t)$ corresponds to their vector R,(t)), then from (3) we have

$$P(t) \cdot N(s) = g(t)\sin\theta(s) - f(t)\cos\theta(s).$$

Therefore (5) is equivalent to:

$$[\kappa(s)]^{-1} \ge P(t) \cdot N(s). \tag{5'}$$


Hence assumption (5) means that for each value of $s$ the curve $C_p$ does not cross the polar axis (the line through the center of curvature which is parallel to the binormal $B$ of $C_s$), i.e., no part of $C_p$ ever backs up as the centroid of $C_p$ moves forward along $C_s$. (More precisely, $C_p$ must be contained in a closed half-plane bounded by the polar axis.) This assumption is the counterpart of the requirement in the classical Pappus Theorem that $C_p$ does not cross the axis of rotation.

Assuming (5) and $(\tau + \theta')(s) = 0$ for all $s$, equation (2) becomes

$$\|C_p \times C_s\| = \iint dt\,ds + \int \kappa\cos\theta\,ds \int f\,dt - \int \kappa\sin\theta\,ds \int g\,dt, \tag{6}$$

but $\int f\,dt = \int g\,dt = 0$ since the centroid of arc length of $C_p$ is at the origin of the $X, Y$ coordinate system. Hence (6) reduces to the Pappus formula

$$\|C_p \times C_s\| = \iint dt\,ds = |C_p| \cdot |C_s|, \tag{7}$$

establishing Theorem 1.
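A small numerical experiment can illustrate the role of the spin term in equation (2). The sketch below is only an illustration under assumed data that do not come from the paper: the centroid path is a circle of radius $R$ (so $\kappa = 1/R$, $\tau = 0$), the moving curve is a segment of half-length $h$ with $f(t) = t - h$, $g(t) = 0$, and the spin is uniform, $\theta(s) = \text{spin}\cdot s$. The function name `swept_area` is hypothetical.

```python
import math

def swept_area(R, h, spin, n_s=400, n_t=200):
    """Midpoint-rule value of the area integral (2) for the assumed setup:
    kappa = 1/R, tau = 0, f(t) = t - h, g(t) = 0, theta(s) = spin * s."""
    kappa = 1.0 / R
    ds = 2.0 * math.pi * R / n_s
    dt = 2.0 * h / n_t
    total = 0.0
    for i in range(n_s):
        theta = spin * (i + 0.5) * ds
        for j in range(n_t):
            f = (j + 0.5) * dt - h              # f' = 1, so f f' + g g' = f
            radial = 1.0 + kappa * f * math.cos(theta)
            total += math.hypot(spin * f, radial) * ds * dt
    return total

R, h = 2.0, 0.5                                 # kappa * h < 1, so (5) holds
pappus = (2.0 * math.pi * R) * (2.0 * h)        # |C_s| * |C_p|
no_spin = swept_area(R, h, spin=0.0)            # tau + theta' = 0
one_turn = swept_area(R, h, spin=1.0 / R)       # one full turn per circuit
print(pappus, no_spin, one_turn)
```

With no spin the computed area agrees with the Pappus product, while the spinning segment sweeps a strictly larger area, in line with inequality (4).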
To interpret geometrically the minimum condition, $d\theta/ds + \tau = 0$, consider a unit vector $u$ fixed to $C_p$ and lying in the plane of $C_p$. Then $u$ is of the form

$$u = \cos(\theta + \alpha)\,N + \sin(\theta + \alpha)\,B, \tag{8}$$

where $\alpha$ is constant. Differentiating with respect to $s$ and applying the Frenet formulas [G, eqs. (8), (9), (10)] one obtains

$$du/ds = \left(-\sin(\theta + \alpha)\,N + \cos(\theta + \alpha)\,B\right)(\tau + \theta') - \kappa\cos(\theta + \alpha)\,T. \tag{9}$$

Hence, the projection $\pi(du/ds)$ of $du/ds$ onto the plane of $C_p$ vanishes if and only if $d\theta/ds + \tau$ vanishes. Since the torsion $\tau$ is sometimes interpreted as the rate of spin of the $(T, N, B)$ frame as it moves along $C_s$, we interpret the condition $\pi(du/ds) = 0$ to mean: $C_p$ moves without spin as it moves along $C_s$. By $C_p$ moving in "a natural manner," Goodman means that $C_p$ is fixed to the $(T, N, B)$ frame. Hence, their "movement in a natural manner" is the same as our "movement without spin" if and only if $\tau = 0$ for all $s$. Therefore:


THEOREM 2. Let a continuous smooth curve $C_p$ move through Euclidean 3-space in such a way that it is always perpendicular to the path $C_s$ of its centroid, $C_s$ being a curve of class C® and of positive curvature everywhere. If for each point of $C_s$ the curve $C_p$ is contained in the closed half-plane bounded by the corresponding axis of curvature of $C_s$, then the area of the surface generated will satisfy $\|C_p \times C_s\| \ge |C_p| \cdot |C_s|$. If $C_p$ is not a circle, then the Pappus rule $\|C_p \times C_s\| = |C_p| \cdot |C_s|$ holds if and only if $C_p$ does not spin as it moves along $C_s$.


If $C_s$ is not a plane curve, then it is generally difficult to visualize the nature of the motion of $C_p$ along $C_s$ determined by the differential condition $(d\theta/ds) + \tau = 0$. However, if $C_s$ lies on a sphere $\Sigma$, then one can give a simple description of this motion in terms of elementary geometric concepts.

Let $R(s)$ denote the point on $C_s$ given by $s$, $r(s)$ denote the vector from the point $R(s)$ to the center of the sphere $\Sigma$, and $a$ denote the radius of $\Sigma$. Then $r \cdot r = a^2$ and $r' = -R'$. Hence $r \cdot T = r \cdot R' = -r \cdot r' = 0$. Therefore, $r$ lies in the plane of $N$ and $B$; and for each value of $s$ there is a corresponding angle $\alpha(s)$ such that

$$r = a\cos(\theta + \alpha)\,N + a\sin(\theta + \alpha)\,B \tag{10}$$

(cf. eq. (9) above). Differentiating both sides of (10) with respect to $s$, applying the Frenet formulas, and collecting like terms, one obtains

$$0 = (1 + a\kappa\cos(\theta + \alpha))\,T + a(\alpha' + \tau + \theta')\sin(\theta + \alpha)\,N - a(\theta' + \tau + \alpha')\cos(\theta + \alpha)\,B. \tag{11}$$

Hence

$$0 = \theta' + \tau + \alpha'. \tag{12}$$

Therefore $\theta' + \tau = 0$ for all $s$ if and only if $\alpha$ is constant, i.e., $C_p$ is fixed to $r$.
Hence 


THEOREM 3. Under the hypothesis of Theorem 2, if $C_s$ lies on a sphere, then $\|C_p \times C_s\| = |C_p| \cdot |C_s|$ if and only if $C_p$ is fixed to the radius vector from the centroid of $C_p$ to the center of the sphere.


For a simple application of this theorem consider the case: $C_p$ is a moving line segment of length $2a$ having one end point fixed at a point in 3-space and $C_s$ is the path of the midpoint of $C_p$. Then $C_p \times C_s$ is the portion of a conical surface enclosed by a sphere $\Sigma$ whose center is at the vertex of the conical surface. The line containing $C_p$ is the generator and $C_s$ is a directrix of the conical surface. The "base" of the cone (i.e., the intersection of the conical surface with the sphere $\Sigma$) is of length $2 \cdot |C_s|$. Hence, we obtain a formula,

$$\|C_p \times C_s\| = \tfrac{1}{2}|C_p| \cdot 2|C_s| = \tfrac{1}{2}(\text{slant height}) \cdot |\text{base}|,$$

which is an extension of the familiar formula for the area of a right circular cone. Of course one can easily derive this formula in other ways without having Theorem 3. Thinking about these "spherical cones" first led to our discovery of Theorem 3.

For a non-spherical, twisted curve one might conjecture that the directed line segment $\Gamma(s)$ from the point $R(s)$ on the curve $C_s$ to the center of the osculating sphere $\Sigma(s)$ would satisfy the equation of motion $\theta' + \tau = 0$, but this is not true. If we fix the $XY$-coordinate frame to $\Gamma(s)$, then, as we shall show, the hypothesis $\theta'(s) + \tau(s) = 0$ and $\tau(s) \ne 0$ for all $s$ implies $C_s$ is a spherical curve.

A standard formula for the osculating sphere gives:

$$\Gamma = \rho N + (\rho'/\tau)B, \tag{13}$$

where $\rho$ denotes the radius of curvature, $\kappa^{-1}$. If we denote the polar coordinates of the center of $\Sigma(s)$ in the $XY$-plane by $(a, \phi)$, then from (3) and (13) we obtain

$$\rho N + (\rho'/\tau)B = -a\cos(\phi + \theta)\,N - a\sin(\phi + \theta)\,B. \tag{14}$$

Hence

$$\rho = -a\cos(\phi + \theta) \tag{15}$$

and

$$(\rho'/\tau) = -a\sin(\phi + \theta). \tag{16}$$

Since we are assuming $\Gamma(s)$ is fixed to the $XY$-coordinate frame, $\phi$ is constant. Differentiating (15) by $s$ and substituting from (16) gives

$$\rho' = -a'\cos(\phi + \theta) - (\rho'/\tau)\,\theta'. \tag{17}$$

Applying our hypothesis, $\theta' + \tau = 0$, we obtain

$$0 = -a'\cos(\phi + \theta) \tag{18}$$

for all $s$. Since our hypotheses imply $\theta$ is not constant, (18) implies that $a$, the radius of the osculating sphere, is constant. Hence by a well-known result of differential geometry (cf. [L, exercise 1.7.8]), $C_s$ is either a spherical curve or its radius of curvature $\rho$ is constant. If $\rho$ is constant, then (13) becomes $\Gamma = \rho N$, which means that $\Gamma$ and, hence, the $XY$-coordinate frame is fixed to the $(T, N, B)$ frame. But this means $\theta'$ and, hence, $\tau$ is identically zero. Therefore the only nonplane curves that satisfy (14) are spherical curves.

Goodman's results for the hypothesis that $C_s$ is a plane curve, and our Theorem 3, can be restated:


THEOREM 4. Under the hypothesis of Theorem 2, if $C_s$ lies on a surface $\Sigma$ and $\Sigma$ is either a plane or a sphere, then $\|C_p \times C_s\| = |C_p| \cdot |C_s|$ if and only if $C_p$ is fixed to the normal of $\Sigma$ at the centroid of $C_p$.


Are there other surfaces $\Sigma$ which satisfy Theorem 4? We leave this question to the reader as a topic for further research.
A FURTHER COMMENT ON PAPPUS
HARLEY FLANDERS, Purdue University 


The results of Goodman and Goodman [1] and Pursell [2] can be derived 
in a systematic and natural way by using moving frames and differential forms 
[3, 4]. We begin with the structure known as a strip or ribbon. 

Let $x = x(s)$ define a space curve $C$, where $0 \le s \le |C|$, and let $e_1 = e_1(s)$ be its unit tangent. Suppose a unit normal vector $e_2 = e_2(s)$ (not necessarily the unit normal) is attached to each point of the curve. Let $e_3(s) = e_1 \times e_2$ so that $\{e_1, e_2, e_3\}$ is a moving orthonormal frame. The corresponding structure equations are

$$\begin{aligned}
dx &= e_1\,ds \\
de_1 &= (a e_2 - b e_3)\,ds \\
de_2 &= (-a e_1 + c e_3)\,ds \\
de_3 &= (b e_1 - c e_2)\,ds,
\end{aligned}$$

where $a$, $b$, and $c$ are functions of $s$. If the curve is closed and $e_2$ and $e_3$ return to their initial positions, then $a$, $b$, and $c$ are periodic of period $|C|$.

Whatever we are going to move, plane curve or plane domain, will be rigidly attached to the $e_2$ and $e_3$ axes in the normal plane to the curve. Note that so far this approach avoids conditions on curvature; $x(s)$ may perfectly well be straight for example.

Now let $D$ be a domain in the $u, v$-plane of area $|D|$. Define the generic point of a solid $S$ in space by

$$y(s, u, v) = x(s) + u\,e_2(s) + v\,e_3(s),$$

where $0 \le s \le |C|$ and $(u, v) \in D$.

We seek the volume $V$ of $S$. Of course what we really have here is a mapping

$$y: C \times D \to E^3.$$

If the mapping is not injective, "volume" means "algebraic volume." (See conditions below.) Compute:

$$dy = (1 - au + bv)\,ds\,e_1 + (-cv\,ds + du)\,e_2 + (cu\,ds + dv)\,e_3.$$

Hence the volume element is given by

$$\begin{aligned}
dV &= (1 - au + bv)\,ds \wedge (-cv\,ds + du) \wedge (cu\,ds + dv) \\
&= (1 - au + bv)\,ds \wedge du \wedge dv.
\end{aligned}$$


Consequently

$$V = \iiint_{C \times D} (1 - au + bv)\,ds \wedge du \wedge dv = |C| \cdot |D| - \int a\,ds \iint u\,du\,dv + \int b\,ds \iint v\,du\,dv.$$
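The volume formula can be compared with the classical centroid theorem of Pappus on a simple solid of revolution. The sketch below is an illustration under assumptions not taken from the paper: $C$ is a circle of radius $R$, $e_2$ is its inward principal normal (so $a = 1/R$ and $b = 0$), and $D$ is a disk of radius `rho` centred at $(u_0, 0)$. The function names are hypothetical.

```python
import math

def tube_volume(length_c, area_d, int_a, int_b, moment_u, moment_v):
    """Evaluate |C||D| - (int a ds)(iint u du dv) + (int b ds)(iint v du dv)."""
    return length_c * area_d - int_a * moment_u + int_b * moment_v

def offset_torus_volume(R, rho, u0):
    """Assumed test case: circle of radius R with inward normal e2,
    cross-section a disk of radius rho centred at (u0, 0)."""
    area_d = math.pi * rho ** 2
    return tube_volume(
        length_c=2.0 * math.pi * R,
        area_d=area_d,
        int_a=2.0 * math.pi,        # integral of a = 1/R over length 2*pi*R
        int_b=0.0,
        moment_u=u0 * area_d,       # first moment of the disk about u = 0
        moment_v=0.0,
    )

# Classical Pappus: the centroid of D travels a circle of radius R - u0.
print(offset_torus_volume(3.0, 0.5, 0.25), 2.0 * math.pi * 2.75 * math.pi * 0.25)
```

When the centroid of $D$ sits at the origin ($u_0 = 0$) both first moments vanish and the formula collapses to $|C| \cdot |D|$.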


This is a slight improvement of Goodman's Theorem 2 because we have not assumed $\kappa \ne 0$. However, if $C$ has a Frenet frame ($\kappa > 0$ everywhere), then we can write

$$e_2 = (\cos\alpha)\,\mathbf{n} + (\sin\alpha)\,\mathbf{b}, \qquad e_3 = -(\sin\alpha)\,\mathbf{n} + (\cos\alpha)\,\mathbf{b}.$$

Since $e_1 = \mathbf{t}$, we have $de_1 = \kappa\,\mathbf{n}\,ds$; this immediately implies $a = \kappa\cos\alpha$ and $b = \kappa\sin\alpha$.

Now let $C_0$ be a curve in the $u, v$-plane described by $\mathbf{u} = \mathbf{u}(t)$, where $t$ is the arc length and $0 \le t \le |C_0|$. Define a surface $\Sigma$ by

$$z(s, t) = x(s) + u(t)\,e_2(s) + v(t)\,e_3(s).$$

Then (with $\dot u = du/dt$, etc.)

$$dz = (1 - au + bv)\,ds\,e_1 + (-cv\,ds + \dot u\,dt)\,e_2 + (cu\,ds + \dot v\,dt)\,e_3.$$


If $n$ denotes the unit normal to $\Sigma$, then the element of area $dA$ satisfies

$$\begin{aligned}
dA\,n &= \tfrac{1}{2}\,dz \times dz \\
&= (1 - au + bv)\,ds \wedge (-cv\,ds + \dot u\,dt)\,e_3 \\
&\quad + (-cv\,ds + \dot u\,dt) \wedge (cu\,ds + \dot v\,dt)\,e_1 \\
&\quad + (cu\,ds + \dot v\,dt) \wedge (1 - au + bv)\,ds\,e_2 \\
&= \left[-c(u\dot u + v\dot v)\,e_1 - (1 - au + bv)\,\dot v\,e_2 + (1 - au + bv)\,\dot u\,e_3\right] ds \wedge dt.
\end{aligned}$$

Hence $dA = [c^2(u\dot u + v\dot v)^2 + (1 - au + bv)^2]^{1/2}\,ds\,dt$, since $\dot u^2 + \dot v^2 = |\dot{\mathbf{u}}|^2 = 1$. Assume $1 - au + bv > 0$ at each point of $C \times C_0$. Then

$$\iint dA \ge \iint_{C \times C_0} (1 - au + bv)\,ds \wedge dt = |C| \cdot |C_0| - \int a\,ds \int u\,dt + \int b\,ds \int v\,dt,$$

with equality if and only if $c(s)(u\dot u + v\dot v) = 0$ identically. If $u\dot u + v\dot v = 0$, then $u^2 + v^2 = k^2$ is constant, and $C_0$ is an arc of a circle with center at $(0, 0)$. If $C_0$ is the complete circle, the formula reduces to $A = |C| \cdot |C_0|$, a special case of a known result on the volume of tubes [5]. If $C_0$ is not a circle, the condition for equality is $c(s) = 0$.

The condition $c(s) = 0$ is a linear differential equation for the normal frame $\{e_2, e_3\}$; hence there is a unique solution subject to initial data. If $C$ happens to be a plane curve ($b = 0$), this system evidently reduces to $de_3 = 0$. Hence $e_3 =$ constant. In that case $e_2$ is the normal to $C$ and $e_3$ is the binormal. (Recall that the assumption $\kappa \ne 0$ is unnecessary for plane curves; there is always a unique unit normal.)

This completes the results in Goodman and Goodman, and Pursell. It remains to discuss the inequality $1 - au + bv > 0$. In the domain case, this will guarantee that $y: C \times D \to E^3$ is regular; in the curve case, that $z$ is regular. Of course no local condition can make these maps injective.

Here is a sufficient condition. Set

$$M = \sup\,(u^2 + v^2)^{1/2},$$

taken over $D$ or $C_0$ as the case may be. Then $au - bv \le (a^2 + b^2)^{1/2}M$, so $(a^2 + b^2)^{1/2}M < 1$ is sufficient. But $(a^2 + b^2)^{1/2} = |d^2x/ds^2| = \kappa$, the curvature of $C$. Hence a sufficient condition is

$$M < \inf \rho(s),$$

where $\rho(s)$ is the radius of curvature.
THE SIMPLE CONTINUED FRACTION EXPANSION OF e
C. D. OLDS, San Jose State College, California 


1. Introduction. Students, and others, have asked where they can find a readable account of the simple continued fraction expansion of $e$, namely, the beautiful result due to Euler [1] that

$$e = 2 + \frac{1}{1+}\,\frac{1}{2+}\,\frac{1}{1+}\,\frac{1}{1+}\,\frac{1}{4+}\,\frac{1}{1+}\,\frac{1}{1+}\,\frac{1}{6+}\cdots = (2, 1, 2, 1, 1, 4, 1, 1, 6, 1, \cdots). \tag{1}$$


Simple continued fractions have the form

$$a_1 + \cfrac{1}{a_2 + \cfrac{1}{a_3 + \cdots}} \tag{2}$$

where $a_1$ is usually a positive or negative integer (but could be zero), and where the terms $a_2, a_3, a_4, \cdots$ are positive integers. It is convenient to write (2) in the form

$$a_1 + \frac{1}{a_2+}\,\frac{1}{a_3+}\,\frac{1}{a_4+}\cdots,$$

with the "+" signs after the first one lowered to indicate the "step-down" process in forming a continued fraction; or simply to represent it by the symbol $(a_1, a_2, a_3, \cdots)$. To understand the continued fraction part of this expository paper one need only read, say, a few pages of Chapter 7 in Niven and Zuckerman [4]; noting, in particular, that the convergents of the continued fraction (2), namely,

$$\frac{p_1}{q_1} = \frac{a_1}{1}, \quad \frac{p_2}{q_2} = \frac{a_1a_2 + 1}{a_2}, \quad \cdots, \tag{3}$$

can be calculated, successively, from the equations

$$p_i = a_ip_{i-1} + p_{i-2}, \qquad q_i = a_iq_{i-1} + q_{i-2}, \tag{4}$$

for $i = 1, 2, 3, 4, \cdots$, provided the undefined terms which occur are assigned the values $p_{-1} = 0$, $p_0 = 1$, $q_{-1} = 1$, $q_0 = 0$.
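The recurrence (4) translates directly into a short program. The sketch below (the function name `convergents` is hypothetical) starts from the stated initial values $p_{-1} = 0$, $p_0 = 1$, $q_{-1} = 1$, $q_0 = 0$ and applies it to the first ten partial quotients of the expansion (1).

```python
from fractions import Fraction

def convergents(partial_quotients):
    """Yield p_i/q_i using p_i = a_i p_{i-1} + p_{i-2}, q_i = a_i q_{i-1} + q_{i-2}."""
    p_prev, p = 0, 1    # p_{-1}, p_0
    q_prev, q = 1, 0    # q_{-1}, q_0
    for a in partial_quotients:
        p_prev, p = p, a * p + p_prev
        q_prev, q = q, a * q + q_prev
        yield Fraction(p, q)

e_terms = [2, 1, 2, 1, 1, 4, 1, 1, 6, 1]
approximations = list(convergents(e_terms))
for c in approximations:
    print(c, float(c))
```

The convergents begin 2, 3, 8/3, 11/4, 19/7 and alternate around $e$ while closing in on it.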

Euler derived the expansion (1) by converting the infinite series expansion 
of e into a continued fraction, an effective method when it succeeds. One does 
not always end up with a simple continued fraction. See Wall [6, p. 17]. 

Since Euler’s time, mathematicians such as Lambert, Gauss, Liouville, 
Hurwitz, Stieltjes, to mention only a few, established continued fractions as a 
field worthy of independent study, with applications to many branches of mathe- 
matics. It comes as no surprise, then, that the expansion (1), and similar ones, 
are special cases of later developments of the subject. 

However, for historical reasons, we present Hermite's derivation of (1). The main ideas are contained in his famous paper [2] in which he gave the first proof that $e$ is a transcendental number. Hermite needed approximations to $e$ and its integral powers, and, as a matter of fact, those he used were not convergents to their respective continued fractions. What follows, then, is not a mere translation of what Hermite wrote, but, rather, a re-working of his ideas, with changes and additions to make a self-contained exposition starting with the integral (5), given below, and ending with (1).


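2. Hermite's Method. Hermite starts with an integral

$$\int e^{-rx}f(x)\,dx, \tag{5}$$

where $r \ne 0$ is an arbitrary constant, and where $f(x)$ is a polynomial of degree $n = 2m$. Repeated integration by parts transforms (5) into the form

$$\begin{aligned}
\int e^{-rx}f(x)\,dx &= -\frac{1}{r}e^{-rx}f(x) + \frac{1}{r}\int f'(x)e^{-rx}\,dx \\
&= -\frac{1}{r}e^{-rx}f(x) - \frac{1}{r^2}e^{-rx}f'(x) + \frac{1}{r^2}\int f''(x)e^{-rx}\,dx \\
&= -e^{-rx}\left(\frac{1}{r}f(x) + \frac{1}{r^2}f'(x) + \cdots + \frac{1}{r^{2m+1}}f^{(2m)}(x)\right) \\
&= -e^{-rx}\,\Phi(x).
\end{aligned} \tag{6}$$

Hence,

$$\int_0^1 e^{-rx}f(x)\,dx = -e^{-rx}\,\Phi(x)\Big]_0^1 = \Phi(0) - e^{-r}\,\Phi(1). \tag{7}$$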


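In (7) let $f(x) = x^m(x-1)^m$; then

$$f^{(j)}(0) = f^{(j)}(1) = 0, \quad j = 0, 1, 2, \cdots, m-1;$$

consequently, the expression $\Phi(x)$ in (6) reduces for $x = 0$ and $x = 1$, respectively, to

$$\Phi(0) = \sum_{j=m}^{2m}\frac{f^{(j)}(0)}{r^{j+1}}, \qquad \Phi(1) = \sum_{j=m}^{2m}\frac{f^{(j)}(1)}{r^{j+1}}. \tag{8}$$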

On the other hand, Taylor's expansion about $x = 0$ shows that

$$f(x) = \sum_{j=m}^{2m}\frac{f^{(j)}(0)}{j!}\,x^j = \sum_{j=m}^{2m}\alpha_jx^j, \tag{9}$$

where $f^{(j)}(0) = j!\,\alpha_j$. Since $f(x)$ is a polynomial with integral coefficients, the $\alpha_j$'s are integers. Similarly, writing $f(x) = [(x-1)+1]^m(x-1)^m$, Taylor's expansion about $x = 1$ shows that

$$f(x) = \sum_{j=m}^{2m}\frac{f^{(j)}(1)}{j!}\,(x-1)^j = \sum_{j=m}^{2m}\beta_j(x-1)^j, \tag{10}$$

where $f^{(j)}(1) = j!\,\beta_j$, the $\beta_j$'s being integers. Hence, the equations (8) take the form

$$\Phi(0) = \sum_{j=m}^{2m}\frac{j!\,\alpha_j}{r^{j+1}} = \frac{m!\,M_m(r)}{r^{2m+1}}, \qquad \Phi(1) = \sum_{j=m}^{2m}\frac{j!\,\beta_j}{r^{j+1}} = \frac{m!\,N_m(r)}{r^{2m+1}}, \tag{11}$$

where $M_m(r)$ and $N_m(r)$ are polynomials of degree $m$ in $r$ with integral coefficients. Using (11), with $f(x) = x^m(x-1)^m$, we rewrite (7) in the form

$$e^rM_m(r) - N_m(r) = \frac{r^{2m+1}e^r}{m!}\,J_m = V_me^r, \tag{12}$$

where

$$J_m = \int_0^1 e^{-rx}x^m(x-1)^m\,dx, \qquad V_m = \frac{r^{2m+1}}{m!}\,J_m. \tag{13}$$

Setting $m = 0$, $f(x) = 1$, $\Phi(x) = r^{-1}$, and so $\Phi(0) = r^{-1}$, $M_0(r) = 1$, $N_0(r) = 1$. Similarly, for $m = 1$, $f(x) = x^2 - x$, and so $M_1(r) = 2 - r$, $N_1(r) = 2 + r$.

The crucial part of Hermite's development hinges on a relationship between $J_m$, $J_{m-1}$, and $J_{m-2}$, $m \ge 2$, of the form $J_m + aJ_{m-1} + bJ_{m-2} = 0$. To this end, integration by parts shows that

$$J_m = \frac{m}{r}\int_0^1 e^{-rx}x^{m-1}(x-1)^m\,dx + \frac{m}{r}\int_0^1 e^{-rx}x^m(x-1)^{m-1}\,dx. \tag{14}$$

In the first integral above replace $(x-1)^m$ by $(x-1)^{m-1}(x-1)$ to obtain

$$\int_0^1 e^{-rx}x^m(x-1)^{m-1}\,dx = \frac{r}{2m}\,J_m + \frac{1}{2}\,J_{m-1}; \tag{15}$$

and with $m$ replaced by $m - 1$,

$$\int_0^1 e^{-rx}x^{m-1}(x-1)^{m-2}\,dx = \frac{r}{2(m-1)}\,J_{m-1} + \frac{1}{2}\,J_{m-2}. \tag{16}$$

A second integration by parts shows that

$$\int_0^1 e^{-rx}x^m(x-1)^{m-1}\,dx = \frac{m}{r}\,J_{m-1} + \frac{m-1}{r}\int_0^1 e^{-rx}x^m(x-1)^{m-2}\,dx. \tag{17}$$

On the other hand, since $(x-1)^{m-1} = x(x-1)^{m-2} - (x-1)^{m-2}$, it follows that

$$J_{m-1} = \int_0^1 e^{-rx}x^m(x-1)^{m-2}\,dx - \int_0^1 e^{-rx}x^{m-1}(x-1)^{m-2}\,dx. \tag{18}$$

Using (16), and then solving (18) for the first integral on the right, we get

$$\int_0^1 e^{-rx}x^m(x-1)^{m-2}\,dx = J_{m-1} + \frac{r}{2(m-1)}\,J_{m-1} + \frac{1}{2}\,J_{m-2}. \tag{19}$$

Now, substitute (19) into (17), and follow this by equating the right side of (15) with the new form of (17). The result, after simplifications, is

$$r^2J_m - 2m(2m-1)J_{m-1} - m(m-1)J_{m-2} = 0. \tag{20}$$

Using the relationship between $V_m$ and $J_m$ given in (13), we can replace (20) by

$$V_m - (4m-2)V_{m-1} - r^2V_{m-2} = 0. \tag{21}$$
Next, calculate $V_m$, $V_{m-1}$, $V_{m-2}$ from (12) and substitute into (21) to obtain

$$e^r\left[M_m(r) - (4m-2)M_{m-1}(r) - r^2M_{m-2}(r)\right] - \left[N_m(r) - (4m-2)N_{m-1}(r) - r^2N_{m-2}(r)\right] = 0, \tag{22}$$

and since $e^r$ is irrational this demands that

$$\begin{aligned}
M_m(r) - (4m-2)M_{m-1}(r) - r^2M_{m-2}(r) &= 0, \\
N_m(r) - (4m-2)N_{m-1}(r) - r^2N_{m-2}(r) &= 0.
\end{aligned} \tag{23}$$
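The recurrences (23), started from $M_0 = N_0 = 1$, $M_1 = 2 - r$, $N_1 = 2 + r$, generate the polynomials one degree at a time, and by (12) the ratio $N_m(r)/M_m(r)$ should approach $e^r$. The following sketch (hypothetical function names; coefficient lists are stored lowest degree first) illustrates this.

```python
from fractions import Fraction
import math

def hermite_polys(m_max):
    """Return coefficient lists of M_0..M_m_max and N_0..N_m_max built from
    P_m = (4m - 2) P_{m-1} + r^2 P_{m-2}."""
    M = [[1], [2, -1]]      # M_0 = 1, M_1 = 2 - r
    N = [[1], [2, 1]]       # N_0 = 1, N_1 = 2 + r
    for m in range(2, m_max + 1):
        for P in (M, N):
            prev, prev2 = P[m - 1], P[m - 2]
            new = [0] * (m + 1)
            for i, c in enumerate(prev):
                new[i] += (4 * m - 2) * c       # (4m - 2) P_{m-1}
            for i, c in enumerate(prev2):
                new[i + 2] += c                 # r^2 P_{m-2}
            P.append(new)
    return M[: m_max + 1], N[: m_max + 1]

def evaluate(coeffs, r):
    """Evaluate a polynomial given by its coefficient list at r."""
    return sum(c * r ** i for i, c in enumerate(coeffs))

M, N = hermite_polys(5)
r = Fraction(1)
print(M[2], N[2])                                   # 12 - 6r + r^2, 12 + 6r + r^2
print(float(evaluate(N[5], r) / evaluate(M[5], r)), math.e)
```

Already at $m = 5$ and $r = 1$ the quotient agrees with $e$ to many decimal places, which reflects how quickly $V_m$ tends to zero.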
From (11) we know that $M_m(r)$ and $N_m(r)$ are polynomials of degree $m$ in $r$. Hence, if we replace $r$ by $2/k$, where $k$ is a positive integer, then these polynomials can be written in the form

$$M_m(2/k) = S_m/k^m, \qquad N_m(2/k) = R_m/k^m, \tag{24}$$

where $S_m$ and $R_m$ are integers. Next, substitute the values of $M_m$ and $N_m$ from (24) into (12), use (13) to get

$$e^{2/k}S_m - R_m = \frac{2^{2m+1}e^{2/k}}{k^{m+1}\,m!}\int_0^1 e^{-2x/k}x^m(x-1)^m\,dx.$$

Since $|x(x-1)| \le 1/4$ if $0 \le x \le 1$, and since $\int_0^1 e^{-2x/k}\,dx < k/2$, it follows easily that

$$\left|e^{2/k}S_m - R_m\right| < \frac{e^{2/k}}{k^m\,m!}. \tag{25}$$

Again, using (24) the relations (23) can be replaced by

$$S_m = (4m-2)k\,S_{m-1} + 4S_{m-2}, \qquad R_m = (4m-2)k\,R_{m-1} + 4R_{m-2}. \tag{26}$$

If, for convenience, we let

$$S_m + R_m = 2^{m+1}T_m, \qquad S_m - R_m = -2^{m+1}Z_m, \tag{27}$$

then the recurrence relations (26) can be replaced by

$$T_m = (2m-1)k\,T_{m-1} + T_{m-2}, \qquad Z_m = (2m-1)k\,Z_{m-1} + Z_{m-2}, \tag{28}$$

where, in particular,

$$T_0 = 1,\ T_1 = k,\ T_2 = 3k^2 + 1,\ T_3 = 15k^3 + 6k,\ \cdots,$$

$$Z_0 = 0,\ Z_1 = 1,\ Z_2 = 3k,\ Z_3 = 15k^2 + 1,\ \cdots.$$
Referring to equations (4), this shows that $T_0/Z_0$, $T_1/Z_1$, $T_2/Z_2$, $\cdots$, are the convergents to the simple continued fraction $(k, 3k, 5k, 7k, \cdots)$. Moreover, from (27) we have

$$S_m = 2^m(T_m - Z_m), \qquad R_m = 2^m(T_m + Z_m);$$

which transforms the inequality (25) into the form

$$\left|(e^{2/k} - 1)T_m - (e^{2/k} + 1)Z_m\right| < \frac{e^{2/k}}{(2k)^m\,m!},$$

and after dividing both sides by $Z_m(e^{2/k} - 1)$, and noting by the recursion formula (28) that $Z_m$ increases as $m$ increases, we see that

$$\left|\frac{e^{2/k} + 1}{e^{2/k} - 1} - \frac{T_m}{Z_m}\right| < \frac{e^{2/k}}{(2k)^m\,m!\,Z_m(e^{2/k} - 1)} \to 0$$

as $m \to \infty$. We are justified, then, in writing as a simple continued fraction

$$\frac{e^{2/k} + 1}{e^{2/k} - 1} = k + \frac{1}{3k+}\,\frac{1}{5k+}\,\frac{1}{7k+}\cdots = (k, 3k, 5k, 7k, \cdots). \tag{29}$$
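The convergence claimed in (29) is easy to observe numerically. This sketch (hypothetical names `t_z` and `target`) iterates the recurrences (28) from $T_0 = 1$, $T_1 = k$, $Z_0 = 0$, $Z_1 = 1$ and compares $T_m/Z_m$ with $(e^{2/k} + 1)/(e^{2/k} - 1)$ in floating point.

```python
import math

def t_z(k, m_max):
    """Return lists T_0..T_m_max and Z_0..Z_m_max from
    X_m = (2m - 1) k X_{m-1} + X_{m-2}."""
    T, Z = [1, k], [0, 1]
    for m in range(2, m_max + 1):
        T.append((2 * m - 1) * k * T[-1] + T[-2])
        Z.append((2 * m - 1) * k * Z[-1] + Z[-2])
    return T[: m_max + 1], Z[: m_max + 1]

def target(k):
    """The left side of (29): (e^{2/k} + 1) / (e^{2/k} - 1)."""
    x = math.exp(2.0 / k)
    return (x + 1.0) / (x - 1.0)

T, Z = t_z(2, 8)
for m in range(1, 9):
    print(m, T[m], Z[m], abs(T[m] / Z[m] - target(2)))
```

For $k = 2$ the error drops below double-precision resolution within a handful of steps, as the factor $(2k)^m\,m!\,Z_m$ in the bound suggests.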


Subtracting 1 from both sides of (29) gives

$$\frac{2}{e^{2/k} - 1} = (k - 1) + \frac{1}{3k+}\,\frac{1}{5k+}\,\frac{1}{7k+}\cdots. \tag{30}$$

This can give excellent approximations to $e$. For example, if $k = 2$, then from (30) we get

$$\frac{e - 1}{2} = \cfrac{1}{1 + \cfrac{1}{6 + \cfrac{1}{10 + \cdots}}}. \tag{31}$$

Using the 7th convergent to (31), we have $(e-1)/2 \approx 342762/398959$, which shows that $e \approx 2.718281828458\cdots$, in error by only one unit in the last place.
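The quoted convergent can be reproduced by collapsing the finite continued fraction from the bottom up in exact rational arithmetic. This is a sketch with a hypothetical helper `evaluate_cf`; the term list assumes the partial quotients of (31) are $0, 1, 6, 10, 14, 18, 22$, i.e. $(k-1, 3k, 5k, \cdots)$ with $k = 2$, inverted.

```python
from fractions import Fraction
import math

def evaluate_cf(terms):
    """Collapse the finite simple continued fraction (a_1, ..., a_n) bottom-up."""
    value = Fraction(terms[-1])
    for a in reversed(terms[:-1]):
        value = a + 1 / value
    return value

seventh = evaluate_cf([0, 1, 6, 10, 14, 18, 22])    # 7th convergent of (e - 1)/2
e_estimate = 2 * seventh + 1
print(seventh, float(e_estimate), math.e)
```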
From (31) we see that

$$e = 1 + \cfrac{2}{1 + \cfrac{1}{6 + \cfrac{1}{10 + \cfrac{1}{14 + \cdots}}}} \tag{32}$$

and it is quite easy to transform this into a simple continued fraction by the use of two transformations. Write

and if $a$ is odd, that

$$\cfrac{2}{a + \cfrac{1}{b + \cfrac{1}{y}}} = \cfrac{1}{\dfrac{a-1}{2} + \cfrac{1}{1 + \cfrac{1}{1 + \cfrac{2}{(b-1) + \cfrac{1}{y}}}}}.$$

Thus, the continued fraction (32) transforms easily into Euler's result:

$$e = 2 + \frac{1}{1+}\,\frac{1}{2+}\,\frac{1}{1+}\,\frac{1}{1+}\,\frac{1}{4+}\cdots = (2, \overline{1, 2n, 1})_{n=1}^{\infty}, \tag{33}$$

where the "bar" indicates the periodic part of the fraction.
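The odd-$a$ transformation, read here as $2/(a + 1/(b + 1/y)) = 1/\bigl((a-1)/2 + 1/(1 + 1/(1 + 2/((b-1) + 1/y)))\bigr)$, can be checked in exact arithmetic. This reading of the identity is a reconstruction, and the function names are hypothetical; the sketch simply evaluates both sides for a few odd $a$.

```python
from fractions import Fraction

def lhs(a, b, y):
    """2 / (a + 1/(b + 1/y))."""
    return 2 / (a + 1 / (b + 1 / Fraction(y)))

def rhs(a, b, y):
    """1 / ((a - 1)/2 + 1/(1 + 1/(1 + 2/((b - 1) + 1/y))))."""
    tail = (b - 1) + 1 / Fraction(y)
    return 1 / (Fraction(a - 1, 2) + 1 / (1 + 1 / (1 + 2 / tail)))

for a, b in [(1, 6), (5, 10), (9, 14)]:
    print(a, b, lhs(a, b, 3), rhs(a, b, 3))
```

Applied to (32) with $a = 1, 5, 9, \cdots$ and $b = 6, 10, 14, \cdots$, each step contributes the partial quotients $(a-1)/2, 1, 1$ and leaves a tail of the same shape with $a$ replaced by $b - 1$.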
A similar discussion would show that

$$e^2 = (7, \overline{3n-1, 1, 1, 3n, 6(2n+1)})_{n=1}^{\infty}; \tag{34}$$

and that for $k \ge 3$ and odd

$$e^{2/k} = (\overline{1, \tfrac{1}{2}[(6n+1)k - 1], 6k(2n+1), \tfrac{1}{2}[(6n+5)k - 1], 1})_{n=0}^{\infty}. \tag{35}$$

3. Conclusion. The continued fraction (29) was published by Lambert in 1761. For historical data on continued fractions see Perron [5]. For the technique for proving $e$ transcendental, see Chapters 2 and 9 in Niven [3].

What gave Hermite the idea to start with the integral (5)? A partial answer 
can be found by thumbing through his previous papers to see his great skill in 
handling various difficult integrals involving transcendental functions. One soon 
senses that he must already have had in mind all the basic techniques needed 
to prove that e is transcendental. 


References 


1. L. Euler, Introductio in analysin infinitorum, Lausanne 1 (1748), Chapter 18.

2. C. Hermite, Compt. Rend. Acad. Sci. Paris, 77 (1873) 18-24, 74-79, 285-293.

3. Ivan Niven, Irrational Numbers, Wiley, New York, 1956.

4. I. Niven and H. S. Zuckerman, An Introduction to the Theory of Numbers, Wiley, New York, 1960.

5. Oskar Perron, Die Lehre von den Kettenbrüchen, Teubner, Leipzig, 1929.

6. H. S. Wall, Analytic Theory of Continued Fractions, Van Nostrand, Princeton, N. J., 1948.


AN ELEMENTARY MODEL FOR SKEPTICISM 
DAN E. CHRISTIE, Bowdoin College 


The following presentation has a number of possible uses. It can serve as an 
allegory to indicate how, in applied mathematics, mathematical models are 
fitted successively to a real life situation. It is a source of exercises on trivial set 
theory and pre-orders, suitable for liberal arts students at levels K-13. It pro- 
vides an instance of the vivid imagery of digraphs. It also offers a non-geometric 
but intuitive motivation for assigning a topology to a set. 


The “real life” situation. Let us consider models for certain interactions 
among members of a set of mathematicians, for example, members of a research 
seminar. Let us suppose that mathematical conjectures and theorems are inter- 
mittently generated and announced by individuals or by clusters of members. 
Ignoring the main business of proof and counter-example, let us concentrate on 
the patterns of skepticism aroused by new claims or conjectures. Credibility is 
a personal matter. Some mathematicians have extraordinarily reliable hunches, 
while for others optimism is matched only by carelessness. On the receiving end, 
we are familiar with many degrees of acceptance ranging from utter gullibility 
to extreme caution, from faith in the infallibility of established experts (or of 
mathematics in print) to mistrust of any proposition not proved in detail. To 
bring the relevant aspects of the real life situation into focus, we turn to a spe- 
cific example which will serve repeatedly as a prototype. 


Prototype example. Let us suppose that the membership $S$ of a seminar is the union of three disjoint subsets, each typifying a plausible pattern of response. First, there is a clique $K$ of individuals $k_1, k_2, k_3$ who trust only each other: $k_i$ never disbelieves $k_j$, but otherwise $k_i$ is a skeptic. Second, there is a cluster $L = \{l_1, l_2, l_3\}$ led by $l_1$. No $l_i$ ever disbelieves an assertion proposed by $l_1$, but $l_i$ too is a doubter when $l_1$ is not involved. Finally, there is a residual pair $M$ of modest members $m_1, m_2$ who cautiously refrain from rejecting the claims of any colleague. To illustrate by special cases, a table is given.


A claim by the faction            is predictably disbelieved by the
listed below on the left          members listed below on the right

$k_1, k_2, l_1, l_2, m_2$         none
$k_1, k_3, l_2, m_2$              $l_1, l_3$
$l_2, m_1$                        $k_1, k_2, k_3, l_1, l_3$
$l_1$                             $k_1, k_2, k_3$
$k_1, m_1$                        $l_1, l_2, l_3$
$k_3, l_3$                        $l_1, l_2$

The author did his undergraduate work at Bowdoin, his graduate work at Cambridge, and at 
Princeton, where he received his PhD under S. Lefschetz, returning one year as a Ford Faculty 
Fellow. Since, he has been on the staff at Bowdoin. He directed a number of summer and academic 
year institutes, including several Advanced Algebra seminars, and has been active in numerous 
MAA committees and section offices. His publications include Intermediate College Mechanics 
(McGraw-Hill 1952) and Vector Mechanics (McGraw-Hill 1964). Editor.
This seminar $S$ has only eight members. Obviously the same example can be expanded by enlarging the subsets or by having several substructures such as $L$ with various patterns of leadership. We now proceed to apply mathematical paraphernalia to the class of situations typified by this example.


A preliminary set model. To say that skepticism is a personal matter is to say that our models will be naively set-theoretic. Let $S$ denote the set of mathematicians in question, the membership of the seminar for instance. To each subset or faction $F$ where $F \subset S$ corresponds another subset, $\mathcal{D}F$, of disbelievers. We shall then be concerned with a disbeliever function $\mathcal{D}$ which maps the family of subsets of $S$ into itself:

$$\mathcal{D}: 2^S \to 2^S.$$

We shall wish to compare $\mathcal{D}$ with the familiar complement function

$$\mathcal{C}: 2^S \to 2^S,$$

defined by $\mathcal{C}F = \{x \in S \mid x \notin F\}$. For later comparison we state a few properties of $\mathcal{C}$:

$$\mathcal{C}\mathcal{C}F = F \quad (\text{for every } F \subset S), \tag{1}$$

$$F_1 \subset F_2 \Rightarrow \mathcal{C}F_2 \subset \mathcal{C}F_1, \tag{2}$$

$$\mathcal{C}\emptyset = S, \tag{3}$$

$$\mathcal{C}(F_1 \cup F_2) = \mathcal{C}F_1 \cap \mathcal{C}F_2. \tag{4}$$

Property (1) states that the function $\mathcal{C}$ is an involution. Property (2) points out that $\mathcal{C}$ reverses inclusions. Assuming that $\mathcal{C}S = \emptyset$, we can regard (3) as a consequence of (1). Property (4) is one of the famous De Morgan rules.

The bare language of sets permits us to write concisely additional simple 
statements about the prototype example: 


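(5) $K \cup L \cup M = S$,
(6) $K \cap L = L \cap M = M \cap K = \emptyset$,
(7) $F \subset K \Rightarrow K \cap \mathcal{D}F = \emptyset$,
(8) $l_1 \in F \Rightarrow L \cap \mathcal{D}F = \emptyset$,
(9) for all $F \subset S$, $M \cap \mathcal{D}F = \emptyset$.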


These statements are part of the example, depending on the decomposition of S 
into special subsets. 

A preliminary set model (S, D) consists of a set S and a set function D sub- 
ject to the restriction 


(10) $\mathcal{D}F \subset \mathcal{C}F$, for every $F \subset S$.


It is clear that the prototype example satisfies the limitation stated in (10). In 
general, the mere existence of a set function subject to (10) does not reveal 
much. If we apply (10) to $S$ itself, we obtain

$$\mathcal{D}S \subset \mathcal{C}S = \emptyset,$$

a result which makes a cheerful comment on unanimity. Note that extreme and
unsubtle examples also satisfy (10). For instance, suppose $\mathcal{D}F = \emptyset$ for all $F \subset S$.
Clearly (10) is satisfied. Or suppose that for all $F$ it happens that $\mathcal{D}F = \mathcal{C}F$.
Again (10) is satisfied. To get closer to the dynamics of disbelief, we specify 
additional structure. 


A dominance digraph. In our prototype example, the set function D was 
based on simple characterizations of three subsets. These special cases and others 
may be handled automatically by assigning a dominance relation to S and then 
defining $\mathcal{D}$ in terms of that relation. We read "x dominates y" for "$x \to y$." (This
notation is used by Moon in [2].) The relation $\to$ is to be reflexive and
transitive:

(11) $x \to x$ for every $x \in S$;
(12) if $x \to y$ and $y \to z$, then $x \to z$.
[Fig. 1: (a) an arrow from x to y; (b) arrows on the vertices x, y, z.]

[Fig. 2: (a) the clique; (b) the cluster led by l_1.]

The relation $\to$ is thus a pre-order. It can be conveniently portrayed by a
directed linear graph (digraph), just by replacing the statement $x \to y$ by the
picture in Figure 1a. (For an extensive treatment of digraphs, see [1].) The loops
at the vertices, required by reflexivity, are often omitted from diagrams.
Similarly, in Figure 1b three loops of reflexivity are omitted as is the arrow $x \to z$
which is dictated by transitivity.
In the prototype example the clique $K$ can be represented as in Figure 2a.
(Note that other dominances given by reflexivity and transitivity are omitted.)
The cluster $L$ led by $l_1$ looks like Figure 2b, expressing the dominances $l_1 \to l_i$,
all $i$. The subset $M$ has members which are effectively dominated by everyone.
The total diagram for the prototype example looks in part like Figure 3. To
complete the digraph, as before, one must use (11) and (12) wherever possible.

[Fig. 3: partial digraph of the prototype example.]


Let us extend slightly the nomenclature introduced above. For a subset
$F \subset S$ we shall write $F \to x$ (read "F dominates x") when there exists some $y \in F$
such that $y \to x$. We point out three simple propositions using this notation. First,
a technicality based on reflexivity:

(13) $F \not\to x \Rightarrow x \notin F$.

And now we make two obvious remarks.

(14) If $F_2 \not\to x$ and $F_1 \subset F_2$, then $F_1 \not\to x$.
(15) If $F_1 \not\to x$ and $F_2 \not\to x$, then $F_1 \cup F_2 \not\to x$.

These three conclusions are immediate consequences of the definition of "$F \to x$."
It is interesting to apply this extended nomenclature to the prototype example.
For instance, $L \to l_i$ for all $l_i \in L$; $\{l_1\} \to l_i$ for all $l_i \in L$; $K \to m_i$ and $L \to m_i$ for all
$m_i \in M$.


A dominance model. In the preliminary set model, we postulated a disbelief
function $\mathcal{D}$ subject only to (10). Now, assuming a pre-ordered set $(S, \to)$, we
define $\mathcal{D}$ explicitly: the set of disbelievers of a faction consists of all members of
$S$ which are not dominated by a member of the faction. Symbolically,

(16) $\mathcal{D}F = \{x \in S \mid F \not\to x\}$.

A neater statement describes the complement of $\mathcal{D}F$:

(17) $x \in \mathcal{C}\mathcal{D}F$ iff $F \to x$.
For any pre-ordered set $(S, \to)$, we now have a dominance model $(S, \to, \mathcal{D})$
with $\mathcal{D}$ defined as above. A digraph illustration is given in Figure 4. The arrows
of reflexivity and transitivity are omitted. Dashed lines indicate a sample $F$
and the corresponding $\mathcal{D}F$.

[Fig. 4: a digraph with a sample F and the corresponding DF outlined by dashed lines.]
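Definition (16) is easy to mechanize. The following Python sketch encodes the prototype seminar under the assumption that its pre-order is exactly the one described in the text (clique members dominate one another, $l_1$ dominates every member of $L$, and everyone dominates the members of $M$). The names `dominates` and `disbelievers` are illustrative only, and the factions listed are of the kind tabulated for the prototype example.

```python
# Prototype seminar: clique K, cluster L led by l1, modest pair M.
K = {"k1", "k2", "k3"}
L = {"l1", "l2", "l3"}
M = {"m1", "m2"}
S = K | L | M

def dominates(x, y):
    """Assumed pre-order of the prototype: reflexive, K mutually dominant,
    l1 dominates all of L, and everyone dominates the members of M."""
    return (x == y
            or y in M
            or (x in K and y in K)
            or (x == "l1" and y in L))

def disbelievers(F):
    """Equation (16): the members of S dominated by no member of F."""
    return {x for x in S if not any(dominates(y, x) for y in F)}

factions = [
    {"k1", "k2", "l1", "l2", "m2"},
    {"k1", "k3", "l2", "m2"},
    {"l2", "m1"},
    {"l1"},
    {"k1", "m1"},
    {"k3", "l3"},
]
for F in factions:
    print(sorted(F), "->", sorted(disbelievers(F)) or "none")
```

Because the pre-order is passed around only through `dominates`, a different seminar can be tried by changing that one function.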


It is soon clear that the dominance model $(S, \to, \mathcal{D})$ agrees with our
preliminary set model. The extreme cases cited earlier ($\mathcal{D} = \mathcal{C}$ and, for $F$ not empty,
$\mathcal{D}F = \emptyset$) now appear as immediate consequences of extreme pre-orders. If
$x \in \mathcal{D}F$, then $F \not\to x$, so by (13) we know $x \notin F$. Hence (10) is verified as is its
corollary, $\mathcal{D}S = \emptyset$. For completeness, let us also verify

(18) $\mathcal{D}\emptyset = S$.

Obviously, $\mathcal{D}\emptyset \subset S$. Suppose $x \in S$ but $x \notin \mathcal{D}\emptyset$. Then by (17), we have $\emptyset \to x$,
which is impossible. A more significant result is the following:

LEMMA. In a dominance model $(S, \to, \mathcal{D})$,

(19) $F_1 \subset F_2 \Rightarrow \mathcal{D}F_1 \supset \mathcal{D}F_2$.

Proof. Let $x \in \mathcal{D}F_2$. Then $F_2 \not\to x$ and by (14) $F_1 \not\to x$. It really is sufficient to
point out that (19) is a rewording of (14).
We now confront a major consequence of our definition of D. 


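PROPOSITION. In a dominance model $(S, \to, \mathcal{D})$, for any two subsets $F_1$ and $F_2$
of $S$, it follows that

(20) $\mathcal{D}(F_1 \cup F_2) = \mathcal{D}F_1 \cap \mathcal{D}F_2$.

Proof. The inclusions $F_1 \subset F_1 \cup F_2$ and $F_2 \subset F_1 \cup F_2$ enable us to conclude
directly from (19) that $\mathcal{D}(F_1 \cup F_2) \subset \mathcal{D}F_1 \cap \mathcal{D}F_2$. Now suppose that $x \in \mathcal{D}F_1 \cap \mathcal{D}F_2$.
Then by definition of $\mathcal{D}$, $F_1 \not\to x$ and $F_2 \not\to x$. Hence by (15), $F_1 \cup F_2 \not\to x$; so
$x \in \mathcal{D}(F_1 \cup F_2)$.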

The assertion about disbelief contained in (20) has a degree of commonsense
plausibility. In general, we might expect disbelief to diminish in the face of a
broader consensus, and, more specifically, the familiar De Morgan pattern
(compare (4)) is perhaps reassuring. If (20) is extended, by an easy induction, to

(21) $\mathcal{D}(F_1 \cup \cdots \cup F_k) = \mathcal{D}F_1 \cap \cdots \cap \mathcal{D}F_k$

for any integer $k > 2$, we can observe that a convenient formula for calculation
has emerged. Suppose that $F = \{x_1, x_2, \ldots, x_n\}$. Then

(22) $\mathcal{D}F = \mathcal{D}(\{x_1\} \cup \cdots \cup \{x_n\}) = \mathcal{D}\{x_1\} \cap \cdots \cap \mathcal{D}\{x_n\}$.

If we can tabulate disbelievers for each individual in a faction, then the
disbelievers for the faction as a whole are those who disbelieve every member of the
faction. The preceding conclusion about $(S, \to, \mathcal{D})$ gives us warning that this
model is as rigid as the original set model was flexible. Yet the dual De Morgan
equation does not hold. We have instead

(23) $\mathcal{D}(F_1 \cap F_2) \supset \mathcal{D}F_1 \cup \mathcal{D}F_2$.

To see that equality need not hold, let $F_1$ and $F_2$ be the singletons $\{l_2\}$ and $\{l_1\}$
(referring to Figure 3). Then $\mathcal{D}(F_1 \cap F_2) = \mathcal{D}\emptyset = S$, but $\mathcal{D}F_1 \cup \mathcal{D}F_2 = K \cup \{l_1, l_3\}$.
The proof of (23) is immediate from (19), using the inclusions $F_1 \cap F_2 \subset F_1$ and
$F_1 \cap F_2 \subset F_2$.
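Rules (20), (22), and (23) can be checked exhaustively on the eight-member prototype, since there are only 256 factions. The sketch below assumes the same pre-order as the prototype example (encoded in the hypothetical helper `dominates`) and tabulates $\mathcal{D}$ once before testing every pair of factions.

```python
from itertools import combinations

K = frozenset({"k1", "k2", "k3"})
L = frozenset({"l1", "l2", "l3"})
M = frozenset({"m1", "m2"})
S = K | L | M

def dominates(x, y):
    """Assumed pre-order of the prototype example."""
    return (x == y or y in M
            or (x in K and y in K)
            or (x == "l1" and y in L))

def D(F):
    """Equation (16): members of S dominated by no member of F."""
    return frozenset(x for x in S if not any(dominates(y, x) for y in F))

subsets = [frozenset(c) for r in range(len(S) + 1)
           for c in combinations(sorted(S), r)]
D_of = {F: D(F) for F in subsets}       # tabulate D for all 256 factions

# (20): D(F1 u F2) = D(F1) n D(F2)
rule_20 = all(D_of[F1 | F2] == (D_of[F1] & D_of[F2])
              for F1 in subsets for F2 in subsets)

def D_by_members(F):
    """Right-hand side of (22): intersect the disbelievers of each member."""
    result = S
    for x in F:
        result = result & D_of[frozenset({x})]
    return result

rule_22 = all(D_of[F] == D_by_members(F) for F in subsets)

# (23): D(F1 n F2) contains D(F1) u D(F2), possibly strictly
rule_23 = all(D_of[F1 & F2] >= (D_of[F1] | D_of[F2])
              for F1 in subsets for F2 in subsets)

F1, F2 = frozenset({"l2"}), frozenset({"l1"})
print(rule_20, rule_22, rule_23)
print(sorted(D_of[F1 & F2]), sorted(D_of[F1] | D_of[F2]))
```

The last line prints the two sides of (23) for the singletons $\{l_2\}$ and $\{l_1\}$, where the inclusion is strict.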

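Note that, with one exception, analogues for $\mathcal{D}$ have been found of the four
displayed properties for $\mathcal{C}$: (19) corresponds to (2), (18) to (3), and (20) to (4).
We now look for an analogue of (1). We rule out the obvious candidate, $\mathcal{D}\mathcal{D}F = F$,
because it can be combined with (20) to establish an equality in place of (23).
An alternate form of (1), $\mathcal{C}\mathcal{C}\mathcal{C}F = \mathcal{C}F$, suggests a less restrictive analogue which
we now establish.

PROPOSITION. In a dominance model $(S, \to, \mathcal{D})$, for every $F \subset S$ we have

(24) $\mathcal{D}\mathcal{C}\mathcal{D}F = \mathcal{D}F$.

(Note that parentheses have been consistently omitted. We could write
$\mathcal{D}(\mathcal{C}(\mathcal{D}F))$ for $\mathcal{D}\mathcal{C}\mathcal{D}F$.)

Proof. First, one inclusion is immediate; for $\mathcal{D}F \subset \mathcal{C}F$ by (10), hence by (2)
$\mathcal{C}\mathcal{D}F \supset \mathcal{C}\mathcal{C}F$. That is, by (1), $\mathcal{C}\mathcal{D}F \supset F$. Finally, by (19), $\mathcal{D}\mathcal{C}\mathcal{D}F \subset \mathcal{D}F$.

Suppose now that the opposite inclusion does not hold, i.e., for some
particular $F$ and some $y$ in $\mathcal{D}F$ we have $y \notin \mathcal{D}\mathcal{C}\mathcal{D}F$, i.e., $y \in \mathcal{C}\mathcal{D}\mathcal{C}\mathcal{D}F$. By (17) then
$\mathcal{C}\mathcal{D}F \to y$, which means by definition of $\to$ that there is some $z \in \mathcal{C}\mathcal{D}F$ such that
$z \to y$. Again by (17), $F \to z$. Hence by transitivity $F \to y$, contrary to the
assumption that $y \in \mathcal{D}F$.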

An unsurprising consequence of (24) is the following 
COROLLARY. In a dominance model $(S, \to, \mathcal{D})$, if $\mathcal{D}\mathcal{D}F$ is equal to $F$ for every
faction $F \subset S$, then $\mathcal{D}$ is equal to the complement function $\mathcal{C}$.

Conclusion (24), perhaps more than (22), may make the model $(S, \to, \mathcal{D})$
appear untenable. Do we indeed expect the disbelievers of the nondisbelievers
of a faction $F$ to be indistinguishable from the disbelievers of $F$ itself? To test
the model further, let us designate as subservient any faction $F$ for which $\mathcal{D}\mathcal{D}F$
is empty. It is easy now to show that if $S$ has a subservient faction $F$, then $\mathcal{D}F$
dominates every member of $S$. In the next section, we suggest an approach which
allows us to retain or discard (24) as we please.


Axiomatic models. Our preliminary set model $(S, \mathcal{D})$ required too little of $\mathcal{D}$.
The dominance model $(S, \to, \mathcal{D})$ may have required too much, but at least it
provided several properties which we may or may not wish to retain.

(10) $\mathcal{D}F \subset \mathcal{C}F$                                          I
(18) $\mathcal{D}\emptyset = S$                                             II
(20) $\mathcal{D}(F_1 \cup F_2) = \mathcal{D}F_1 \cap \mathcal{D}F_2$       III
(24) $\mathcal{D}\mathcal{C}\mathcal{D}F = \mathcal{D}F$.                   IV

Another candidate for special status, Lemma (19), is an immediate consequence
of III, using the equivalence of $F_1 \subset F_2$ with $F_1 \cup F_2 = F_2$ and $F_1 \cap F_2 = F_1$. One is
tempted to seek consequences of I, II, and III, omitting IV, or substituting some
other restriction. But it is well to point out that we are working near familiar
ground. Let us apply the operator $\mathcal{C}$ to both sides of I, II, III, and IV. We get

$\mathcal{C}\mathcal{D}F \supset \mathcal{C}\mathcal{C}F = F$                                                              I'
$\mathcal{C}\mathcal{D}\emptyset = \mathcal{C}S = \emptyset$                                                               II'
$\mathcal{C}\mathcal{D}(F_1 \cup F_2) = \mathcal{C}(\mathcal{D}F_1 \cap \mathcal{D}F_2) = \mathcal{C}\mathcal{D}F_1 \cup \mathcal{C}\mathcal{D}F_2$   III'
$\mathcal{C}\mathcal{D}\mathcal{C}\mathcal{D}F = \mathcal{C}\mathcal{D}F$.                                                IV'

Regarding the combined operator $\mathcal{C}\mathcal{D}$ as a closure operator, we recognize in
I', II', III', and IV' the Kuratowski closure axioms for a topological space. In
fact, axioms I, II, III, and IV are also axioms for a topological space in terms of
an exterior operator $\mathcal{D}$. The connections between orderings and topologies are
well known; e.g., see Sharp [3]. Students of dominance, skepticism, hostility,
or other personal traits easily caricatured by a pre-order relation may find the
world of topological spaces at least loosely related to their own milieus.
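The four closure axioms I' to IV' can be confirmed by brute force for any small pre-order. The sketch below is one way to do it, assuming the pre-order is generated from a list of arrows by reflexive-transitive closure; the helper names (`preorder_from_arrows`, `make_D`) and the particular arrow list, chosen to reproduce the prototype seminar, are illustrative assumptions.

```python
from itertools import combinations

def preorder_from_arrows(S, arrows):
    """Reflexive-transitive closure of the given arrows, as a set of pairs."""
    rel = {(x, x) for x in S} | set(arrows)
    for k in S:                         # Warshall's algorithm
        for i in S:
            if (i, k) in rel:
                for j in S:
                    if (k, j) in rel:
                        rel.add((i, j))
    return rel

def make_D(S, rel):
    """Disbeliever function (16) for the pre-order rel on S."""
    def D(F):
        return frozenset(x for x in S if not any((y, x) in rel for y in F))
    return D

S = frozenset({"k1", "k2", "k3", "l1", "l2", "l3", "m1", "m2"})
arrows = [("k1", "k2"), ("k2", "k3"), ("k3", "k1"), ("l1", "l2"), ("l1", "l3")]
arrows += [(x, m) for x in S for m in ("m1", "m2")]   # everyone dominates M
D = make_D(S, preorder_from_arrows(S, arrows))

subsets = [frozenset(c) for r in range(len(S) + 1)
           for c in combinations(sorted(S), r)]
cl = {F: S - D(F) for F in subsets}     # closure operator CD, tabulated

kuratowski_ok = (
    all(F <= cl[F] for F in subsets)                        # I'
    and cl[frozenset()] == frozenset()                      # II'
    and all(cl[F1 | F2] == (cl[F1] | cl[F2])
            for F1 in subsets for F2 in subsets)            # III'
    and all(cl[cl[F]] == cl[F] for F in subsets)            # IV'
)
print(kuratowski_ok)
print(sorted(cl[frozenset({"l1"})]))    # everyone dominated by l1
```

In this reading the closure of a faction is simply the set of members it dominates, which is why idempotence (IV') reduces to transitivity.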


Conclusions. At the beginning it was remarked that this presentation could 
be regarded, among other things, as a source of exercises on sets, digraphs, and 
finite topologies. Students can be invited to set up specific domination models 
based on their own or imagined social situations. Digraphs and corresponding 
connectivity matrices are a source of further explorations. Finally, if one calls 
a subset $F \subset S$ open when $F = \mathcal{D}M$ for some subset $M \subset S$, one can proceed to
verify the standard open-set axioms for a topological space and to check these
axioms with specific examples. The closure operator $\mathcal{C}\mathcal{D}$ and the interior operator
$\mathcal{D}\mathcal{C}$ are, of course, as susceptible to mathematical whimsy as the exterior
operator $\mathcal{D}$.
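As a sample of the suggested exercise, the following sketch lists the open sets $F = \mathcal{D}M$ of the prototype seminar and checks the open-set axioms directly. It assumes the prototype pre-order as described earlier; `dominates` and `D` are illustrative names.

```python
from itertools import combinations

K = frozenset({"k1", "k2", "k3"})
L = frozenset({"l1", "l2", "l3"})
M = frozenset({"m1", "m2"})
S = K | L | M

def dominates(x, y):
    """Assumed pre-order of the prototype example."""
    return (x == y or y in M
            or (x in K and y in K)
            or (x == "l1" and y in L))

def D(F):
    """Exterior operator: members of S dominated by no member of F."""
    return frozenset(x for x in S if not any(dominates(y, x) for y in F))

subsets = [frozenset(c) for r in range(len(S) + 1)
           for c in combinations(sorted(S), r)]

# A set is called open when it equals D(M) for some subset M of S.
open_sets = {D(Msub) for Msub in subsets}

has_extremes = frozenset() in open_sets and S in open_sets
unions_ok = all((U | V) in open_sets for U in open_sets for V in open_sets)
intersections_ok = all((U & V) in open_sets
                       for U in open_sets for V in open_sets)

print(len(open_sets), has_extremes, unions_ok, intersections_ok)
for U in sorted(open_sets, key=lambda U: (len(U), sorted(U))):
    print(sorted(U))
```

Since $S$ is finite, closure under pairwise unions and intersections is all that the open-set axioms require beyond the presence of $\emptyset$ and $S$.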


The author is indebted to his colleague, C. A. Grobe, Jr., for reading a preliminary draft and 
for pointing out infelicities, and to the Department of Mathematical Sciences at New Mexico 
State University for secretarial assistance during preparation of the final manuscript. 

The slight mathematical content of this article was first used by the author as the core of his 
inaugural lecture “A Biased Sample of One” as Wing Professor of Mathematics at Bowdoin College 
on March 21, 1966. Completely rearranged, it served again for an address “Postulates About 
People: An Introduction to Finite Topology” at the Annual Meeting of NCTM in Philadelphia on 
April 19, 1968. A revised version was used at the Boston Meeting of NCTM on Nov. 14, 1968, after 
a “dry run” under the title “The Topology of Antagonism” at a Bowdoin mathematics major meet- 
ing on October 1. 
A CORRECTION


EDWIN E. MOISE, Harvard University


On page 454 of the current volume of the MONTHLY, it was stated that the 
MAA was a sponsor of the Chauvenet Symposium at the United States Naval 
Academy. This is not correct. The meeting was sponsored by the local Section, 
and replaced the regular Sectional meeting. But the MAA has never sponsored 
one of its own Section meetings, and does not sponsor other local meetings un- 
less it has participated in their planning or in their financial support. 


MATHEMATICAL NOTES 


EDITED BY DAVID DRASIN
Manuscripts for this Department should be sent to David Drasin, Division of Mathematical 
Sciences, Purdue University, Lafayette, IN 47907. 
A CONVERSE OF THE STONE-WEIERSTRASS THEOREM 
R. E. MULLINS, Marquette University


A familiar formulation of the Stone-Weierstrass theorem states that a
subalgebra $A$ of $C(X)$ is dense in $C(X)$ if $A$ contains constants and separates
points. Here $C(X)$ denotes the sup norm algebra of all real-valued continuous
functions on a compact Hausdorff space $X$, that is, $\|f\| = \sup_{x \in X} |f(x)|$. The
purpose of this short note is to show that the Stone-Weierstrass theorem can be 
stated in a setting which makes a converse meaningful, and to prove that the 
converse is true. 

For any set X, the notation B(X) will denote the algebra of all bounded 
real-valued functions with sup norm defined on X. A subalgebra A of B(X) will 
be called an F-algebra if A is closed relative to uniform convergence, contains 
constants, and separates the points of X. A subalgebra A of B(X) will be called 
a minimal algebra in B(X) if A is an F-algebra which properly contains no other
F-algebra. The Stone-Weierstrass theorem then states that C(X) is a minimal 
algebra in B(X) if X is endowed with some compact Hausdorff topology. The 
following theorem is a converse of the Stone-Weierstrass theorem: 


THEOREM. If A is a minimal algebra in B(X), then A = C(X) for some
compact Hausdorff topology on X.


Proof: Let $S$ denote the space of all nonzero real homomorphisms on $A$.
For each $f$ in $A$, let $\hat{f}$ be the function defined on $S$ as follows: $\hat{f}(\phi) = \phi(f)$ for each
$\phi$ in $S$. If $\hat{A} = \{\hat{f} : f \in A\}$, and $S$ is given the weakest topology that makes all
functions in $\hat{A}$ continuous on $S$, it is well known that this topology is compact
Hausdorff and $\hat{A} = C(S)$. (See [1], Theorem 20, p. 276.) Each point of $X$ may
be identified with an element of $S$, i.e., $x$ is identified with $\phi_x$, where $\phi_x(f) = f(x)$
for each $f$ in $A$. Suppose that under this identification $X$ is properly contained in
$S$ and $\phi \in S - X$. Let $p$ be any element of $X$ and set $A_p = \{f \in A : f(p) = \phi(f)\}$.
Then $A_p$ is a closed subalgebra of $A$ containing all constants. To see that $A_p$ is
an F-algebra, let $x$ and $y$ be any two distinct points of $X$. Since $\hat{A} = C(S)$, it
follows by Urysohn's lemma that there exists a function $g$ in $A$ such that
$\hat{g}(p) = \hat{g}(\phi)$ and $\hat{g}(x) \ne \hat{g}(y)$. Observe that $g \in A_p$ and hence $A_p$ separates points.
Since $A$ is a minimal algebra in $B(X)$, we have $A = A_p$ and $\phi = \phi_p$. This
contradicts the assumption that $\phi \in S - X$. We conclude that $X = S$ and $A = C(X)$.


I wish to thank Steve Friedberg for suggesting a shortening of my original proof. 
FROBENIUS GROUPS AND WEDDERBURN'S THEOREM
R. P. BURN AND D. M. MADURAM, St. John's College, Tamilnadu, India


Ebey and Sitaram [1] have appealed to the geometry of a Desarguesian 
plane in constructing a new proof of Wedderburn’s theorem that every finite 
division ring is commutative. Their technique was to construct a Desarguesian 
plane from a given finite division ring k, and to consider a dilatation group of 
this plane. Such a dilatation group is for geometric reasons a Frobenius group 
in which a Frobenius complement is isomorphic to the multiplicative group $k^*$.
The remainder of the proof is a discussion of the structure of Frobenius
complements.

It is, however, simpler to construct a suitable Frobenius group from the 
division ring k, by considering the group of transformations of the form 


$$x \to xa + b, \qquad a \ne 0,$$

of $k$ onto itself. This is a doubly transitive Frobenius group in which a Frobenius
complement (the subset of transformations of the form $x \to xa$, $a \ne 0$) is
isomorphic to $k^*$. Ebey and Sitaram's discussion of the structure of Frobenius
complements then completes the proof of Wedderburn’s theorem. 
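The construction can be checked by hand in the smallest cases. The following sketch takes $k$ to be the prime field $\mathbf{Z}/5\mathbf{Z}$ (an assumption made only to keep the example finite and concrete) and verifies that the transformations $x \to xa + b$ form a doubly transitive permutation group in which no non-identity element fixes more than one point, with the stabilizer of $0$ playing the role of the Frobenius complement.

```python
from itertools import product

p = 5                                   # k = Z/5Z, an illustrative choice
points = range(p)

# Each transformation x -> x*a + b (a != 0) is stored as its tuple of images.
group = {(a, b): tuple((x * a + b) % p for x in points)
         for a in range(1, p) for b in range(p)}
perms = set(group.values())
identity = tuple(points)

def compose(f, g):
    """Apply f first, then g (maps act on the right, as in x -> xa + b)."""
    return tuple(g[f[x]] for x in points)

closed = all(compose(f, g) in perms for f in perms for g in perms)

# Frobenius condition: a non-identity element fixes at most one point.
at_most_one_fixed = all(sum(1 for x in points if f[x] == x) <= 1
                        for f in perms if f != identity)

# Frobenius complement: the maps x -> x*a, i.e. the stabilizer of 0.
complement = {group[(a, 0)] for a in range(1, p)}

doubly_transitive = all(
    any(f[x1] == y1 and f[x2] == y2 for f in perms)
    for x1, x2 in product(points, repeat=2) if x1 != x2
    for y1, y2 in product(points, repeat=2) if y1 != y2)

print(len(perms), closed, at_most_one_fixed, doubly_transitive)
```

A finite set of permutations closed under composition is a group, so `closed` is all that needs checking; the complement has $p - 1$ elements, matching the order of $k^*$.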

It is true that this Frobenius group has a geometric representation as the 
subgroup of a dilatation group of a Desarguesian plane which fixes a line other 
than the axis, but we have now shown that the geometrical argument is an 
excursion which it is possible to short-cut rather than an integral part of their 
proof. 
A NONSTANDARD EXAMPLE OF A DISTRIBUTION


LEE PENG-YEE, University of Auckland, New Zealand 


It is known ([1] p. 74) that every locally Lebesgue integrable function defines
a distribution in the sense of Laurent Schwartz. There are other functions,
however, that are not locally Lebesgue integrable but which still generate
distributions, for example, the derivative of the function

(1) $F(0) = 0, \qquad F(x) = x^2 \sin(1/x^2), \quad x \ne 0.$

Therefore it is not without interest to consider the above result for more general
integrals, such as the Perron integral ([2] p. 312). In this note, we shall present
a simplified version first, and then indicate how the general case for the Perron 
integral may be obtained. 

If a proposition $p(x)$ is true for every $x$ in $[a, b]$ except perhaps a countable
number of points, we say that $p(x)$ is true nearly everywhere on $[a, b]$. A real-valued
function $f$ is said to be J-integrable on $[a, b]$ if there is a continuous function
$F$ whose derivative exists nearly everywhere and is equal to $f$ nearly everywhere
on $[a, b]$. We may therefore define the J-integral of $f$ on $[a, b]$ as follows:

$$\int_a^b f(x)\,dx = F(b) - F(a).$$

The uniqueness of the integral follows from the following

THEOREM 1. Let $F$ be differentiable nearly everywhere and continuous on $[a, b]$.
If $F'(x) \ge 0$ nearly everywhere on $[a, b]$, then $F$ is nondecreasing on $[a, b]$.
Theorem 1 is well known and is usually proved by reductio ad absurdum; see,
for example, [2] p. 200, Theorem 34.1. Here we give a constructive proof which
appears to be new.

Proof of Theorem 1. Let $S$ be the set of points at each of which the derivative
$F'$ does not exist or, if it does, $F'(x) < 0$. By assumption, $S$ is countable. Let us
denote $S$ by $\{x_1, x_2, \ldots\}$. Now we shall construct an open covering of $[a, b]$.
Let $\varepsilon > 0$ be arbitrary and $x \in [a, b]$. If $x \notin S$, i.e., $F'(x) \ge 0$, then we can choose
$\delta(x) > 0$ such that for $x - \delta(x) < u < x < v < x + \delta(x)$

$$\frac{F(v) - F(x)}{v - x} > -\frac{\varepsilon}{2(b - a)}, \qquad \frac{F(x) - F(u)}{x - u} > -\frac{\varepsilon}{2(b - a)},$$

so that

(2) $F(v) - F(u) = \{F(v) - F(x)\} + \{F(x) - F(u)\} > -\dfrac{\varepsilon (v - u)}{2(b - a)}.$

Since $F$ is continuous at $x = x_n$, $n = 1, 2, \ldots$, we can choose $\delta(x_n) > 0$ such that
for $x_n - \delta(x_n) < u < x_n < v < x_n + \delta(x_n)$

(3) $|F(v) - F(u)| < \varepsilon\, 2^{-n-1}.$

Thus the family of all open intervals $(x - \delta(x), x + \delta(x))$ forms an open covering
of $[a, b]$. It follows from the Heine-Borel covering theorem that there exists a
finite subcovering and hence a division

$$a = y_0 < y_1 < \cdots < y_n = b$$

such that $F(y_i) - F(y_{i-1})$ satisfies either (2) or (3). Consequently,

$$F(b) - F(a) = \sum_{i=1}^{n} \{F(y_i) - F(y_{i-1})\} > -\varepsilon.$$

Since $\varepsilon$ is arbitrary, $F(b) - F(a) \ge 0$. Similarly, we can prove that $F(v) - F(u) \ge 0$
whenever $a \le u < v \le b$, i.e., $F$ is nondecreasing.

As a corollary of Theorem 1, we have


THEOREM 2. Let $f$ be J-integrable on $[a, b]$. If $|f(x)| \le M$ for $a \le x \le b$, then

$$\left| \int_a^b f(x)\,dx \right| \le M(b - a).$$

For proof, we replace $F$ in Theorem 1 by $M(x - a) \pm \int_a^x f(u)\,du$.

We remark that the J-integral is not included in the Lebesgue integral. For
example, the derivative of (1) is J-integrable on $[0, 1]$ but not Lebesgue
integrable since the function $F$ is not absolutely continuous on $[0, 1]$, or even of
bounded variation. It is interesting to note that the Dirichlet function

$$f(x) = \begin{cases} 1 & \text{when } x \text{ is rational}, \\ 0 & \text{when } x \text{ is irrational}, \end{cases}$$

is also J-integrable.
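The remark that the function $F$ of (1) is not of bounded variation on $[0, 1]$ can be made tangible numerically. The sketch below samples $F$ at the points $x_k = 1/\sqrt{\pi(k + 1/2)}$, where $\sin(1/x^2) = \pm 1$ (this choice of sample points is ours, not the author's), and sums the jumps between consecutive samples; the partial sums grow roughly like a multiple of $\log n$ instead of settling down.

```python
import math

def F(x):
    """The function of equation (1): x^2 sin(1/x^2), with F(0) = 0."""
    return 0.0 if x == 0 else x * x * math.sin(1.0 / (x * x))

def variation_along_peaks(n):
    """Sum of |F(x_k) - F(x_{k+1})| for x_k = 1/sqrt(pi*(k + 1/2)), k < n.

    Any such sum is a lower bound for the total variation of F on [0, 1].
    """
    xs = [1.0 / math.sqrt(math.pi * (k + 0.5)) for k in range(n + 1)]
    return sum(abs(F(xs[k]) - F(xs[k + 1])) for k in range(n))

for n in (10, 100, 1000, 10000):
    print(n, round(variation_along_peaks(n), 3))
```

Each term equals $1/(\pi(k + 1/2)) + 1/(\pi(k + 3/2))$ up to rounding, so the sums behave like a harmonic series.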

In what follows, we shall show that every locally J-integrable function, i.e.,
a function J-integrable on any closed bounded interval on the real line, defines
a distribution in the sense of Laurent Schwartz [1]. Let $\mathcal{D}$ be the space of all
complex functions on $(-\infty, \infty)$ which are indefinitely differentiable and have
bounded supports. A sequence $\{\varphi_n\}$ of functions in $\mathcal{D}$ with supports contained
in the same bounded set is said to converge to a function $\varphi$ of $\mathcal{D}$ as $n \to \infty$ if the
derivatives of any given order $m$ of the $\varphi_n$ converge uniformly as $n \to \infty$ to the
corresponding derivative of $\varphi$. A linear functional $T$ on $\mathcal{D}$ is continuous if $\varphi_n \to \varphi$
in the above sense implies that the complex numbers $T(\varphi_n)$ converge to the
complex number $T(\varphi)$ as $n \to \infty$. A distribution is a continuous linear functional on
the vector space $\mathcal{D}$. A complex function is J-integrable if each of its real and
imaginary parts is.

THEOREM 3. If $f$ is locally J-integrable on $(-\infty, \infty)$, then

$$T_f(\varphi) = \int_{-\infty}^{\infty} f(x)\varphi(x)\,dx$$

is a distribution.


Proof. Obviously $T_f$ defines a linear functional on $\mathcal{D}$. We shall prove its
continuity. Let $\varphi_n \to \varphi$ in $\mathcal{D}$ with supports contained in $[-a, a]$, and let

$$F(x) = \int_{-a}^{x} f(u)\,du, \qquad -a \le x \le a.$$

For brevity, we write $\psi_n$ for $\varphi_n - \varphi$. Then we have

$$[F(x)\psi_n(x)]' = f(x)\psi_n(x) + F(x)\psi_n'(x)$$

nearly everywhere on $[-a, a]$. Since $F$ is continuous on $[-a, a]$, so is $F\psi_n'$.
Hence $F\psi_n'$ is J-integrable on $[-a, a]$. Further, let

$$G(x) = F(x)\psi_n(x) - F(-a)\psi_n(-a) - \int_{-a}^{x} F(u)\psi_n'(u)\,du.$$

Then $G$ is continuous and $G'(x) = f(x)\psi_n(x)$ nearly everywhere. Hence $f\psi_n$ is
J-integrable. Consequently,

$$\int_{-a}^{a} f(x)\psi_n(x)\,dx + \int_{-a}^{a} F(x)\psi_n'(x)\,dx = F(a)\psi_n(a) - F(-a)\psi_n(-a).$$

Note that the right hand side vanishes for sufficiently large $a$.

Now let $\|F\|$ denote the maximum value of $|F(x)|$ for $x \in [-a, a]$, and
similarly for $\|\psi_n'\|$. It follows that for sufficiently large $a$

$$|T_f(\varphi_n - \varphi)| = \left| \int_{-a}^{a} f(x)\psi_n(x)\,dx \right| = \left| \int_{-a}^{a} F(x)\psi_n'(x)\,dx \right| \le 2a\,\|F\|\,\|\psi_n'\|.$$

Here we use Theorem 2. Since $\varphi_n \to \varphi$ in $\mathcal{D}$, hence $\|\psi_n'\| = \|\varphi_n' - \varphi'\| \to 0$ as $n \to \infty$
and

$$\lim_{n \to \infty} T_f(\varphi_n) = T_f(\varphi).$$

This completes the proof.

We remark that the conclusion of Theorem 3 is still true with the J-integral 
replaced by the Perron integral. In view of Theorem 65.1 [2] p. 332, the proof 
will go through as above. It would be of interest to know whether there are any 
standard classes of integrals which need not generate distributions. 
SIMPLE ENDOMORPHISM RINGS


R. WARE AND J. ZELMANOWITZ, University of California, Santa Barbara 


It is a relatively easy chore to find the abelian groups which possess a simple 
endomorphism ring. As determined in [1, p. 219], such an abelian group must 
be, up to isomorphism, either a finite sum of cyclic groups of the same prime 
order or a finite direct sum of copies of the rational numbers. As a consequence, 
a simple endomorphism ring is necessarily artinian. The proof in [1] employs the 
elementary structure theory of abelian groups. 

In this note we obtain an analogous description of those modules over any 
commutative ring which have simple endomorphism rings. As a further applica- 
tion of our elementary ring-theoretic methods, we determine the torsion-free 
modules over a commutative integral domain which have a (von Neumann) 
regular endomorphism ring. 

Throughout, $R$ will denote an arbitrary commutative ring with identity
element, $M$ will be a unitary $R$-module, and we let $E$ be the ring of
$R$-endomorphisms of $M$, acting as right operators on $M$.

THEOREM. The endomorphism ring of $M$ is simple if and only if $M$ is
isomorphic to a finite direct sum of copies of the quotient field $K$ of $R/P$ for some prime
ideal $P$ of $R$.

When this is the case, $P$ is equal to the annihilator ideal of $M$, and $E$ is
isomorphic to a full matrix ring over $K$.
Proof. Suppose that $E$ is simple. Set $F$ = the center of $E$ and $P = \{r \in R \mid rM = 0\}$.
Then $M$ is a faithful $R/P$-module, and $R/P$ may be identified with the subring
of $E$ consisting of the endomorphisms of $M$ given by multiplication by elements
of $R/P$. Making this identification we have $R/P \subset F \subset E$.

$F$, being the center of a simple ring with identity, is a field, and consequently
$R/P$ is an integral domain and $P$ is a prime ideal. It follows that $F$ contains the
quotient field $K$ of $R/P$. $M$ is a faithful right $E$-module and hence via restriction
is a $K$-vector space. Thus $M$ is $K$-isomorphic to a direct sum of copies of $K$.
A fortiori, this is an $R/P$-isomorphism of $M$. For the usual reason, $M$ is then
$R$-isomorphic to a finite direct sum of copies of $K$; otherwise, the set of
$R$-endomorphisms with finite dimensional image would be a proper two-sided ideal of $E$.

The converse is immediate. 

REMARK. Schur's Lemma states that if $R$ is any ring and $M$ is a simple
$R$-module then $E = \mathrm{Hom}_R(M, M)$ is a division ring. The converse is false, even
for $R$ commutative; e.g., take $R$ the ring of rational integers and $M$ the field of
rational numbers. The following corollary shows that for commutative rings
this is essentially the only exception to the converse of Schur's Lemma.

COROLLARY. Let $P$ be the annihilator ideal of $M$ in $R$. The following two
statements are equivalent.

I. $E$ is a division ring.

II. Either: (1) $P$ is a maximal ideal and $M$ is a simple $R$-module (and
$M \cong R/P$ as an $R$-module, $E \cong R/P$ as a ring); or: (2) $P$ is a nonmaximal prime ideal
and $M$ is $R$-isomorphic to the quotient field $K$ of $R/P$ (and $E \cong K$ as a ring).


DEFINITION: By a regular ring we mean one in which for every element x 
there exists an element y with x =xyx. We note that if x is an element of a ring 
with identity which has this property, and x is not a zero divisor, then x is a unit:
for x(yx—1)=0 and (xy—1)x=0 imply that xy=1=yx. (This proof can be 
modified to show that any regular ring which possesses a nonzero divisor neces- 
sarily has an identity element.) 


THEOREM. Let $M$ be a torsion-free module over a commutative integral domain
$R$. Then $M$ has a regular endomorphism ring if and only if $M$ is isomorphic to a
direct sum of copies of the quotient field $K$ of $R$.

Proof. Again let $E = \mathrm{Hom}_R(M, M)$, and assume that $E$ is regular. Then for
any $0 \ne r \in R$, let $\phi_r$ denote the endomorphism of $M$ defined by $m\phi_r = rm$ ($m \in M$);
$\phi_r$ is in the center of $E$. We claim that $\phi_r$ is a unit in $E$. By the remark above it
suffices to show that $\phi_r$ is not a zero-divisor in $E$, a fact which is an immediate
consequence of the hypothesis that $M$ be torsion-free. A fortiori, $\phi_r$ is an
epimorphism; i.e., $rM = M$.

As $r$ was an arbitrary nonzero element of $R$, $M$ is a torsion-free divisible
module, and as such is a $K$-module, and hence $R$-isomorphic to a direct sum of
copies of $K$.

For the converse, let $K$ denote the quotient field of $R$. Note that
$\mathrm{Hom}_R(M, M) = \mathrm{Hom}_K(M, M)$, and apply the fact that the endomorphism ring
of a vector space is regular [2; p. 123].
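For very small rings the regularity condition $x = xyx$ can simply be tested by exhaustion. The sketch below does this for three finite examples chosen by us for illustration: the ring of $2 \times 2$ matrices over the two-element field (the endomorphism ring of a two-dimensional vector space), and the rings of integers modulo 4 and modulo 6. The function name `is_regular` is hypothetical.

```python
from itertools import product

def is_regular(elements, mul):
    """True if every x has some y with x*y*x == x (von Neumann regularity)."""
    return all(any(mul(mul(x, y), x) == x for y in elements)
               for x in elements)

def mat_mul(A, B):
    """Product of 2x2 matrices over F_2, stored as tuples (a, b, c, d)."""
    a, b, c, d = A
    e, f, g, h = B
    return ((a * e + b * g) % 2, (a * f + b * h) % 2,
            (c * e + d * g) % 2, (c * f + d * h) % 2)

def mod_mul(n):
    """Multiplication in the ring of integers modulo n."""
    return lambda x, y: (x * y) % n

matrices = list(product(range(2), repeat=4))    # all 16 matrices over F_2

print("M_2(F_2):", is_regular(matrices, mat_mul))
print("Z/4Z:    ", is_regular(list(range(4)), mod_mul(4)))
print("Z/6Z:    ", is_regular(list(range(6)), mod_mul(6)))
```

In $\mathbf{Z}/4\mathbf{Z}$ the element 2 has no such $y$, since $2 \cdot y \cdot 2$ is always 0; the same obstruction, $2 = 4y$, shows that the ring of integers itself is not regular.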
RESEARCH PROBLEMS


EDITED BY VICTOR KLEE 


In this Department the Monthly presents easily stated research problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied 
by relevant references (if any are known to the author) and by a brief description of known 
partial results. Material should be sent to Victor Klee, Department of Mathematics, University 
of Washington, Seattle, WA 98105. 


IS A BODY SPHERICAL IF ALL ITS PROJECTIONS HAVE THE SAME I.Q.? 
G. D. CHAKERIAN, University of California at Davis 


The Isoperimetric Theorem asserts that of all simple closed plane curves of 
a given perimeter, the circle encloses the largest area. If we denote the perimeter 
by L and area by A, then this is equivalent to the following isoperimetric in- 
equality: 


(1) $\dfrac{4\pi A}{L^2} \le 1,$


where equality can hold only for circles. The expression on the left hand side of 
(1) is the “isoperimetric quotient” of the given curve. Polya [6, p. 180] abbrevi- 
ates this quotient as “I.Q.” and thereby obtains the following happy restate- 
ment of the Isoperimetric Theorem: “Of all simple closed plane curves, the 
circle has the highest I.Q.” 
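A quick computation shows how the I.Q. behaves for familiar curves. The sketch below evaluates $4\pi A/L^2$ for regular polygons inscribed in the unit circle, using the elementary formulas $A = \tfrac{n}{2}\sin(2\pi/n)$ and $L = 2n\sin(\pi/n)$; the function names are illustrative.

```python
import math

def isoperimetric_quotient(area, perimeter):
    """The I.Q. of a plane curve: 4*pi*A / L^2."""
    return 4.0 * math.pi * area / perimeter ** 2

def regular_polygon_iq(n):
    """I.Q. of a regular n-gon inscribed in the unit circle."""
    area = 0.5 * n * math.sin(2.0 * math.pi / n)
    perimeter = 2.0 * n * math.sin(math.pi / n)
    return isoperimetric_quotient(area, perimeter)

print("circle", isoperimetric_quotient(math.pi, 2.0 * math.pi))
for n in (3, 4, 6, 12, 100):
    print(n, round(regular_polygon_iq(n), 4))
```

The values increase with the number of sides and approach, but never reach, the circle's quotient of 1.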

In what follows we restrict our attention to convex bodies (a convex body 
in Euclidean n-space is a compact, convex subset with nonempty interior). 
We ask to what extent a 3-dimensional convex body is characterized by the 
I.Q.’s of its orthogonal projections onto planes through the origin. For example, 
if all projections have I.Q.=1, then the body is a ball, since all projections are 
circular disks. The title question asks: is the ball the only convex body whose
orthogonal projections all have the same I.Q.?

If we consider a 3-dimensional convex body K, then we have a number of 
analogues of the isoperimetric quotient. If V, S, and M are the volume, surface 
area, and total mean curvature of K, then the following analogues of (1) are 
valid (see [4] for proofs and further discussion of these quantities):

(2a) $\dfrac{4\pi S}{M^2} \le 1,$ \qquad (2b) $\dfrac{3MV}{S^2} \le 1.$

From (2a) and (2b) follows the well-known analogue of (1):

(2c) $\dfrac{36\pi V^2}{S^3} \le 1.$

In (2a) and (2c) equality can hold if and only if K is a ball. Equality can hold in 
(2b) in other cases. When we refer to “the I.Q. of K in the sense of (2a)” we 
shall mean the quotient on the lefthand side of (2a) computed for the 3-dimen- 
sional convex body K. Similarly for I.Q. in the sense of (2b) or (2c). Thus, for 
example, of all 3-dimensional convex bodies, the ball has the highest I.Q. in the 
sense of (2a) or (2c). 
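The three quotients are easy to evaluate for simple bodies. The sketch below reads them as $4\pi S/M^2$, $3MV/S^2$, and $36\pi V^2/S^3$, and compares a ball with a cube. The value $M = 3\pi a$ for a cube of edge $a$ is not stated in the text; it is supplied here as an assumption, taken from the standard formula for convex polyhedra (half the sum over edges of edge length times exterior dihedral angle).

```python
import math

def quotients(V, S, M):
    """I.Q. of a convex body in the senses (2a), (2b), (2c)."""
    q2a = 4.0 * math.pi * S / M ** 2
    q2b = 3.0 * M * V / S ** 2
    q2c = 36.0 * math.pi * V ** 2 / S ** 3
    return q2a, q2b, q2c

def ball(r):
    """Volume, surface area, total mean curvature of a ball of radius r."""
    return (4.0 / 3.0 * math.pi * r ** 3, 4.0 * math.pi * r ** 2,
            4.0 * math.pi * r)

def cube(a):
    """Volume, surface area, total mean curvature of a cube of edge a.

    M = 12 edges * a * (pi/2) / 2 = 3*pi*a (assumed polyhedral formula).
    """
    return (a ** 3, 6.0 * a ** 2, 3.0 * math.pi * a)

for name, body in (("ball", ball(1.0)), ("cube", cube(1.0))):
    q2a, q2b, q2c = quotients(*body)
    print(name, round(q2a, 4), round(q2b, 4), round(q2c, 4),
          "product check:", round(q2a * q2b ** 2, 4))
```

The final column illustrates how (2c) follows from (2a) and (2b): the quotient in (2c) is the quotient in (2a) times the square of the quotient in (2b).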

Can one estimate the I.Q. of a convex body in terms of the I.Q.’s of its 
planar sections or projections? Croft [2] attributes the following (unsettled) 
question to Steinhaus. Let K be a 3-dimensional convex body, and for each 
plane $E$, let $q(E \cap K)$ be the I.Q. (in the sense of (1)) of $E \cap K$ (consider only
planes $E$ such that $E \cap K$ has positive area). Then is it true that:


(3) $\dfrac{36\pi V^2}{S^3} \ge \inf\,(q(E \cap K))^2\,?$
The infimum is taken over all planes E which cut K in a set of positive area. 
Qualitatively, this would imply that K has high I.Q. in the sense of (2c) if all 
its planar sections have high I.Q. in the sense of (1). 

Let $A(E \cap K)$ and $L(E \cap K)$ be the area and perimeter respectively of
$E \cap K$. Then one has the following integralgeometric formulas for $V$, $S$, and $M$:

(4) $V = \dfrac{1}{2\pi} \displaystyle\int A(E \cap K)\,dE,$

(5) $S = \dfrac{2}{\pi^2} \displaystyle\int L(E \cap K)\,dE,$

(6) $M = \displaystyle\int dE.$

These integrals are evaluated over the set of all planes $E$ such that $E \cap K \ne \emptyset$
using the integralgeometric density $dE$ for planes in Euclidean 3-space. See
[4] for the relevant definitions. Now, if we start with (5) and apply Schwarz'
inequality for integrals, we obtain


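(7) $S^2 = \dfrac{4}{\pi^4} \left( \displaystyle\int \frac{L(E \cap K)}{\sqrt{A(E \cap K)}} \sqrt{A(E \cap K)}\,dE \right)^2 \le \dfrac{4}{\pi^4} \left( \displaystyle\int A(E \cap K)\,dE \right) \left( \displaystyle\int \frac{L(E \cap K)^2}{A(E \cap K)}\,dE \right).$

Bringing in (4) and (6), we convert (7) into

(8) $\dfrac{S^2}{3MV} \le \dfrac{32}{3\pi^2}\, \dfrac{\displaystyle\int q^{-1}(E \cap K)\,dE}{\displaystyle\int dE}.$

From (8) we obtain

(9) $\dfrac{3MV}{S^2} \ge \dfrac{3\pi^2}{32} \inf q(E \cap K).$

The derivation must be taken with a grain of salt in case $q^{-1}(E \cap K)$ is not
integrable. (Indeed, does there exist $K$ such that $q^{-1}(E \cap K)$ is not integrable over
the set of $E$ such that $A(E \cap K) \ne 0$?)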

The inequality (9) seems in any case rather crude, but it does contain a 
qualitative implication to the effect that K has high I.Q. in the sense of (2b) 
if all its planar sections have high I.Q. in the sense of (1). Can one replace the 
constant $3\pi^2/32$ by 1 in the inequality (9)?

It is possible to get better results using orthogonal projections rather than 
sections. For each direction (unit vector) u, let K(u) denote the orthogonal 
projection of K onto a plane orthogonal to the direction uw. Let A(u) and L(u) 
be the area and perimeter respectively of K(u). Then one has the following 
expressions for S and M: 


(10) $S = \dfrac{1}{\pi} \displaystyle\int A(u)\,du,$

(11) $M = \dfrac{1}{2\pi} \displaystyle\int L(u)\,du,$

where the integration is in each case over the unit sphere. If we start with (11) 
and apply Schwarz’ inequality (as in the derivation of (8)) and then apply (10), 
we obtain 


$$\frac{M^2}{4\pi S} \le \frac{1}{16\pi^2} \int \frac{L(u)^2}{A(u)}\, du = \frac{1}{4\pi} \int \frac{du}{q(K(u))}, \tag{12}$$

where $q(K(u))$ is of course the I.Q. of $K(u)$ in the sense of (1). From (12) follows


$$\frac{4\pi S}{M^2} \ge \inf q(K(u)). \tag{13}$$

Again we have a qualitative result: K has high I.Q. in the sense of (2a) if all its
orthogonal projections have high I.Q.

Examination of the derivation of (13) shows that equality holds if and only 
if A(u) and L(u) are both constant. Bodies for which A(u) is constant are called
“bodies of constant brightness,” and bodies for which L(u) is constant turn out 
to be precisely the well-known bodies of constant width. Hence equality holds 
in (13) if and only if K is a body of constant width which is also of constant 
brightness. If K has constant width and constant brightness, then must K be a ball?
This is true if the boundary of K is sufficiently smooth (see [1, p. 140]), but the 
general case is still unsettled. Note that an affirmative answer to the title 
question would provide an affirmative answer to the last question. See [5] for an 
exposition of a number of interesting related problems of this type. 

Recent work of Groemer [3] is of interest in connection with these problems. 
His results imply, for example, that any 3-dimensional convex body has some 
projection with I.Q. at least 1/4 and also some section with I.Q. at least 1/4. 
CLASSROOM NOTES


EDITED BY DAVID DRASIN


Manuscripts for this Department should be sent to David Drasin, Division of Mathematical 
Sciences, Purdue University, Lafayette, IN 47907. 


NORMALITY OF LINEAR COMBINATIONS OF 
NON-NORMAL RANDOM VARIABLES 


B. K. KALE, University of Manitoba


1. If U and V are independent normal random variables (r.v.) then it is
well known that U + V is also normal. The converse proposition, which asserts
that if U, V are independent, and U + V is normal, then each of U and V must be
normal, is a well-known theorem due to Cramér [1]. If the assumption of
independence of U and V is dropped, then one could have the following possibilities:

(i) U and V are marginally normal and U + V is also normal.

(ii) U and V are marginally normal but U + V is not normal.

(iii) U and V are marginally not normal but U + V is normal.

For situation (i), consider (U, V) bivariate normal. It may be noted here 
that U, V, and U+ V normal does not imply that (U, V) is bivariate normal. 
Indeed, the normality of each of $(a_iU + b_iV)$, $i = 1, 2, \ldots, n$, does not imply
that (U, V) is bivariate normal. This follows from the characterization of
bivariate normal distribution which asserts that (U, V) is bivariate normal iff
every linear function $(\alpha U + \beta V)$ is univariate normal (see Rao [2]).

The situations (ii) and (iii) are not that easy to come by. Rosenberg [3] 
gave an example of (ii) but not of (iii). Most of the common textbook examples
of (U, V) marginally normal but not jointly normal are examples of case (ii), 
as remarked by Ferguson [4]. In this note we consider a class of probability 
density functions (p.d.f.’s) which will be useful to demonstrate both situations 
(ii) and (iii). This is done by constructing an example of r.v.’s X, Y which are 
both normal but such that no linear combination $aX + bY$ $(ab \neq 0)$ is normal.


2. For any real $\alpha$, $0 < \alpha < 1$, consider the class of p.d.f.’s of (X, Y), given by

$$g_\alpha(x, y) = \begin{cases} f(x)f(y) + \dfrac{\alpha^2}{2\pi e}\, xy, & |x| \le 1,\ |y| \le 1, \\ f(x)f(y), & \text{otherwise.} \end{cases}$$


Here

$$f(x) = \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}$$

is the p.d.f. of a standard normal r.v. By straightforward integration it is easy to
show that: 


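$$\int_{-\infty}^{\infty} g_\alpha(x, y)\, dx = f(y), \quad y \in R,$$
and
$$\int_{-\infty}^{\infty} g_\alpha(x, y)\, dy = f(x), \quad x \in R.$$
Thus X and Y are both normal with mean zero and variance unity. Similarly, one can show that:

$$\operatorname{Cov}(X, Y) = \frac{\alpha^2}{2\pi e} \cdot \frac{4}{9}.$$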


Let $\phi(t_1, t_2)$ be the characteristic function (ch.f.) of (X, Y). Then:

$$\phi(t_1, t_2) = \int_{-\infty}^{+\infty} \int_{-\infty}^{+\infty} \exp\{i t_1 x + i t_2 y\}\, f(x) f(y)\, dx\, dy + \frac{\alpha^2}{2\pi e} \int_{-1}^{1} \int_{-1}^{1} \exp\{i t_1 x + i t_2 y\}\, xy\, dx\, dy.$$


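Noting that the ch.f. of the standard normal r.v. is $\exp(-t^2/2)$ we have

$$\phi(t_1, t_2) = \exp\{-\tfrac{1}{2}(t_1^2 + t_2^2)\} + (\alpha^2/2\pi e)\, \psi(t_1)\, \psi(t_2), \tag{1}$$
where
$$\psi(t) = \int_{-1}^{1} \exp(itu)\, u\, du = 2i \int_0^1 u \sin tu\, du = \begin{cases} (2i/t^2)\{\sin t - t \cos t\} & \text{if } t \neq 0, \\ 0 & \text{if } t = 0. \end{cases} \tag{2}$$
Let $Z = aX + bY$, $ab \neq 0$; then the ch.f. of $Z$ is given by $\phi(at, bt)$. Now

$$\phi(at, bt) = \exp\{-\tfrac{1}{2}(a^2 + b^2)t^2\} + (\alpha^2/2\pi e)\, \psi(at)\, \psi(bt). \tag{3}$$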


If Z were normally distributed, then noting that Z has mean zero and variance
$a^2 + b^2 + (4\alpha^2/9\pi e)(ab)$, its ch.f. must be given by

$$\eta(t) = \exp\left\{ -\frac{t^2}{2} \left( a^2 + b^2 + \frac{4\alpha^2}{9\pi e}\, ab \right) \right\},$$

and $\phi(at, bt) = \eta(t)$ for all the values of t. It is easy to show that $\phi(a, b) \neq \eta(1)$ and
thus Z cannot be a normal variable. Thus we have case (ii) of X, Y being
marginally normal but no linear combination $aX + bY$, $ab \neq 0$, is normal.
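The claim that the ch.f. of Z differs from the normal ch.f. at t = 1 can be spot-checked numerically. The following is only a sketch under stated assumptions: it uses the closed forms (1)-(3) as read above, the function names are hypothetical, and the values a = b = 1, alpha = 0.5 are chosen purely for illustration.

```python
import math

def psi(t):
    """psi(t) = integral over [-1, 1] of exp(i*t*u) * u du, as in (2)."""
    if t == 0:
        return 0j
    return 2j * (math.sin(t) - t * math.cos(t)) / t**2

def phi(t1, t2, alpha):
    """Joint ch.f. of (X, Y), as in (1)."""
    gauss = math.exp(-0.5 * (t1**2 + t2**2))
    return gauss + alpha**2 / (2 * math.pi * math.e) * psi(t1) * psi(t2)

def eta(t, a, b, alpha):
    """Ch.f. of a normal law with mean 0 and the variance of Z = aX + bY."""
    var = a**2 + b**2 + 4 * alpha**2 * a * b / (9 * math.pi * math.e)
    return math.exp(-0.5 * var * t**2)

# Illustrative values only (assumed, not taken from the note).
a, b, alpha = 1.0, 1.0, 0.5
gap = abs(phi(a, b, alpha) - eta(1.0, a, b, alpha))
print(gap)  # a strictly positive gap means Z cannot be normal
```

A single point where the two functions disagree is enough, since two ch.f.'s of the same distribution must agree for every t.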

It is now trivial to obtain situation (iii). Define $U = aX + bY$ and $V = aX - bY$,
$ab \neq 0$; then both U and V are marginally not normal but $U + V = 2aX$
and $U - V = 2bY$ are both marginally normal, since X and Y are marginally
normal.


3. The above results can be generalized in a straightforward manner to the
situation involving $n$ variables. For $0 < \alpha < 1$, consider the class of p.d.f.’s of
$(X_1, X_2, \ldots, X_n)$ given by

$$g_\alpha(x_1, x_2, \ldots, x_n) = f(x_1)f(x_2) \cdots f(x_n) + \alpha^n\, \frac{x_1 x_2 \cdots x_n}{(2\pi e)^{n/2}}$$

for $|x_i| \le 1$, $i = 1, 2, \ldots, n$; $g_\alpha(x_1, x_2, \ldots, x_n) = f(x_1)f(x_2) \cdots f(x_n)$ otherwise.
Then by arguments similar to those used above, we have
(1) $X_1, X_2, \ldots, X_n$ are marginally normal and any set of $r < n$ variables are
jointly normal,
(2) $\sum_{i=1}^{r} a_i X_i$, $r < n$, is normal,
(3) $\sum_{i=1}^{n} a_i X_i$ with $a_1 a_2 \cdots a_n \neq 0$ is not normal.
A REHABILITATION OF $(1 + z/n)^n$
J. VAN YZEREN, Technological University of Eindhoven 


1. There are many roads to $e^z$. Preference for one or another has become
a matter of context or personal taste. Thus it could happen that the
time-honored sequence $(1 + z/n)^n$ would fall into disgrace. Its “many disadvantages”
even were pointed out by an authority like G. H. Hardy [1], and in fact at first
sight $(1 + z/n)^n$ looks rather awkward to deal with.

However, $(1 + z/n)^n$ leads simply and most directly to essential results if
one uses an adequate expression for the differences.


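2. Starting with real $z = x$ we put

$$1 + \frac{x}{n+1} = u, \qquad 1 + \frac{x}{n} = v.$$

Clearly $nv = nu + u - 1$. This is used in

$$\begin{aligned} u^{n+1} - v^n &= u(u^n - v^n) + (u - 1)v^n = u(u^n - v^n) + n(v - u)v^n \\ &= (u - v)(u^n + u^{n-1}v + \cdots + uv^{n-1} - nv^n) \\ &= (u - v)^2 (u^{n-1} + 2u^{n-2}v + \cdots + nv^{n-1}). \end{aligned}$$

Thus it follows that

$$\left(1 + \frac{x}{n+1}\right)^{n+1} - \left(1 + \frac{x}{n}\right)^n = \frac{x^2}{n^2(n+1)^2} \sum_{k=1}^{n} k\, u^{n-k} v^{k-1}. \tag{2.1}$$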
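The difference formula (2.1) is an exact algebraic identity, so it can be checked in rational arithmetic. The sketch below is an illustration rather than part of the note: the function names are hypothetical, and the sample values of x and n are arbitrary.

```python
from fractions import Fraction

def lhs(x, n):
    """(1 + x/(n+1))^(n+1) - (1 + x/n)^n, computed exactly."""
    return (1 + x / (n + 1)) ** (n + 1) - (1 + x / n) ** n

def rhs(x, n):
    """x^2 / (n^2 (n+1)^2) * sum_{k=1}^{n} k u^(n-k) v^(k-1)."""
    u = 1 + x / (n + 1)
    v = 1 + x / n
    total = sum(k * u ** (n - k) * v ** (k - 1) for k in range(1, n + 1))
    return x**2 / (n**2 * (n + 1) ** 2) * total

def identity_holds(xs, max_n):
    """True if both sides agree exactly for every sampled x and 1 <= n <= max_n."""
    return all(lhs(x, n) == rhs(x, n) for x in xs for n in range(1, max_n + 1))

# Arbitrary rational sample points, including a negative one.
print(identity_holds([Fraction(3, 2), Fraction(-5, 7), Fraction(6)], 8))
```

For positive u and v every term of the sum is positive, which is the monotonicity the text draws from (2.1).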


Here we see that for fixed $x \neq 0$, the sequence $(1 + x/n)^n$ increases with $n$ as soon
as $1 + x/n = v \ge 0$ (which implies $(n + 1 + x)/(n + 1) = u > 0$). It is also obvious
that either $(1 + x/n)^n$ has a limit if $x < 0$ or $(1 - x/n)^n$ has if $x > 0$, and anyway
$(1 - x^2/n^2)^{n^2}$ has a positive limit $\le 1$. Upon taking $n$th roots, we see that
$(1 - x^2/n^2)^n = (1 + x/n)^n (1 - x/n)^n$ tends to 1. Consequently $(1 + x/n)^n$ and
$(1 - x/n)^n$ tend to reciprocal limits “$e^x$” and “$1/e^x$”. In addition the
monotonicity gives

$$\left(1 + \frac{x}{n}\right)^n \le e^x \le \left(1 - \frac{x}{n}\right)^{-n}, \qquad -1 \le \frac{x}{n} < 1.$$

In the special case $n = 1$ we find

$$1 + x \le e^x \le 1 + x + x^2/(1 - x), \qquad -1 \le x < 1,$$

which shows that $e^x$ is differentiable at 0.
Further relevant properties ($e^x \cdot e^y = e^{x+y}$, $De^x = e^x$, $\log x$) follow along
well-known lines.


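3. The complex sequence $(1 + ix/n)^n$ consists of components which are
monotonic (in $n$) and bounded. It leads to cos x and sin x in basically the same
way as $(1 + x/n)^n$ has led to $e^x$. See [2]. However, we omit this because $(1 + z/n)^n$
and the notion of Cauchy-sequence have a much broader scope.

We replace $x$ in (2.1) by $z$ and majorize $|u^{n-k}v^{k-1}|$ as follows:

$$|u^{n-k} v^{k-1}| \le \left(1 + \frac{|z|}{n+1}\right)^{n-k} \left(1 + \frac{|z|}{n}\right)^{k-1} \le \left(1 + \frac{|z|}{n}\right)^n \le e^{|z|}.$$

(If one prefers not to use the latter inequality, a less sharp bound will do:

$$\left(1 + \frac{|z|}{n}\right)^n \le \left(1 + \frac{|z|}{kn}\right)^{kn} \le \left(1 - \frac{|z|}{kn}\right)^{-kn} \le \left(1 - \frac{|z|}{k}\right)^{-k}$$

with natural $k > 2|z|$.)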
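Since $\sum_{k=1}^{n} k = \tfrac{1}{2}n(n+1)$, we find

$$\left| \left(1 + \frac{z}{n+1}\right)^{n+1} - \left(1 + \frac{z}{n}\right)^n \right| \le \frac{|z|^2}{n^2(n+1)^2} \cdot \frac{1}{2} n(n+1) e^{|z|} = \frac{1}{2} |z|^2 e^{|z|} \left( \frac{1}{n} - \frac{1}{n+1} \right),$$

and so

$$\left| \left(1 + \frac{z}{m}\right)^m - \left(1 + \frac{z}{n}\right)^n \right| \le \frac{1}{2} |z|^2 e^{|z|} \left( \frac{1}{n} - \frac{1}{m} \right), \qquad m > n.$$

According to Cauchy’s principle of convergence $\lim_{m \to \infty} (1 + z/m)^m$ exists, giving

$$\left| e^z - \left(1 + \frac{z}{n}\right)^n \right| \le \frac{1}{2} |z|^2 e^{|z|} \cdot \frac{1}{n}.$$

The case $n = 1$ shows that $e^z$ is differentiable at $z = 0$. As the sequence most
easily leads to the proof that $e^{z} \cdot e^{z'} = e^{z + z'}$, there remains only one important
property to prove: the periodicity of $e^z$.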
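The error estimate for $(1 + z/n)^n$ can be observed numerically. This is a minimal sketch, assuming the bound reads $|e^z - (1 + z/n)^n| \le \tfrac{1}{2}|z|^2 e^{|z|}/n$ as reconstructed above; the helper names and the sample points are illustrative assumptions.

```python
import cmath
import math

def approx(z, n):
    """The n-th term (1 + z/n)^n of the sequence."""
    return (1 + z / n) ** n

def bound(z, n):
    """Right-hand side of the estimate: |z|^2 * e^|z| / (2n)."""
    return 0.5 * abs(z) ** 2 * math.exp(abs(z)) / n

def error(z, n):
    """Actual distance between e^z and the n-th term."""
    return abs(cmath.exp(z) - approx(z, n))

# Arbitrary complex sample points (assumed for illustration).
samples = [1 + 0j, 3j, -2 + 1j, 0.5 - 0.5j]
for z in samples:
    for n in (1, 10, 1000):
        print(z, n, error(z, n), bound(z, n))
```

The printed error column should stay below the bound column and shrink roughly like 1/n.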
4. We try to invert $e^z = w$. The possibility of $z \neq 0$ in $e^z = 1$ being of primary
interest, we first consider $w$ such that $|w| = 1$. We write $w = u$ (unitary) and
accordingly introduce unitary sequences, for example

$$\frac{1 + ix/2}{1 - ix/2}, \quad \left(\frac{1 + ix/4}{1 - ix/4}\right)^2, \quad \ldots, \quad \left(\frac{1 + ix/2n}{1 - ix/2n}\right)^n, \quad \ldots$$

which tends to

$$e^{(1/2)ix} / e^{-(1/2)ix} = e^{ix} = u, \qquad |u| = 1.$$

With this procedure as motivation, we now seek to solve the equation $e^{i\xi} = u$,
with $u \neq -1$, $|u| = 1$. We construct a sequence $x_1, x_2, \ldots, x_k, \ldots$ such that

$$u = \frac{1 + ix_1/2}{1 - ix_1/2} = \left(\frac{1 + ix_2/4}{1 - ix_2/4}\right)^2 = \cdots = \left(\frac{1 + ix_{k-1}/2^{k-1}}{1 - ix_{k-1}/2^{k-1}}\right)^{2^{k-2}} = \left(\frac{1 + ix_k/2^k}{1 - ix_k/2^k}\right)^{2^{k-1}} = \cdots$$

Let $x_1 > 0$ (otherwise take $\bar{u}$); then $x_2, x_3, \ldots$ (all $> 0$) follow from

$$\frac{1}{x_1} = \frac{1}{x_2} - \frac{x_2}{4^2}, \quad \ldots, \quad \frac{1}{x_{k-1}} = \frac{1}{x_k} - \frac{x_k}{4^k}, \quad \ldots \tag{4.1}$$

Obviously $x_1 > x_2 > x_3 > \cdots$, and $x_k \to \xi \ge 0$. We now show for $\xi$ that $e^{i\xi} = u$. Let
us write $2^k = 2n$. Then

$$u = \left(\frac{1 + ix_k/2n}{1 - ix_k/2n}\right)^n$$

and as for unitary $a$ and $b$, it follows at once from $|\sum_{k=1}^{n} a^{k-1} b^{n-k}| \le n$ that
$|a^n - b^n| \le |a - b| \cdot n$. Thus

$$\left| \left(\frac{1 + i\xi/2n}{1 - i\xi/2n}\right)^n - u \right| \le \frac{|\xi/n - x_k/n| \cdot n}{|1 - i\xi/2n|\, |1 - ix_k/2n|} \le |\xi - x_k|,$$

hence

$$e^{i\xi} = \lim_{n \to \infty} \left(\frac{1 + i\xi/2n}{1 - i\xi/2n}\right)^n = u.$$

Now let $x_1 > x_1^* > 0$; then (4.1) gives $x_2 > x_2^*, \ldots$, so $\xi \ge \xi^*$ ($\xi = \xi^*$ would give
$u = u^*$). Even more: from

$$\frac{x_1 - x_1^*}{x_1 x_1^*} > \frac{x_2 - x_2^*}{x_2 x_2^*} > \frac{x_3 - x_3^*}{x_3 x_3^*} > \cdots$$

we see that $x_1 - x_1^* > x_2 - x_2^* > \cdots > \xi - \xi^* > 0$. In words: $\xi$ is a strictly
monotonic and continuous function of $x_1$.
The limit case $x_1 = \infty$, $x_2 = 4$, $x_3 = 8(\sqrt{2} - 1), \ldots$ corresponds to $u = -1$.
So, writing $\pi$ for the corresponding $\xi$, we have $e^{i\pi} = -1$. Likewise $e^{-i\pi} = -1$
corresponds to $x_1 = -\infty$, $x_2 = -4, \ldots$. This $\pi$ is the area of the unit circle, for
$\infty, 4, 8(\sqrt{2} - 1), \ldots$ are the areas of subsequent circumscribed $2^k$-gons
$(k = 1, 2, 3, \ldots)$. The equality $e^{i\pi} = e^{-i\pi}$ displays the periodicity of $e^z$.
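The recursion (4.1) is easy to iterate by machine. The sketch below assumes (4.1) in the form $1/x_{k-1} = 1/x_k - x_k/4^k$ and solves it for the positive root $x_k$; the function names are hypothetical, and the starting value $1/x_1 = 0$ encodes the limit case $x_1 = \infty$ discussed above. The second example uses the relation $x_1 = 2\tan(\xi/2)$, which follows from $u = (1 + ix_1/2)/(1 - ix_1/2) = e^{i\xi}$.

```python
import math

def next_x(inv_prev, k):
    """Positive root x_k of 1/x_{k-1} = 1/x_k - x_k / 4**k, given 1/x_{k-1}.

    Written as 2 / (c + sqrt(c^2 + 4^(1-k))) to avoid cancellation.
    """
    c = inv_prev
    return 2.0 / (c + math.sqrt(c * c + 4.0 ** (1 - k)))

def xi_limit(inv_x1=0.0, steps=30):
    """Iterate (4.1) starting from 1/x_1 and return the last x_k."""
    inv = inv_x1
    x = math.inf
    for k in range(2, steps + 2):
        x = next_x(inv, k)
        inv = 1.0 / x
    return x

# Limit case x_1 = infinity: the iterates 4, 8(sqrt(2) - 1), ... approach pi.
print(xi_limit(0.0), math.pi)

# A unitary u = exp(i * 1.0) corresponds to x_1 = 2 * tan(1/2).
print(xi_limit(1.0 / (2.0 * math.tan(0.5))))
```

The first few iterates reproduce the areas of the circumscribed $2^k$-gons named in the text, which is one way to see why the limit is the area of the unit circle.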

In the same “pre-infinitesimal” way $e^z = w$ can be solved for all positive $w$:
change $ix$ into $x$ (that means in (4.1): $-$ into $+$). For example $e^z = 2$ leads to
$2 = (1 + \tfrac{1}{2}x_1)/(1 - \tfrac{1}{2}x_1)$, so $x_1 = 2/3$, which already approximates log 2.
A REMARK ON CHARACTERISTIC POLYNOMIALS


JOSEF SCHMID, University of Fribourg, Switzerland
The purpose of this note is to give a simple proof of the following 


PROPOSITION. If $A$ and $B$ are $n \times m$ and $m \times n$ matrices respectively, over a
unitary, commutative ring $R$, $I_n$ and $I_m$ the unit-matrices of orders $n$ and $m$, then
for the characteristic polynomials of $AB$ and $BA$ holds the relation

$$t^m \det(tI_n - AB) = t^n \det(tI_m - BA).$$


COROLLARY. The characteristic coefficients of AB and BA are equal, as far as 
possible. In particular, the two extreme cases are trAB =trBA and a (generalized) 
Lagrange-identity. 


COROLLARY. If $AB = I_n$ and $BA = I_m$, then $n = m$.
Proof. In this case the proposition yields $t^m(t - 1)^n = t^n(t - 1)^m$ and this
implies $n = m$ (cf. [1, 2]).
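Proof of the proposition. Consider the two quadratic matrices $C$, $D$ of order
$n + m$ over $R[t]$:

$$C = \begin{pmatrix} tI_n & A \\ B & I_m \end{pmatrix}, \qquad D = \begin{pmatrix} I_n & 0 \\ -B & tI_m \end{pmatrix}.$$

Multiplication gives

$$CD = \begin{pmatrix} tI_n - AB & tA \\ 0 & tI_m \end{pmatrix}, \qquad DC = \begin{pmatrix} tI_n & A \\ 0 & tI_m - BA \end{pmatrix},$$

and the result now follows from $\det CD = \det DC$.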
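The relation in the proposition can be spot-checked for small integer matrices. The sketch below is an illustration under assumptions: it works over the integers only (one particular commutative ring), evaluates both sides at sampled integer values of t rather than comparing polynomials symbolically, and all function names and the sample matrices are hypothetical.

```python
from fractions import Fraction

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def det(M):
    """Determinant by Gaussian elimination in exact rational arithmetic."""
    M = [[Fraction(x) for x in row] for row in M]
    n = len(M)
    d = Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if M[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            M[c], M[pivot] = M[pivot], M[c]
            d = -d  # a row swap flips the sign
        d *= M[c][c]
        for r in range(c + 1, n):
            factor = M[r][c] / M[c][c]
            M[r] = [x - factor * y for x, y in zip(M[r], M[c])]
    return d

def char_det(M, t):
    """det(t*I - M) for a square matrix M and a number t."""
    n = len(M)
    return det([[(t if i == j else 0) - M[i][j] for j in range(n)] for i in range(n)])

def relation_holds(A, B, t):
    """Check t^m det(tI_n - AB) == t^n det(tI_m - BA) at one value of t."""
    n, m = len(A), len(B)  # A is n x m, B is m x n
    return t**m * char_det(matmul(A, B), t) == t**n * char_det(matmul(B, A), t)

# Arbitrary sample matrices with n = 2, m = 3.
A = [[1, 2, 0], [3, -1, 4]]
B = [[2, 1], [0, 5], [-3, 2]]
print(all(relation_holds(A, B, t) for t in range(-4, 5)))
```

Both sides are polynomials of degree n + m in t, so agreement at more than n + m points, as sampled here, forces the two polynomials to coincide for this particular pair.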
A COMPUTER ASSISTED APPROACH TO INTEGRAL
CALCULUS VIA JORDAN CONTENT 


G. J. PORTER, UNIVERSITY OF PENNSYLVANIA 


An increasingly large number of schools have been experimenting with the 
use of digital computers in freshman calculus courses. Many of these uses are 
purely computational. It is my feeling that the computer can be used in a 
meaningful way to present the basic concepts of pure mathematics. In particular 
I think that through a well thought out program of computer assignments the 
student can be led to formulate definitions for himself. As an illustration of this 
idea we discuss here a computer assisted approach to integral calculus via 
Jordan content. This course was taught to 30 freshmen at the University of 
Pennsylvania during the fall 1968 semester. 

Before proceeding, a few words about philosophy are appropriate. There are 
as many notions of what constitutes freshman calculus as there are teachers of 
freshman calculus. Such courses range from an introduction to real variables to 
an applied engineering course. As a pure mathematician I believe that before 
one can compute mass, volume, etc., one must arrive at a mathematical defini- 
tion of these quantities. The definition need not be rigorous but it must be 
mathematical. Thus definitions of area such as ‘extent of surface’ are math- 
ematically inadequate. My use of the computer is aimed at helping the students 
formulate mathematical definitions of these quantities. 

After a short introduction to computing and a discussion of the real number 
system, the students were assigned the problem of computing the area bounded 
by an ellipse. This assignment made explicit the necessity of a mathematical 
definition of area and the idea that area is a function which assigns a number to 
a region of the plane. Most students either divided the ellipse into vertical 
strips and approximated the area of these by rectangles (the single integral) or 
divided the plane into a rectangular grid and computed the area of those rec- 
tangles wholly contained within the ellipse (inner content) or those which had 
some point in common with the region (outer content). 

These programs were discussed in class and the basic concepts were ab- 
stracted into the definitions of inner and outer Jordan content in the plane. 
(These were defined using g.l.b. and l.u.b.) When the inner content of the region
equals the outer content we call their common value the area of the region. 
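A present-day version of the grid assignment might look like the following. This is a sketch, not the students' program (which is not reproduced in the article): the function name, the choice of the bounding box as the grid, and the semi-axes a = 2, b = 1 are all assumptions made for illustration.

```python
import math

def jordan_content(a, b, cells):
    """Inner and outer grid sums for the ellipse x^2/a^2 + y^2/b^2 <= 1.

    The bounding box [-a, a] x [-b, b] is cut into cells x cells rectangles.
    """
    hx, hy = 2.0 * a / cells, 2.0 * b / cells

    def inside(x, y):
        return (x / a) ** 2 + (y / b) ** 2 <= 1.0

    def nearest(lo, hi):
        # Point of the interval [lo, hi] closest to 0, in absolute value.
        return 0.0 if lo <= 0.0 <= hi else min(abs(lo), abs(hi))

    inner = outer = 0
    for i in range(cells):
        x0, x1 = -a + i * hx, -a + (i + 1) * hx
        for j in range(cells):
            y0, y1 = -b + j * hy, -b + (j + 1) * hy
            # Wholly contained: the corner farthest from the center is inside.
            if inside(max(abs(x0), abs(x1)), max(abs(y0), abs(y1))):
                inner += 1
            # Meets the region: the point nearest the center is inside.
            if inside(nearest(x0, x1), nearest(y0, y1)):
                outer += 1
    return inner * hx * hy, outer * hx * hy

inner, outer = jordan_content(2.0, 1.0, 200)
print(inner, math.pi * 2.0 * 1.0, outer)
```

Refining the grid squeezes the two sums toward each other, which is the computational counterpart of taking the l.u.b. of inner sums and the g.l.b. of outer sums.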

The next assignments were to compute volume in three and four space. The 
students were able to see how to modify their programs for area to these cases. 
The fact that four dimensional space could not be visualized caused no
difficulty. (It should be noted that computing volumes in n-space by taking
rectangular grids is very time consuming for n larger than three.)

Thus the definitions of Jordan content in the plane were also generalized to
n-space with no apparent difficulty. At this point a mathematical study of the 
properties of volume (additivity on disjoint sets, invariance under translation, 
etc.) can be presented. The level of this presentation should be appropriate for 
the class. 

The next stage of development is to compute the mass of a planar region 
with a given density function, f(x, y). The students, as imagined, subdivided the
region into a grid, chose a point in each rectangle, and computed $\sum f(x_i, y_i)\,\mathrm{Area}(R_i)$,
where $R_i$ is a rectangle in the grid and $(x_i, y_i) \in R_i$. Once again the sum
is taken either over those rectangles contained in the region or over those rec- 
tangles which intersect the region. 

The students have thus arrived at the idea of approximating a function by 
simple functions. Again their idea generalizes to n-space with no difficulty. 
Class discussion was then directed at the problem of which simple functions to 
choose and whether this choice affected the computation. Given a bounded 
density function one can take g.l.b. and l.u.b. on each rectangle and using these
simple functions define upper and lower integrals in a manner similar to the
definition of Jordan content. When these agree one defines $\int_S f\, dV^n$ to be their
common value ($S \subset R^n$, $f: S \to R$).

Most importantly when the integral exists the choice of the simple function 
doesn’t matter and the naive computation $\sum f(x_i)\,\mathrm{Vol}(R_i)$ should be thought
of as an approximation of the integral. 

Returning to the student’s calculation of mass in the plane, an analysis of 
the computer program written will reveal that the student is computing a loop 
within a loop; i.e., he is first summing (say) vertical strips and then is adding
these up. This leads at once to the definition of iterated integral. Generalizing 
this one sees that the n-space integral can be thought of as an (n-1)-space
integral of a single integral by collapsing along some coordinate. Proceeding in 
this manner one can now express the n-space integral as an n-fold iterated 
integral. In fact this was what the students did in their computer programs. 
Moreover, the limits of integration on each variable appear in the programs. 

Another application of this point of view is to the calculation of volumes by 
viewing the n-space integral as a single integral of an (n-1)-space integral.
This is particularly easy when an analytic expression is available for the (n-1)-
space integral. Examples of this approach are the calculations of ‘volumes of 
revolution’ and ‘volumes of known cross-section’. An interesting computation 
is that of the volume of the unit ball in n-space. To illustrate this the following 
problem was assigned. 

Let $B(n, r) = \{(x_1, \ldots, x_n) \in R^n \mid \sum x_i^2 \le r^2\}$.

(a) Prove by analytic methods that $\mathrm{Vol}(B(n, r)) = r^n\,\mathrm{Vol}(B(n, 1))$.

(b) Using the result of (a) and starting with Vol(B(1, 1)) = 2 write a program
to compute Vol(B(n, 1)) for n = 2, 3, ..., M.

By using this inductive procedure the student may now perform a calcula-
tion which was formerly too time consuming. The fact that these volumes have
a maximum for n = 5 and tend to zero as n increases is an amazing discovery for
most students. In addition, when the student learns analytic methods for
evaluating integrals he may return to this problem and derive exact expressions
for Vol(B(n, 1)).
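One way to carry out part (b) today is sketched below. It is an illustration under assumptions, not the program assigned in the course: slicing the unit ball at height x gives an (n-1)-ball of radius $\sqrt{1 - x^2}$, so by (a) $\mathrm{Vol}(B(n, 1)) = \mathrm{Vol}(B(n-1, 1)) \int_{-1}^{1} (1 - x^2)^{(n-1)/2}\, dx$, and the single integral is approximated here by a midpoint sum. The function names and the step count are hypothetical choices.

```python
import math

def slice_integral(n, steps=20000):
    """Midpoint-rule approximation of the integral of (1 - x^2)^((n-1)/2) over [-1, 1]."""
    h = 2.0 / steps
    total = 0.0
    for i in range(steps):
        x = -1.0 + (i + 0.5) * h  # midpoint of the i-th subinterval
        total += (1.0 - x * x) ** ((n - 1) / 2)
    return h * total

def ball_volumes(M, steps=20000):
    """Vol(B(n, 1)) for n = 1..M, starting from Vol(B(1, 1)) = 2."""
    vol = {1: 2.0}
    for n in range(2, M + 1):
        vol[n] = vol[n - 1] * slice_integral(n, steps)
    return vol

volumes = ball_volumes(12)
for n, v in volumes.items():
    print(n, round(v, 4))
print("largest volume at n =", max(volumes, key=volumes.get))
```

Each new dimension costs only one single integral, in contrast with an n-dimensional grid whose cell count grows exponentially with n.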
With the evaluation of n-space integrals reduced to the problem of eval- 
uating single integrals the course specialized to the study of the single integral. 
It is my feeling that this approach has many advantages. Among these are: 


1. The student who takes only one year of calculus is exposed to the ideas 
in n-space as well as on the line.

2. The integral arises more naturally in the plane than on the line. 

3. The student is led to the definitions by his own intuitive ideas.

4. The student is forced to make certain hazy thoughts into precise language 
in order to communicate with the computer. 

5. The definite integral has an existence of its own and is not confused with 
the antiderivative. 

6. After computing approximations of integrals the student is more appre- 
ciative of analytic methods. Early in the course my students were assigned 
the problem of computing the volume inside the sphere, $x^2 + y^2 + z^2 = 4$,
and outside the cylinder, $x^2 + y^2 = 1$. When I did this problem analytically
in class and arrived at an answer of $4\pi\sqrt{3}$ the class gasped. They had
worked very hard to get an answer they could not recognize in this form. 

7. When the fundamental theorem of calculus is proved, it is as amazing a 
theorem as it should be. The fact that the limit of some complicated sums 
which one has to work hard to approximate can be evaluated by means 
of an antiderivative is astounding and shows the strength of the analytic 
approach to mathematics as opposed to brute force machine computa- 
tion. Too often the student misses this point because he does not under- 
stand the definite integral. In fact he may think the fundamental theorem 
is the definition. 




In conclusion let me remark that there is a tendency on the part of some of 
those who use the computer in teaching calculus to use it only to compute those 
integrals which do not yield to analytic methods. While this technique may 
from time to time be useful in applications, I believe it adds nothing math- 
ematically to the calculus course. On the other hand I think the computer can 
be used effectively in motivating and helping the student understand the pure 
mathematics. 
late in life. We agreed, therefore, to cooperate in starting a mathematical “Sun-
day School.” The details of our new enterprise are not quite clear. We shall try 
to limit its scope, at least in the initial stages of its development. The implica- 
tions of the success of such an undertaking can be very significant. 

It is perhaps consistent with the nature of our New Careers mathematics 
program that we should conclude our account of what we have done with it by 
reporting that we are about to walk through another door that has come ajar. 
ELEMENTARY PROBLEMS


Solutions of Elementary Problems should be sent to Problems Group, Mathematics Depart- 
ment, University of Maine, Orono, ME 04473. To facilitate their consideration, solutions of 
Elementary Problems in this issue should be typed (with double spacing) and should be mailed 
before February 28, 1971. Contributors (in the United States) who desire acknowledgment of 
E 2259. Proposed by J. R. Isbell, State University of New York at Buffalo


A group in which all uth powers commute with each other and all vth powers
commute with each other, u and v relatively prime, is abelian.


E 2260. Proposed by Marlow Sholander, Case Western Reserve University 


It is given for every real $n > 0$ that three times the area between $y = 1$ and
$y = x^n$, $0 \le x \le 1$, equals the area between $y = 1$ and $y = x^n$, $1 \le x \le a_n$. Find $\lim_{n \to 0} a_n$.


E 2261. Proposed by R. M. Meyer, SUNY College at Fredonia, N. Y.


Show that for each integer $m$, $0 \le m < n$, the function $L_m(p) = \sum_{k=0}^{m} \binom{n}{k} p^k (1 - p)^{n-k}$
is strictly decreasing for $0 < p < 1$.
E 2262. Proposed by G. J. Simmons and D. B. Rawlinson, Sandia Laboratories,
Albuquerque, N. M. 


For every k (> 1) there is a set of k positive integers whose sum and product
are equal. For k=2, 3, 4, 6, 24 the set is unique. Is it unique for any other k? 


E 2263. Proposed by Bernard McCabe, Bell Comm. Inc., Washington, D.C. 


Suppose an urn contains m balls, each a different color. An observer draws a 
ball at random, records its color, and replaces it in the urn. He repeats this pro- 
cedure until some color reappears for the first time. Show that the expected 
number of drawings is 

$$E(m) = \sum_{k=1}^{m} \frac{k(k + 1)\, m!}{(m - k)!\, m^{k+1}}$$
and determine the leading term in the asymptotic expansion. 


E 2264. Proposed by Erwin Just, Bronx Community College 


If $F_k$ is the kth Fibonacci number ($F_k = F_{k-1} + F_{k-2}$, $F_1 = F_2 = 1$), prove the
following congruence relations:


(a) Fin(1 —_ F413) = Fy, (1 —_ Fim+i) (mod 377). 


(b) 5(Fra + Fey1) = 2(—1)” [mod (Fis + Fras)]). 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Inequalities in the Products of Powers of an Ordered Set of Numbers 
E 2203 [1969, 1138]. Proposed by M. S. Klamkin, Ford Scientific Laboratory 


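It is known that if $0 \le x_1 \le x_2 \le \cdots \le x_n$ $(n \ge 3)$, then

$$x_1^{x_2} x_2^{x_3} \cdots x_n^{x_1} \ge x_2^{x_1} x_3^{x_2} \cdots x_1^{x_n}.$$

Are there any other nontrivial permutations $\{a_i\}$ and $\{b_i\}$ such that

$$a_1^{a_2} a_2^{a_3} \cdots a_n^{a_1} \ge b_1^{b_2} b_2^{b_3} \cdots b_n^{b_1}?$$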

Solution (adapted) by G. L. Watson, University College, London, England. 
For n =3 there is no other nontrivial permutation of the x; of the form required. 
For n=4, there are other solutions. For one such solution, note that x3/x121, 
x4/x3=1, Xx3—-X_0, x3—-%x%120 imply 


(1) (23/21) @3-*2) (x4/ 4) =) > 1, 
whence (upon multiplying both sides by x2%/x3"s) 
(2) wy We 4g Day ay 2 Wa - 


For n > 4, the possibilities increase rapidly. For example, with n = 5,


(1’) (x5/%2) (4-28) (75/44) (zp-z2) > { 
implies 
(2’) 04 Ky Ne gt, DS Kye We Ho Xa. 


Also solved by Haig Bohigian. 


A Limit Problem 
E 2204 [1969, 1138]. Proposed by A. C. Segal, University of Alabama, Bir- 
mingham, and Basil Lepp, Rust Engineering Co. 


Let $f(n) = \sum 1/k$, where $k$ has no zeros in its $n$-ary expansion. Prove or
disprove: $\lim_{n \to \infty} [f(n) - n \log n] = 0$.


Solution by N. J. Fine, Pennsylvania State University. The result is correct. 
Let S be the indicated set of integers. Then 


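$$S = \{1, 2, \ldots, n - 1\} \cup \bigcup_{x \in S} \{nx + 1, nx + 2, \ldots, nx + n - 1\}.$$

Hence

$$f(n) = \sum_{k=1}^{n-1} \frac{1}{k} + \sum_{x \in S} \sum_{k=1}^{n-1} \frac{1}{nx + k} = \sum_{k=1}^{n-1} \frac{1}{k} + \sum_{x \in S} \sum_{k=0}^{n-1} \frac{1}{nx + k} - \sum_{x \in S} \frac{1}{nx},$$

$$f(n) + \frac{1}{n} f(n) = \sum_{k=1}^{n-1} \frac{1}{k} + \sum_{x \in S} \sum_{k=0}^{n-1} \frac{1}{nx + k}.$$

Now for every positive integer $b$,

$$\frac{1}{b} = \log\left(\frac{b + 1}{b}\right) + \frac{1}{2}\left(\frac{1}{b} - \frac{1}{b + 1}\right) + O\left(\frac{1}{b^3}\right).$$

Putting $b = nx + k$ $(k = 0, 1, \ldots, n - 1)$ and summing, we get

$$\sum_{k=0}^{n-1} \frac{1}{nx + k} = \log\left(\frac{x + 1}{x}\right) + \frac{1}{2n}\left(\frac{1}{x} - \frac{1}{x + 1}\right) + O\left(\frac{1}{n^2 x^3}\right),$$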


fe - D+ D4 ¥ tog(“) + (= - ) 


2n wes \% a+ 1 
= gh Bin) Be


es r=] 2&8 
t>n 2>n 
17/1 1 1 1 1 1 
AEE-) tz) 
on x x+1 om x x+1 n? 
m>n 


1 1 1 1 
=togn = {= = tog(= )b += + 0(—) 
zesS VX x 2n n? 
1 1 1 1 1 1 
= | — —|{—— O| — —+0(— 
em" rel; —)t (= )} +54 (=) 


1 /il 1 1 /i 1 
=logn — —(_ _ (ae 
opm 2, > (- =) p> > (- =) 


n<z<n® 2>n? 


m-1 1/1 1 m1 1/1 1 
wir EME) BYE) 
em =>(- —) Lu 5 nx nei 


+5, +0(3) 
saab 4o(! =) +on 5 — +0(—) 


1 
=logn+ o(—). 
n 


Therefore $f(n) = n \log n + O(1/n)$, as required.
Also solved by D. M. Bloom, L. Carlitz, C. Gardner, and E. F. Schmeichel. 


Three Hyperbolas and a Quadrangle 


E 2206 [1969, 1138]. Proposed by R. C. Lyness, Blackpool, England 


Each of three hyperbolas has for its foci a different two of three noncollinear 
points. Each pair of hyperbolas has a set of six common chords. Show that three 
pairs of chords, a pair from each set, form the six lines of a quadrangle. Show, 
further, that the minor axes of the hyperbolas bisect the sides of a triangle whose 
vertices are three of the four points of the quadrangle. 


Solution by the proposer. Denoting distances from the foci by $r_1, r_2, r_3$, the
hyperbolic branches $r_1 - r_2 = c$, $r_3 - r_2 = a$ intersect where $r_1^2 = r_2^2 + 2cr_2 + c^2$ and
$r_3^2 = r_2^2 + 2ar_2 + a^2$ and

$$l_2 \equiv -ar_1^2 - (c - a)r_2^2 + cr_3^2 + ac(c - a) = 0.$$

This last is the equation of a straight line, and the intersection of $r_1 - r_2 = -c$
and $r_3 - r_2 = -a$ lies on it. So it is a common chord.

Since ). b(c-+a—b)l,=0, hi, b, ls concur. If —a is written for ain bh, we have 
I;, the chord joining the intersections of r1—re= +c and r3—re= Fa, and i, 
and /{ concur. The four points of the quadrangle are (J,, 41), r=1, 2, 3, and 
(ly, Jo, 73). 

At the point of concurrency of /,, 4, 4 we have r?—r2=c(a+b), and at the 
point of concurrency of Js, If, 1 we have ri—1r3 = —c(a-+b). Hence the minor axis 
r1=12 bisects the side (11/5, [{/.). 


Also solved by M. G. Greening (Australia) who establishes the result as a consequence of the 
following theorem which he proves using homogeneous coordinates: If three conics S,, S2, S3 are 
inscribed respectively in the quadrilaterals 42.IA3J, AsIAiJ, Ai1ITA2J, where no three of the points 
Ai, Ag, As, I, J are collinear, then there exist pairs of common chords of 1, So; Se, S33 S3, S; which 
form the six lines of a quadrangle. 

A. W. Walker refers to E. H. Neville (A focus-sharing set of three conics, Math. Gazette, vol. 
20, 1936, p. 182; also vol. 32, 1948, p. 184) who proves the first part of E 2206, and shows that if 
A, B, Care the diagonal points of the quadrangle, the directrices of one conic pass through B and C, 
those of another through C and A, and those of the third through A and B. The minor axes of the 
conics therefore bisect the sides of triangle ABC. 


A Bound for Another Integral 
E 2207 [1969, 1138]. Proposed by Anon, Erewhon-upon-Wabash 


Suppose $f(x, y)$ vanishes on the boundary of the square $S$: $0 \le x, y \le 1$, and
that $|\partial^4 f/\partial x^2 \partial y^2| \le B$. Prove that

$$\left| \iint_S f(x, y)\, dx\, dy \right| \le \frac{B}{144}.$$


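Solution by Ralph Jones, University of Wisconsin. Perhaps the simplest example
of a function satisfying the conditions of the problem is $g(x, y) = x(1 - x)y(1 - y)$,
for which

$$\frac{\partial^4 g}{\partial x^2 \partial y^2} = 4 \quad \text{and} \quad \iint_S g = \frac{1}{36} = \frac{4}{144}.$$

Since $f$ vanishes on the boundary of $S$, $\partial^2 f/\partial y^2$ vanishes on the vertical sides
of $S$. Four applications of integration by parts yield

$$4 \iint_S f = \iint_S f\, \frac{\partial^4 g}{\partial x^2 \partial y^2} = \iint_S g\, \frac{\partial^4 f}{\partial x^2 \partial y^2}.$$

Thus

$$\left| \iint_S f \right| = \frac{1}{4} \left| \iint_S g\, \frac{\partial^4 f}{\partial x^2 \partial y^2} \right| \le \frac{B}{4} \iint_S g = \frac{B}{144}.$$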


Also solved by L. Carlitz, W. M. Causey, J. P. Fink, R. H. Garstang, Doug Hanto, M. 
Hioselihorn (Scotland), David Hoitsma, R. A. Horn, Simeon Reich (Israel), Steve Rodhe, E. F. 
Schmeichel, P. C. Sinha (U.S.S.R.), Sid Spital, Feets Uneven, E. J. Wilkins, Jr., P. H. Young, 
R. L. Young, and the proposer. 

Several solvers note the similarity between this problem and E 2155 [1969, 1142], and are 
able to use the latter in their solution. 




A Locus of Centroids 


E 2209. Proposed by M.S. Klamkin, Ford Scientific Laboratory 


Determine the locus of the centroids of all triangles similar to a given tri- 
angle and inscribed in another triangle. 


Solution by Michael Goldberg, Washington, D.C. If a triangle of given shape 
grows so that the vertices trace fixed straight lines in the plane, then every point 
of the triangle will trace a straight line. 

If the fixed straight lines are the sides of the triangle ABC, and the variable
inscribed triangle is DEF, then its centroid P describes a straight line. However, 
there are some orientations for which the vertices of DEF cannot be confined to 
the straight line segments of the triangle ABC. Also, as DEF is turned, the mo- 
tion of the vertices will change direction as they shift from one line of ABC to 
another line of ABC. Hence, the complete locus of the centroid consists of three 
straight line segments, shown in dotted lines in the figure. 


[Figure: triangle ABC with the locus of the centroid shown in dotted lines.]


If, in addition to direct symmetry, reflected symmetry is acceptable, then 
three more straight line segments are to be added to the locus of the centroid. 
Also solved by Anders Bager (Denmark), Jordi Dou (Spain), Daniel Pedoe, A. W. Walker,
Mark Yu, and the proposer. 


Several solvers called attention to the well-known underlying theorem which can be found in 
Peterson’s text, also Johnson’s, and elsewhere. 


Tea for Two 


E 2210 [1970, 79]. Proposed by R. N. Lloyd, University of Nottingham, Eng- 
land 


While the Dormouse lay sleeping in the middle of the table, the Mad Hatter 
and the March Hare sat down for tea. A number of places had been laid, and 
the Hatter and the Hare each moved alternately to a seat next to his old one. 
Each changed his direction of movement when, and only when, either the other 
had just changed his or else so as to avoid their both sitting on the same chair. 
Tea finished when both the Hatter and the Hare were again sitting in their ini- 
tial positions. After how many moves was the tea-party over? 


I. Solution by R. W. Sielaff, Naperville, Ill. and G. P. Smith (undergraduate), 
MacMurray College, Jacksonville, Ill. (A composite by the editors.) Let n be the 
number of places that were laid, n>2. If both the Mad Hatter and the March 
Hare move in the same direction, each will require n moves to regain his initial
position since there will be no occasion for either to reverse direction.
Consequently, the tea-party will be over after 2n moves.

Suppose then that they move in opposite directions and that there are k 
places between them in the direction of motion. To fix ideas, we assume without 
loss of generality that the Hatter moves first in the clockwise direction. There 
are three cases. 


Case I. Suppose k is even. After k/2 moves by each, they will be seated next
to one another and each will reverse his direction of movement. After k/2 further
moves by each, they will both have regained their initial places. Consequently 
the tea-party will be over after 2k moves. 


Case II. Suppose $k$ is odd and $n$ even. After $(k-1)/2$ moves by each, they 
will be separated by one place. After the next move by the Hatter, they will be 
seated next to one another, and each will reverse his direction of movement, the 
Hare first and then the Hatter. After $(n-2)/2$ additional moves by each, they 
will again be seated next to one another and each will reverse his direction of 
movement again, with the Hare reversing direction first and the Hatter following 
suit. After $(n-2)/2-(k-1)/2-1$ further moves by each, the Hatter will 
be in his initial position, having required a total of 
$(k-1)/2+1+(n-2)/2+(n-2)/2-(k-1)/2-1=n-2$ moves in all, whereas the Hare will require 
one further move to return to his initial place. That is, the Hare takes a total of 
$(k-1)/2+(n-2)/2+(n-2)/2-(k-1)/2=n-2$ moves in all. Consequently 
the tea-party will be over after $2n-4$ moves. 


Case III. Suppose both $k$ and $n$ odd. After $(k-1)/2$ moves by each, they 
will be separated by one place. After the next move (by the Hatter), they will 
be seated next to one another and each will reverse his direction of movement, 
with the Hare reversing first and then the Hatter. After $(n-3)/2$ additional 
moves by each, they will again be separated by one place. After the next move 
(by the Hare), they will be seated next to one another, and it will be the Hatter's 
turn to move. After $(n-k-2)/2$ more moves by each, the Hatter and the Hare 
will each be one place in a clockwise direction from their original seats and, 
further, each will be moving in the direction he was moving originally. This is 
because the Hatter will have made a total of $(k-1)/2+1+(n-k-2)/2=(n-1)/2$ 
moves in a clockwise direction and $(n-3)/2$ moves in a counterclockwise 
direction, whereas the Hare will have made a total of $(k-1)/2+(n-k-2)/2=(n-3)/2$ 
moves in a counterclockwise direction, and a total of $(n-3)/2+1$ 
moves in a clockwise direction. Therefore $n$ of these cycles will bring them both 
home moving in the original directions. However, during the last cycle, each one 
touches home at the same time while moving in the opposite direction, continues 
on for $(n-k-2)/2$ further moves, reverses, and returns in $(n-k-2)/2$ moves 
to finish the cycle. Hence they both return home for the first time (before the 
end of the last cycle) after a total of 
$2n\big((k-1)/2+1+(n-3)/2+(n-k-2)/2\big)-4\big((n-k-2)/2\big)=2(n-1)(n-2)+2k$ moves. 
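
The closing arithmetic of Case III can be checked mechanically. The sketch below is only a check of the algebra, not a simulation of the tea-party; the function names are illustrative and not part of the original solution. It rebuilds the total from the per-cycle counts quoted above and compares it with the closed form $2(n-1)(n-2)+2k$ for odd $n$ and odd $k$.

```python
from fractions import Fraction

def case_three_total(n, k):
    """Total number of moves in Case III, assembled from the counts in the text."""
    half = Fraction(1, 2)
    # Moves made by each player during one full cycle (this simplifies to n - 2).
    per_player = (k - 1) * half + 1 + (n - 3) * half + (n - k - 2) * half
    # n cycles, two players, minus the unfinished tail of the last cycle.
    return 2 * n * per_player - 4 * ((n - k - 2) * half)

def closed_form(n, k):
    """The closed form stated at the end of Case III."""
    return 2 * (n - 1) * (n - 2) + 2 * k

if __name__ == "__main__":
    for n in range(3, 12, 2):
        for k in range(1, n - 1, 2):
            print(n, k, case_three_total(n, k), closed_form(n, k))
```

Exact rational arithmetic is used so that the halves in the text never introduce rounding.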


II. Solution by D. G. Beane and E. F. Schmeichel, the College of Wooster. If 
the party starts with exactly two chairs separating the Hatter and the Hare in 
either direction, and the first move of each is onto these chairs, the party ends 
in four moves. Otherwise there are two possible outcomes. If the first move of 
each is clockwise or the first move of each is counterclockwise, the party ends 
in 2m moves (m denoting the number of places) since there is never a change of 
direction. If the first move of one is clockwise and the first move of the other is 
counterclockwise, they eventually occupy adjacent places. The next move of 
each is then back to the place which he previously occupied, and the following 
move of each again results in their occupying adjacent positions. This alterna- 
tion will continue indefinitely, and the party may still be going on. 


Also solved by Anders Bager (Denmark), Michael Goldberg, R. M. Ingle, S. Wu-Wei Liu, 
Norman Miller, Rik S. Moody, Paul Payne, Roger Weitzenkamp, and the proposer. 

Editorial Comment. As the reader may have gathered from the two conflicting solutions, there 
was some confusion as to the proper interpretation of the “reversing rules” as given in the statement 
of the problem. The proposer (and your editors) were looking for a solution in the spirit of Solution 
I above; that is, the reversing rules should read (i) reverse if necessary to prevent a collision, and 
(ii) reverse in direct response to a reversal of type (i) by the other player. The statement of the 
problem unfortunately does not eliminate (iii) reverse in direct response to a reversal of type (ii) by 
the other player, (iv) reverse in direct response to a reversal of type (iii) by the other player, and so 
on. Allowing these, we are led to the alternating solutions as presented in Solution II. 

The solvers were split about half and half between the two interpretations. 

ADVANCED PROBLEMS 


All solutions of Advanced Problems should be sent to J. Barlaz, Rutgers—The State University, 
New Brunswick, NJ 08903. Solutions of Advanced Problems in this issue should be 
typed (with double spacing) on separate, signed sheets and should be mailed before February 
28, 1971. Contributors (in the United States) who desire acknowledgement of receipt of their 
solutions are asked to enclose self-addressed, stamped postcards. 


An asterisk (*) means neither the proposer nor the editors supplied a solution. 


5759. Proposed by Robert Raphael, McGill University 


Let S be a commutative ring with unity, and let R be a subring of S contain- 
ing the unity of S. Assume furthermore that S has no nilpotent elements (other 
than 0), and that each element of S satisfies a monic polynomial equation with 
coefficients from R. Recall that a ring is regular (in the sense of von Neumann) 
if for each x there is a y such that $x=x^2y$. Show that if R is regular then S is 
regular as well. 


5760.* Proposed by D. K. Kosaka, Dallas, Texas 


Without using the axiom of choice, or equivalent, decompose the unit inter- 
val [0, 1] into an uncountable class of disjoint sets, each of which is uncountable 
and everywhere dense in the unit interval [0, 1]. 


5761.* Proposed by A. S. Adikesavan, Regional Engineering College, Tiruchirapalli, India 

Define $u(x)$ and $v(x)$ by the relation $\{u(x)\}^n+\{v(x)\}^n=1$, where $n\ge 2$ is an 
integer, $u(0)=0$, $v(0)=1$, and $du(x)/dx=v(x)$. Determine $u(x+y)$ in terms of 
$u(x)$, $u(y)$, $v(x)$ and $v(y)$. 

5762. Proposed by D. A. Zave, UNIVAC, Roseville, Minn. 


Let $X$ be a real or complex normed linear space with norm $\|\cdot\|$. Let $a_i\ne 0$, 
$i=1,\dots,n$, be scalars and let $p\ge 1$. If $b_1,\dots,b_n$ and $s$ are fixed elements of 
$X$, compute 

\[ \inf\Big\{\sum_{i=1}^{n}\|a_ix_i+b_i\|^p : x_1+\cdots+x_n=s\Big\}. \]


5763.* Proposed by J. T. Rosenbaum, University of Pittsburgh 


For any set $S$ of reals, call the convergent series $\sum a_n$ an $S$ series if for each 
$\epsilon>0$ there exists a sequence $\{I_n\}$ of open intervals covering $S$ with $|I_n|\le\epsilon a_n$, 
$n=1,2,\dots$. Find a Cantor set series (see Problem 5665 [1970, 411]). Is there 
an $S$ series for all $S$ of measure 0? Is there a universal series? 


5764. Proposed by M. L. Glasser, Battelle Institute, Columbus, Ohio 
Evaluate the limit 

\[ L=\lim_{\alpha\to\pi}\,\sin\alpha\int_0^\infty\frac{\sinh(\gamma x)}{\sinh(\pi x)}\,\frac{dx}{\cosh x+\cos\alpha},\qquad |\operatorname{Re}\gamma|<\pi+1. \]

SOLUTIONS OF ADVANCED PROBLEMS 
Groups with a Cyclic Center 


5689 [1969, 947]. Proposed by Frank DeMeyer, Colorado State University 


Let G be a finite group with cyclic center Z and assume G/Z is abelian. Show 
$G/Z\cong H\times H$ (direct product) for some abelian group $H$. 


Solution by C. R. B. Wright, University of Oregon. We prove the theorem: 
Let $G$ be a finitely generated group with center $Z$ and cyclic commutator group 
$G'$. If $G'\subseteq Z$, then $G/Z\cong H\times H$ for some group $H$. 

For each $x$ in $G$ the map $\rho_x: y\mapsto[x,y]=x^{-1}y^{-1}xy$ is a homomorphism of $G$ 
onto $[x,G]$ with kernel $C(x)$. Choose $a$ and $b$ in $G$ to maximize the group $\langle[a,b]\rangle$. 
Then for some $g$, $[a,G]=G\rho_a=\langle g\rho_a\rangle$, since $G'$ is cyclic, and therefore $[a,b]$ is in 
$\langle[a,g]\rangle$. Thus, and similarly, $\langle[a,b]\rangle=[a,G]=[b,G]$. Let $A=\langle a\rangle Z/Z$, $B=\langle b\rangle Z/Z$, 
and for each $g$ in $G$ let $g^{\perp}=C(g)/Z$. 

Since $\rho_a$ induces a homomorphism of $G/Z$ onto $[a,G]$ with kernel $a^{\perp}$, and 
since $G\rho_a=\langle b\rangle\rho_a$, $G/Z=a^{\perp}B$ and similarly $a^{\perp}=(a^{\perp}\cap b^{\perp})A$. Thus 
$G/Z=(a^{\perp}\cap b^{\perp})AB$. 

If $|[a,b]|$ is infinite, then $A/A\cap b^{\perp}$ is infinite, and $A\cap b^{\perp}=1$. If $|[a,b]|$ is 
finite, then $G'$ is finite and for each $g$, $A\cap g^{\perp}$ has index $|[a,g]|$ in $A$ and hence 
contains $A\cap b^{\perp}$, the subgroup of index $|[a,b]|$ in $A$. In this case, too, 
$A\cap b^{\perp}=A\cap G^{\perp}=1$. By symmetry, $B\cap a^{\perp}=1$. In particular, $A\cap B=1$. Also by the 
modular law, $AB\cap a^{\perp}\cap b^{\perp}=A(B\cap a^{\perp})\cap b^{\perp}=1$. Altogether, 
$G/Z=A\times B\times(a^{\perp}\cap b^{\perp})$, with $A\cong[a,G]=[b,G]\cong B$. 

Since the center of $C(a)\cap C(b)$ is central in $G$, $C(a)\cap C(b)$ satisfies the 
hypotheses of the theorem. Repetition of the above argument a finite number of 
times leads to the required decomposition of $G/Z$. 


Also solved by Kevin Brown, M. G. Greening (Australia), C. Y. Tang, and the proposer. 


Projective R-modules 
5695 [1969, 1074]. Proposed by Anon, Erewhon-upon-Wabash. 


Let $R=Z[a_1,a_2,a_3,b_1,b_2,b_3]$ with the single relation $a_1b_1+a_2b_2+a_3b_3=1$. Let 
$M$ be the $R$-module generated by $x_1, x_2, x_3$ with the single defining relation 
$b_1x_1+b_2x_2+b_3x_3=0$. Prove that $M$ is not a free module, but is projective. 


I. Solution by G. J. Janusz, University of Illinois. We interpret $R$ as a 
noncommutative ring, and $M$ as a left $R$-module. Then the problem is incorrect 
as stated. The result is true if the first of the defining relations is changed to 
read $b_1a_1+b_2a_2+b_3a_3=1$. I shall first show the result false in its original form 
and then present a proof of the corrected version. 

Let $F$ be the free left $R$-module with generators $X_1, X_2, X_3$ and let $\tau$ be the 
homomorphism of $F$ onto $M$ such that $\tau(X_i)=x_i$. The definition of $M$ implies 
$\operatorname{Ker}\tau=RY$, where $Y=b_1X_1+b_2X_2+b_3X_3$. The module $M$ is projective if and 
only if $RY$ is an $R$-direct summand of $F$. 

Suppose $M$ is projective. Let $\alpha$ be some projection of $F$ onto $RY$ such that 
$\alpha$ is the identity on $RY$. Then $\alpha(X_i)=r_iY$ for some $r_i$ in $R$. It follows that 

\[ Y=\alpha(Y)=\sum b_i\alpha(X_i)=\sum b_ir_iY. \]

After we express $Y$ in terms of the $X_j$, we find that $(1-\sum b_ir_i)b_j=0$. 

We now appeal to P. M. Cohn, Some remarks on the invariant basis property, 
Topology, v.5 (1966). The corollary on page 224 says this ring $R$ has no zero 
divisors. (R is isomorphic to U3, in Cohn's notation.) Thus $1=\sum b_ir_i$. This 
relation, however, cannot be a consequence of the single relation $\sum a_ib_i=1$; 
this argument can be made precise by using the notation of reduced forms below. 

Now if the defining relation for $R$ is changed to read $\sum b_ia_i=1$, define $\alpha$ by 
$\alpha(X_i)=a_iY$. Then $\alpha^2=\alpha$ and $\operatorname{Im}(\alpha)=RY$, hence $RY$ is an $R$-direct summand 
of $F$, and $M$ is projective over $R$. 

To show that $M$ is not free, we must again refer to Cohn's paper for the 
assertion that the ring $R$ has the property that the number of free generators of 
a free $R$-module is an invariant of the module. Thus we suppose that $M$ is free, 
and after replacing $M$ with an isomorphic copy we find $F=M\oplus RY$ and $M$ is 
free on two generators $Y_1, Y_2$. We know $M=\operatorname{Ker}\alpha$, where $\alpha(X_i)=a_iY$, so $M$ is 
generated by the elements $X_i-a_iY$. If we write $Y_3$ for $Y$, then we obtain equations 

\[ X_i=\sum_j r_{ij}Y_j,\qquad Y_i=\sum_j s_{ij}X_j, \]

where all subscripts range from 1 to 3, $r_{ij}$ and $s_{ij}$ are in $R$, and $r_{i3}=a_i$, $s_{3j}=b_j$. 
Let $A$ and $B$ denote the matrices $\|r_{ij}\|$ and $\|s_{ij}\|$ respectively. Then $AB=BA=I$. 

To proceed from here it will be necessary to express elements of $R$ in some 
normal form. Any element of $R$ is a sum of monomials involving the $a_i$ and $b_i$, 
each multiplied by some integer coefficient. An element $r$ will be called reduced 
if no monomial with nonzero coefficients contains the product $b_1a_1$. By using 
the results of J. Shepherdson: Inverses and zero divisors in matrix rings, Proc. 
London Math. Soc., 1 (1951) 71-85, it can be shown that each element is equal 
to a reduced element (eliminate $b_1a_1$ by using $\sum b_ia_i=1$) and this expression 
is unique. 

As a consequence of $BA=I$, it follows that 

\[ 0=\sum_j b_jr_{jk}\quad\text{for } k=1 \text{ or } 2. \]

We assume all terms $r_{ij}$ are in reduced form; then $b_2r_{2k}+b_3r_{3k}$ is also reduced, 
and hence $b_1r_{1k}$ must not be reduced. For if $b_1r_{1k}$ were also reduced, then so 
would be the sum; but a reduced term $b_1r_{1k}$ cannot equal a reduced expression 
$-b_2r_{2k}-b_3r_{3k}$ unless all terms are zero, whereas the $r_{jk}$ cannot all be zero. 

Since $b_1r_{1k}$ is not reduced, we find $r_{1k}=a_1w_k$ for some $w_k$ in $R$. It now follows 
that the matrix $A$ can be written as a product $DC$ where $D=\operatorname{diag}\{a_1,1,1\}$ and 
$C$ has the same second and third row as $A$, while row 1 of $C$ is $(w_1,w_2,1)$. Now 
the equation $DCB=I$ implies that there is some (reduced) element $d$ in $R$ such 
that $a_1d=1$. This is impossible since whenever $d$ is reduced, so is $a_1d$. This 
contradiction is a consequence of the assumption that $M$ is free. 


II. Solution by the proposer. We are dealing with modules over a commutative 
ring. Let $F$ be the free $R$-module with basis $X_1, X_2, X_3$, and set 

\[ Y_0=b_1X_1+b_2X_2+b_3X_3,\qquad Y_1=a_2X_3-a_3X_2, \]
\[ Y_2=a_3X_1-a_1X_3,\qquad Y_3=a_1X_2-a_2X_1. \]

Let $G=RY_0\subset F$ and $H=RY_1+RY_2+RY_3\subset F$. Since $a_1Y_0=X_1+b_2Y_3-b_3Y_2$, 
etc., $G+H=F$. If $c_0Y_0=c_1Y_1+c_2Y_2+c_3Y_3\in G\cap H$, then $c_0b_1=c_2a_3-c_3a_2$, 
$c_0b_2=c_3a_1-c_1a_3$, $c_0b_3=c_1a_2-c_2a_1$. Multiply by $a_1, a_2, a_3$ and sum: $c_0=0$; hence 
$c_0Y_0=0$, $G\cap H=0$, $F=G\oplus H$. But $M=F/G\cong H$, hence $M$ is a direct summand 
of a free module and therefore is a projective module. (See H. Flanders, Trans. 
A.M.S., 145 (1969) p. 366 for another proof.) 
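
The two computations behind $G+H=F$ and $G\cap H=0$ can be spot-checked numerically. The sketch below is a hedged illustration only. It specializes the $a_i$, $b_i$ to integers with $a_1b_1+a_2b_2+a_3b_3=1$ (an assumption made purely for testing; the proof itself works over the polynomial ring), represents elements of $F$ by coefficient triples, and uses function names that are not part of the source.

```python
def basis_vectors(a, b):
    """Coefficient triples (on X1, X2, X3) of Y0, Y1, Y2, Y3 as defined in the text."""
    a1, a2, a3 = a
    b1, b2, b3 = b
    y0 = (b1, b2, b3)      # Y0 = b1 X1 + b2 X2 + b3 X3
    y1 = (0, -a3, a2)      # Y1 = a2 X3 - a3 X2
    y2 = (a3, 0, -a1)      # Y2 = a3 X1 - a1 X3
    y3 = (-a2, a1, 0)      # Y3 = a1 X2 - a2 X1
    return y0, y1, y2, y3

def combo(coeffs, vectors):
    """Integer linear combination of coefficient triples."""
    return tuple(sum(c * v[i] for c, v in zip(coeffs, vectors)) for i in range(3))

def recovered_generators(a, b):
    """X1, X2, X3 rebuilt from the Y's: X1 = a1 Y0 - b2 Y3 + b3 Y2, and cyclically."""
    a1, a2, a3 = a
    b1, b2, b3 = b
    y0, y1, y2, y3 = basis_vectors(a, b)
    x1 = combo((a1, b3, -b2), (y0, y2, y3))
    x2 = combo((a2, b1, -b3), (y0, y3, y1))
    x3 = combo((a3, b2, -b1), (y0, y1, y2))
    return x1, x2, x3

def pairing(a, v):
    """'Multiply by a1, a2, a3 and sum': kills Y1, Y2, Y3 and sends Y0 to 1."""
    return sum(ai * vi for ai, vi in zip(a, v))

if __name__ == "__main__":
    a, b = (2, 3, 5), (1, 3, -2)          # 2*1 + 3*3 + 5*(-2) = 1
    print(recovered_generators(a, b))
    print([pairing(a, y) for y in basis_vectors(a, b)])
```

The pairing is the step that forces $c_0=0$: applied to both sides of $c_0Y_0=c_1Y_1+c_2Y_2+c_3Y_3$ it gives $c_0$ on the left and 0 on the right.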

It seems much harder to prove algebraically that $M$ is not free. In Flanders 
this is done by first identifying $a_i=b_i$ and then tensoring into the reals. The 
assertion then follows from the nonexistence of a nowhere zero (algebraic) 
vector field on $S^2$. A completely algebraic proof would be desirable. 

Note that the module $M=Rx_1+Rx_2$ is free; here $b_1x_1+b_2x_2=0$, 
$R=Z[a_1,a_2,b_1,b_2]$, $a_1b_1+a_2b_2=1$. A basis is $y=a_2x_1-a_1x_2$. The analogous case 
$M=Rx_1+\cdots+Rx_4$ is open. Note that the specialization 

\[ R=Z[a_0,a_1,a_2,a_3],\qquad a_0^2+\cdots+a_3^2=1, \]
\[ M=Rx_0+\cdots+Rx_3,\qquad a_0x_0+\cdots+a_3x_3=0 \]

yields a free module $M$. The usual quaternion argument ($S^3$ is parallelizable) 
gives the basis 

\[ y_1=-a_1x_0+a_0x_1-a_3x_2+a_2x_3,\qquad y_2=-a_2x_0+a_3x_1+a_0x_2-a_1x_3, \]
\[ y_3=-a_3x_0-a_2x_1+a_1x_2+a_0x_3. \]


Integral of a Uniform Sum 


5700 [1969, 1074]. Proposed by R. E. Chandler and R. A. Struble, North 
Carolina State College 


Let $\{x_n\}_{n=1}^{\infty}$ be an enumeration of the rational numbers in (0, 1). For 
$x\in(0,1)$ define $f(x)=\sum 1/2^n$, where the summation is over all $n$ for which 
$x_n<x$. Evaluate $\int_0^1 f(x)\,dx$. 


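Solution by W. H. Ruckle, Clemson University. For each $n$ let 

\[ g_n(x)=\begin{cases}0, & x\le x_n,\\ 1/2^n, & x>x_n.\end{cases} \]

Then $\sum_{n=1}^{\infty}g_n$ converges uniformly to $f$. So 

\[ \int_0^1 f(x)\,dx=\sum_{n=1}^{\infty}\int_0^1 g_n(x)\,dx=\sum_{n=1}^{\infty}(1-x_n)/2^n. \]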
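
As a sanity check on the term-by-term integration, the finite truncation $f_N=\sum_{n\le N}g_n$ is a step function whose integral can be computed exactly. The sketch below assumes one particular enumeration of the rationals (by increasing denominator), which is only an illustrative choice since the problem allows any enumeration; the function names are likewise illustrative.

```python
from fractions import Fraction
from math import gcd

def rationals(count):
    """First `count` rationals in (0, 1), listed by increasing denominator."""
    out = []
    q = 2
    while len(out) < count:
        for p in range(1, q):
            if gcd(p, q) == 1 and len(out) < count:
                out.append(Fraction(p, q))
        q += 1
    return out

def step_integral(xs):
    """Integrate f_N(x) = sum of 2^-n over x_n < x on (0, 1), piece by piece."""
    weights = {x: Fraction(1, 2 ** (n + 1)) for n, x in enumerate(xs)}
    points = sorted(xs) + [Fraction(1)]
    total = Fraction(0)
    height = Fraction(0)
    for left, right in zip(points, points[1:]):
        height += weights[left]           # f_N jumps by 2^-n just after x_n
        total += height * (right - left)  # f_N is constant on (left, right)
    return total

def series_formula(xs):
    """Partial sum of (1 - x_n) / 2^n, the value claimed in the solution."""
    return sum((1 - x) * Fraction(1, 2 ** (n + 1)) for n, x in enumerate(xs))

if __name__ == "__main__":
    xs = rationals(15)
    print(step_integral(xs), series_formula(xs))
```

Because both sides are exact rationals, agreement here is an identity for each finite $N$; uniform convergence is what lets the solution pass to the limit.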

Also solved by R. G. Bilyeu, David Burdick, David Burman, L. Carlitz, W. D. Causey, Red 
Cougar, D. Z. Djoković, R. J. Driscoll, G. B. Feissner, Neal Felsinger, Robert Fefferman, N. J. 
Fine, Addison Fischer, T. E. Gantner, Leon Gerber, Rudolf Gorenflo (Germany), M. L. T. Hautus 
(Germany), D. A. Hejhal, G. A. Heuer, M. Hirshorn, K. E. Hirst (England), O. P. Kapoor & G. D. 
Lakhani (India), J. A. Kelingos, M. L. Klasi, R. A. Kopas, Frank Kost & William Weller, J. R. 
Kuttler, E. S. Langford, Douglas Lind, D. H. Lorenz & J. G. Simmonds, M. D. Mavinkurve 
(India), M. H. Moore, Roy Olson, Ed Packel, T. M. Phillips, J. T. Rosenbaum, C. L. Sabharwal & 
M. D. Fraser & J. T. McEwen, E. F. Schmeichel, A. C. Segal, D. Hammond Smith (England), 
Charles Vanden Eynden, M. S. Waterman, and the proposers. 


Editorial Note. Hejhal proves the result in the form of the following abstraction: Let $\{a_n\}_{n=1}^{\infty}$ 
be a sequence in (0, 1) and let $\mu$ be a totally finite Borel measure on (0, 1). Set $Q(x)=\mu(x,1)$, 
$0<x<1$. Let $\sum_{n=1}^{\infty}g_n$ be a real, absolutely convergent infinite series. Finally, set $F(x)=\sum g_n$, 
where the summation is over all $n$ such that $a_n<x$, and $0<x<1$. Then: 

(i) $Q$ is monotonic decreasing, right continuous, and Borel measurable; 

(ii) $F$ is bounded and Borel measurable; and 

(iii) $\int_0^1 F\,d\mu=\sum_{n=1}^{\infty}g_nQ(a_n)$. 


Smith, Driscoll, Lorenz, Simmonds, and Waterman note that the set of possible values of the 
integral is dense in (0, 1). But an unanswered question is whether the set of values consists of 
every number in (0, 1). 


A Diophantine Equation 
5703 [1969, 1152]. Proposed by Erwin Just, Bronx Community College 


If $n>1$ and $k$ is any integer, can there exist solutions to the Diophantine 
equation: 

\[ \sum_{i=1}^{2^{n+1}-1}\big(x_i^{2^n}-y_i^{2^n}\big)=(2k+1)2^{n+1}\,? \]

Solution by E. F. Schmeichel, Itasca, Illinois. We answer negatively. (All 
congruences are modulo $2^{n+2}$.) It is easily verified that $a^{2^n}\equiv 0$ or 1 according as 
$a$ is even or odd. So if all the $x_i$, $y_i$ are integers, we have 

\[ x_i^{2^n}-y_i^{2^n}\equiv -1,\ 0,\ \text{or } 1\quad\text{for all } i. \]

Thus 

\[ S=\sum_{i=1}^{2^{n+1}-1}\big(x_i^{2^n}-y_i^{2^n}\big)\equiv d, \]

where $|d|\le 2^{n+1}-1$. But this means 

\[ k\cdot 2^{n+2}-(2^{n+1}-1)\le S\le k\cdot 2^{n+2}+(2^{n+1}-1) \]

for some integer $k$, or $(2k-1)2^{n+1}<S<(2k+1)2^{n+1}$. Since $2k-1$ and $2k+1$ 
are consecutive odd integers, no integer solution for the equation is possible. 
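
The congruence on which the solution rests, $a^{2^n}\equiv 0$ or $1 \pmod{2^{n+2}}$ for $n>1$, is easy to confirm by brute force for small $n$. The sketch below is only a spot check over a finite range (the range `limit` and the function name are arbitrary choices, not part of the solution); it also shows why the hypothesis $n>1$ is needed.

```python
def power_residues(n, limit=200):
    """Set of residues of a**(2**n) modulo 2**(n+2) for |a| <= limit."""
    modulus = 2 ** (n + 2)
    return {pow(a, 2 ** n, modulus) for a in range(-limit, limit + 1)}

if __name__ == "__main__":
    for n in range(1, 7):
        print(n, sorted(power_residues(n)))
```

For $n=1$ the residue 4 also appears (for example $2^2=4$ modulo 8), so the argument does not apply there.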


Also solved by Merrill Barnebey, Simeon Reich (Israel), David Spear, Charles vanden Eyn- 
den, and the proposer. 

A Linear Differential Equation 


5705 [1969, 1152]. Proposed by M. S. Klamkin, Ford Scientific Laboratory 
Find the general solution of the differential equation 


\[ [xD^{n+1}+2nD^n-xD-n]\,y=0. \]


Solution by Robert Heller, Mississippi State University. It can be shown by 
induction that $xD^n+nD^{n-1}=D^nx$, from which it follows that the given equation 
may be written $(D^n-1)(xD+n)y=0$. Hence 

\[ (xD+n)y=\sum_{k=1}^{n}c_ke^{\alpha_kx}, \]

where $\alpha_1,\dots,\alpha_n$ are the $n$ distinct $n$th roots of unity. Multiplication by $x^{n-1}$ 
gives 

\[ D(x^ny)=\sum_{k=1}^{n}c_kx^{n-1}e^{\alpha_kx}. \]

Repeated integration by parts shows that 

\[ x^ny=c_0+\sum_{k=1}^{n}c_ke^{\alpha_kx}\sum_{p=1}^{n}(-1)^{p-1}\frac{(n-1)!}{(n-p)!}(\alpha_kx)^{n-p}. \]

Taking limits of both sides as $x\to 0$, we see that although solutions $y$ may 
exist on $(-\infty,\infty)$, such solutions are not expressed with precisely $n+1$ arbitrary 
constants. On $(-\infty,0)$ or on $(0,\infty)$, we have the general solution 

\[ y=c_0x^{-n}+x^{-n}\sum_{k=1}^{n}c_ke^{\alpha_kx}\sum_{p=1}^{n}(-1)^{p-1}\frac{(n-1)!}{(n-p)!}(\alpha_kx)^{n-p}. \]


Also solved by I. N. Baker (England), M. T. Bird, Peter Brady, Jr., D. R. Breach (New 
Zealand), J. H. E. Cohn (England), Michael Deakin (New Guinea), Robert Desko, J. Gillis 
(Israel), K. K. Gorowara, Melvin Henriksen, J. M. Horner, O. P. Lossers (Netherlands), D. E. 
Myers, B. S. Popov (Yugoslavia), C. L. Sabharwal, St. Olaf College Students, L. E. Ward, Sr., 
J. H. Webb (South Africa), J. E. Wilkins, Jr., and the proposer. 


A Closure Property and Hamel Bases 
5706 [1969, 1152]. Proposed by J. H. B. Kemperman, University of Rochester 


Let $H$ be a Hamel basis of a field $R$ over a subfield $Q$. Show that for each 
$a\in R$, $a\ne 1$, there exists an element $x\in H$ with $ax\notin H$. 


Solution by Leroy F. Meyers, Ohio State University. For fixed $a\ne 1$, let 
$x\in H$ imply $ax\in H$. For each $b=\sum_{x\in H}b_xx\in R$ with $b_x\in Q$ and $\{x\mid b_x\ne 0\}$ finite, 
set $f(b)=\sum_{x\in H}b_x\in Q$. Then $f$ is linear over $Q$, and $f(ab)=f(b)$ (since $aH\subseteq H$). 
With $x\in H$ and $b=x/(a-1)$ we have $1=f(x)=f(ab)-f(b)=0$, a contradiction. 

Also solved by A. K. Charnow, E. J. Eckert, K. E. Eldridge, N. J. Fine, D. A. Hejhal, M. L. 
Laplaza (Puerto Rico), M. A. Mavinkurve (India), P. van der Steen (Netherlands), Bertram 
Walsh, and the proposer. 

Editorial Notes. The proposer and the solver note that this problem is an extension of 5538 
(1968, 916). Eldridge and Walsh observe that this result does not depend on the commutativity of 
Q and hence remains true for division rings. 

VECTOR PROOFS IN SOLID GEOMETRY 
M. S. KLAMKIN, Ford Motor Company 


1. Introduction. In this paper, we give vector proofs of a number of theorems 
in solid geometry. Although, as to be expected, one can usually give synthetic 
proofs, one should always be prepared to use any alternate representations, 
especially if they are simpler. In many cases, it will turn out that not only is the 
vector approach simpler, but it is also more direct. Admittedly, however, in a 
great many other cases, the vector approach will not be quite as simple, but it 
still may be more direct. This does not mean that the author eschews the use of 
synthetic geometry, in fact the reverse is true. However, the main difficulty 
with elegant geometric solutions is that frequently there is no general method to 
indicate the first few proper steps or constructions which should be made. But 
once the first few proper steps have been made, the rest of the solution is usually 
very easy and one sees what is behind the problem. This is probably one of the 
reasons why analysts and algebraists tend to shun geometry; they have not 
given themselves enough practice to acquire the necessary geometric intuition 
to make the first few proper steps. Contrast this with analytic geometry. Here 
the first few steps are usually routine. You coordinatize everything in sight and 
then write down the appropriate equations. However, the subsequent equation 
solving can often be tedious, extremely difficult or unmanageable. On the other 
hand, vector proofs frequently appear to “lie between” synthetic and analytic 
proofs in regards to both simplicity and directness. The point to observe here is 
that one should be ready to use any representation, be it synthetic geometry, 
analytic geometry, analysis, vectors, algebra, etc., which lead to results. An 
exception to this, of course, is when one is learning some particular representa- 
tion. This point is explored much further with many illustrative examples in [1] 
and [2]. 

Most of the theorems given here are known results. However, some of the 
proofs are believed to be new. Although some of them are not simpler than their 
synthetic counterparts, they still can be used to provide nontrivial and, we hope, 
interesting exercises to be used in classes on vector analysis. A number of 
these results are inequalities and, as to be expected, their proofs will involve 
either the triangle or Cauchy's inequality. 


2. Angles in tetrahedra. 


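THEOREM 1: The sum of the measures of any two face angles of a trihedral 
angle is greater than the measure of the third face angle [3, p. 65]. 


Proof: Since $\cos x$ is decreasing in $[0,\pi]$, we equivalently have to show that 

\[ \cos AOC>\cos AOB\cos BOC-\sin AOB\sin BOC \quad\text{(see Fig. 1).} \]

Vectorially, this is equivalent to establishing the inequality 

\[ (\mathbf B\cdot\mathbf B)(\mathbf A\cdot\mathbf C)>(\mathbf B\cdot\mathbf C)(\mathbf B\cdot\mathbf A)-|\mathbf B\times\mathbf C|\,|\mathbf B\times\mathbf A|. \]

Since 

\[ (\mathbf B\cdot\mathbf C)(\mathbf B\cdot\mathbf A)-(\mathbf B\cdot\mathbf B)(\mathbf A\cdot\mathbf C)=\mathbf B\cdot\{\mathbf C(\mathbf B\cdot\mathbf A)-\mathbf B(\mathbf A\cdot\mathbf C)\}=\mathbf B\cdot[\mathbf A\times(\mathbf C\times\mathbf B)]=(\mathbf B\times\mathbf A)\cdot(\mathbf C\times\mathbf B), \]

the inequality immediately follows. There is equality only if O is in the plane of 
A, B, C. 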
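
The vector identity used in this proof, and the inequality it yields, can be spot-checked on random integer vectors. The sketch below is a numerical illustration only; the helper names, the seed, and the tolerance are arbitrary choices and not part of the paper.

```python
import math
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def norm(u):
    return math.sqrt(dot(u, u))

def identity_gap(A, B, C):
    """(B.C)(B.A) - (B.B)(A.C) minus (B x A).(C x B); zero when the identity holds."""
    left = dot(B, C) * dot(B, A) - dot(B, B) * dot(A, C)
    right = dot(cross(B, A), cross(C, B))
    return left - right

def inequality_holds(A, B, C):
    """(B.B)(A.C) >= (B.C)(B.A) - |B x C||B x A|, allowing for float rounding."""
    left = dot(B, B) * dot(A, C)
    right = dot(B, C) * dot(B, A) - norm(cross(B, C)) * norm(cross(B, A))
    return left >= right - 1e-9 * (1 + abs(left) + abs(right))

def random_vector(rng, size=9):
    return tuple(rng.randint(-size, size) for _ in range(3))

if __name__ == "__main__":
    rng = random.Random(0)
    A, B, C = (random_vector(rng) for _ in range(3))
    print(identity_gap(A, B, C), inequality_holds(A, B, C))
```

With integer inputs the identity is checked exactly; only the inequality, which involves square roots, needs a tolerance. The non-strict comparison also covers the coplanar equality case.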


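THEOREM 2. The sum of the areas of any three faces of a tetrahedron is greater 
than the area of the fourth face [4, p. 106]. 


Proof: Referring to Fig. 1, we have to show vectorially that 

\[ |\mathbf A\times\mathbf B|+|\mathbf B\times\mathbf C|+|\mathbf C\times\mathbf A|>|(\mathbf A-\mathbf B)\times(\mathbf C-\mathbf B)|. \]

Since $|(\mathbf A-\mathbf B)\times(\mathbf C-\mathbf B)|=|\mathbf A\times\mathbf B+\mathbf B\times\mathbf C+\mathbf C\times\mathbf A|$, the desired result follows 
from the triangle inequality. Again there is equality only if O is in the plane of 
A, B, C. 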


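THEOREM 3. In any trihedral angle, the sines of the dihedral angles are proportional 
to the sines of the opposite face angles [3, p. 67]. 


[Fig. 1] 


Proof: If we let $A$, $B$, $C$ denote the dihedral angles containing the edges 
OA, OB, OC (which can now all be assumed to be of unit length), respectively, 
in Fig. 1, we have to show that 

\[ \frac{\sin A}{\sin BOC}=\frac{\sin B}{\sin COA}=\frac{\sin C}{\sin AOB}. \]

Since 

\[ \frac{\sin B}{\sin COA}=\frac{|(\mathbf A\times\mathbf B)\times(\mathbf B\times\mathbf C)|}{|\mathbf A\times\mathbf B|\cdot|\mathbf B\times\mathbf C|\cdot|\mathbf C\times\mathbf A|}=\frac{[\mathbf A\cdot\mathbf B\times\mathbf C]}{|\mathbf A\times\mathbf B|\cdot|\mathbf B\times\mathbf C|\cdot|\mathbf C\times\mathbf A|} \]

is symmetric in A, B, C, the result is established. 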

The synthetic proof given in [3] does not include the proportionality constant 
which enables one to compute the dihedral angles from the face angles. To 
accomplish this, we need an expression for $[\mathbf A\cdot\mathbf B\times\mathbf C]$ (a neater way but not 
quite so direct is given in [5, p. 35]) which gives the volume of a parallelepiped 
having A, B, C as three coterminal edges in terms of the face angles. To this end, 
we let 

\[ \mathbf A=\mathbf i,\qquad \mathbf B=\mathbf i\cos AOB+\mathbf j\sin AOB,\qquad \mathbf C=p\,\mathbf i+q\,\mathbf j+r\,\mathbf k,\quad\text{where } p^2+q^2+r^2=1. \]

Since 

\[ \mathbf A\cdot\mathbf C=\cos AOC=p,\qquad \mathbf B\cdot\mathbf C=\cos BOC=p\cos AOB+q\sin AOB, \]

\[ \begin{aligned} V=[\mathbf A\cdot\mathbf B\times\mathbf C]&=r\sin AOB=(1-p^2-q^2)^{1/2}\sin AOB\\ &=\{(1-\cos^2AOC)\sin^2AOB-(\cos BOC-\cos AOC\cos AOB)^2\}^{1/2}\\ &=\{1-\cos^2AOB-\cos^2BOC-\cos^2COA+2\cos AOB\cos BOC\cos COA\}^{1/2}. \end{aligned} \tag{1} \]

Thus $\sin A=V/(\sin AOB\,\sin COA)$, etc. 
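
Formula (1) can be compared numerically with a directly computed scalar triple product. The sketch below is a hedged check on random unit vectors; squares are compared so that the sign of the triple product and the behaviour of the square root near zero do not matter, and the helper names are illustrative.

```python
import math
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def random_unit_vector(rng):
    """A random direction obtained by normalizing a Gaussian sample."""
    v = tuple(rng.gauss(0.0, 1.0) for _ in range(3))
    length = math.sqrt(dot(v, v))
    return tuple(x / length for x in v)

def v_squared_direct(A, B, C):
    """Square of the scalar triple product [A . B x C]."""
    return dot(A, cross(B, C)) ** 2

def v_squared_from_face_angles(A, B, C):
    """Right-hand side of (1), squared, using cos AOB = A.B and so on."""
    c_ab, c_bc, c_ca = dot(A, B), dot(B, C), dot(C, A)
    return 1 - c_ab ** 2 - c_bc ** 2 - c_ca ** 2 + 2 * c_ab * c_bc * c_ca

if __name__ == "__main__":
    rng = random.Random(1)
    A, B, C = (random_unit_vector(rng) for _ in range(3))
    print(v_squared_direct(A, B, C), v_squared_from_face_angles(A, B, C))
```

The same expression is the determinant of the matrix of pairwise cosines, which is the form used in Theorem 4.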

The symmetrical function $V/2$ of the face angles of the given trihedral angle is 
called the norm of the sides of the spherical triangle ABC and is denoted by $n$ in 
spherical trigonometry [6, p. 498]. Also, $V$ or equivalently $2n$ is sometimes called 
the sine of the solid angle subtended by the trihedral angle. 

Reciprocally, the face angles as functions of the dihedral angles are obtained 
by letting 

\[ \sin BOC=\lambda\sin A,\ \text{etc.} \]

and solving for $\lambda$ as a function of $A$, $B$, $C$. Whence, 

\[ \frac{\sin BOC}{\sin A}=\frac{\sin COA}{\sin B}=\frac{\sin AOB}{\sin C}=\frac{2N}{\sin A\sin B\sin C}, \]

where $2N=\{1-\cos^2A-\cos^2B-\cos^2C-2\cos A\cos B\cos C\}^{1/2}$. $N$ is called the 
norm of the angles of the spherical triangle ABC. Alternate derivations by 
means of spherical trigonometry for the latter formulae are given in [6, pp. 
497, 503]. 

Since $V$ ranges between 0 and 1, we also have the following inequality: 

\[ 3\ge\sin^2\theta_1+\sin^2\theta_2+\sin^2\theta_3+2\cos\theta_1\cos\theta_2\cos\theta_3\ge 2, \]

where 

\[ \pi\ge\theta_i\ge 0,\qquad \theta_1+\theta_2+\theta_3\le 2\pi, \]
\[ \theta_1+\theta_2\ge\theta_3,\qquad \theta_2+\theta_3\ge\theta_1,\qquad \theta_3+\theta_1\ge\theta_2. \]

The upper bound is achieved for $\theta_i=\pi/2$ and the lower bound when the sum of 
two angles equals the third angle. 

A determinantal representation for $V^2$ is 

\[ V^2=\begin{vmatrix}1&\cos AOB&\cos COA\\ \cos AOB&1&\cos BOC\\ \cos COA&\cos BOC&1\end{vmatrix}. \]

The latter form is equivalent to the following result given in [7, p. 48]: 


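THEOREM 4. If $\alpha$, $\beta$, $\gamma$ are the angles between each pair of the triad of directions 
OA, OB, OC, then 

\[ \begin{vmatrix}1&\cos\gamma&\cos\beta\\ \cos\gamma&1&\cos\alpha\\ \cos\beta&\cos\alpha&1\end{vmatrix}=\sin^2\gamma\,\sin^2\phi, \]

where $\phi$ is the angle OC makes with the plane OAB. 


Proof: Referring to the previous figure again, $\sin\phi=(\mathbf A\times\mathbf B\cdot\mathbf C)/|\mathbf A\times\mathbf B|$ or 
$V^2=\sin^2\phi\,\sin^2AOB$. 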


THEOREM 5. If the edges of the base of a tetrahedron are $a$, $b$, $c$, and each of the 
lateral edges is equal to $d$, then the volume $v$ of the tetrahedron is given by [4, p. 108] 

\[ v=\{16p(p-a)(p-b)(p-c)d^2-a^2b^2c^2\}^{1/2}/12, \]

where $2p=a+b+c$. 


Proof: It follows from (1) that $v=Vd^3/6$. By substituting $\cos BOC=1-a^2/2d^2$, 
etc., in $V$ in (1) and simplifying we get the desired result. 
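
The simplification asserted in this proof can be confirmed numerically by computing the volume both ways. The sketch below assumes the reading $v=\{16p(p-a)(p-b)(p-c)d^2-a^2b^2c^2\}^{1/2}/12$ with $2p=a+b+c$ and $v=Vd^3/6$; the function names and the sample edge lengths are illustrative.

```python
import math

def volume_from_edges(a, b, c, d):
    """Theorem 5: base edges a, b, c and three equal lateral edges d."""
    p = (a + b + c) / 2
    inside = 16 * p * (p - a) * (p - b) * (p - c) * d ** 2 - (a * b * c) ** 2
    return math.sqrt(inside) / 12

def volume_from_face_angles(a, b, c, d):
    """v = V d^3 / 6, with V from (1) and each cosine equal to 1 - s^2 / (2 d^2)."""
    cos_a, cos_b, cos_c = (1 - s ** 2 / (2 * d ** 2) for s in (a, b, c))
    v_squared = (1 - cos_a ** 2 - cos_b ** 2 - cos_c ** 2
                 + 2 * cos_a * cos_b * cos_c)
    return math.sqrt(v_squared) * d ** 3 / 6

if __name__ == "__main__":
    for edges in [(3, 4, 5, 3), (2, 2, 2, 2)]:
        print(edges, volume_from_edges(*edges), volume_from_face_angles(*edges))
```

For the regular tetrahedron of edge 2 both expressions reduce to $2\sqrt2/3$, the familiar value $s^3/(6\sqrt2)$ with $s=2$.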


THEOREM 6: If a line makes congruent angles with each of three lines in a 
plane, the line is perpendicular to the plane [3, p. 45]. 


[Fig. 2] 


Proof: If $\mathbf A$, $\mathbf B$, $\mathbf C$ denote unit vectors along the three lines in the plane and $\mathbf R$ 
is a vector along the other line, we have to show equivalently that (see Fig. 2) 

\[ \mathbf R\cdot\mathbf A=\mathbf R\cdot\mathbf B=\mathbf R\cdot\mathbf C\ \Rightarrow\ \mathbf R\cdot\mathbf A=\mathbf R\cdot\mathbf B=0. \]

Let $\mathbf R\cdot\mathbf A=\lambda$. Then since $\mathbf A$, $\mathbf B$, $\mathbf C$ are coplanar, $\mathbf C=a\mathbf A+b\mathbf B$ and 
$\mathbf R\cdot\mathbf C=(a+b)\lambda=\lambda$ or $(a+b-1)\lambda=0$. 
But $a+b=1$ is impossible, since this implies that A, B, C are collinear. Thus 
$\lambda=0$. (A simple geometric proof follows by considering the perpendicular 
bisecting planes of AB and BC. Their intersection which is normal to the plane 
contains both O and R.) 


3. Altitudes. The next set of theorems (7-12) relate to the altitudes of a 
tetrahedron. While the proofs of all of them are not as simple as the correspond- 
ing synthetic ones, they may be more appealing (especially to non-geometers) 
by virtue of their directness. All of them except Theorem 12 are given in [4, pp. 
61-65]. 

All the proofs will refer to Fig. 3. Here A, B, C, D, etc., will denote vectors 
from a common origin to the points A, B, C, D, etc. 


[Fig. 3] 


THEOREM 7. If a pair of opposite edges of a tetrahedron are rectangular (normal 
to each other), the two altitudes of the tetrahedron issued from the end of each of these 
two edges intersect. 


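Proof: Equivalently, we have to show that given $(\mathbf A-\mathbf B)\cdot(\mathbf C-\mathbf D)=0$, there 
exists a vector $\mathbf H_1$ such that 

\[ (\mathbf H_1-\mathbf A)\cdot(\mathbf B-\mathbf C)=(\mathbf H_1-\mathbf A)\cdot(\mathbf C-\mathbf D)=0, \tag{2} \]
\[ (\mathbf H_1-\mathbf B)\cdot(\mathbf C-\mathbf D)=(\mathbf H_1-\mathbf B)\cdot(\mathbf D-\mathbf A)=0. \tag{3} \]

If $H$ is any point on $h_a$, then (2) is satisfied by $\mathbf H_1=\mathbf A+\lambda(\mathbf H-\mathbf A)$ for any $\lambda$. Since 

\[ (\mathbf A-\mathbf B)\cdot(\mathbf C-\mathbf D)=0\ \Rightarrow\ (\mathbf H_1-\mathbf B)\cdot(\mathbf C-\mathbf D)-(\mathbf H_1-\mathbf A)\cdot(\mathbf C-\mathbf D)=0, \]

the first part of (3) is satisfied. To satisfy the second part of (3), we must have 

\[ (\mathbf A-\mathbf B)\cdot(\mathbf D-\mathbf A)+\lambda(\mathbf H-\mathbf A)\cdot(\mathbf D-\mathbf A)=0. \]

Since $(\mathbf H-\mathbf A)\cdot(\mathbf D-\mathbf A)\ne 0$, a unique $\lambda$ exists. 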


THEOREM 8 (converse of Th. 7). If two altitudes of a tetrahedron intersect, the 
edge joining the two vertices from which these altitudes issue is perpendicular to the 
opposite edge of the tetrahedron. 


1056 M. S. KLAMKIN [December 


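Proof: We wish to show equivalently that 

\[ (\mathbf H-\mathbf A)\cdot(\mathbf C-\mathbf D)=(\mathbf H-\mathbf A)\cdot(\mathbf B-\mathbf C)=0, \tag{4} \]
\[ (\mathbf H-\mathbf B)\cdot(\mathbf C-\mathbf D)=(\mathbf H-\mathbf B)\cdot(\mathbf A-\mathbf C)=0 \tag{5} \]

together imply $(\mathbf A-\mathbf B)\cdot(\mathbf C-\mathbf D)=0$. 

On subtracting the first part of (4) from the first part of (5), we get the desired 
result. Since we did not use the second parts of (4) and (5), we can make the 
stronger statement: 


THEOREM 8'. If two lines from A and B are normal to CD and intersect, then 
$AB\perp CD$. 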


COROLLARY. If two altitudes of a tetrahedron intersect, the remaining two 
altitudes intersect. 


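THEOREM 9. If two pairs of opposite edges of a tetrahedron are rectangular, 
then the remaining pair of opposite edges is also rectangular. 


Proof: We wish to show equivalently that 

\[ (\mathbf A-\mathbf B)\cdot(\mathbf C-\mathbf D)=0, \tag{6} \]
\[ (\mathbf B-\mathbf C)\cdot(\mathbf D-\mathbf A)=0 \tag{7} \]

together imply $(\mathbf A-\mathbf C)\cdot(\mathbf B-\mathbf D)=0$. 

The result immediately follows by subtracting (6) from (7). 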


THEOREM 10. If three altitudes of a tetrahedron are concurrent, then the four 
altitudes are concurrent. 


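Proof: Our first proof is similar to the vector proof for the concurrency of 
three altitudes of a triangle given in [g, p. 32]. A normal to the plane of A, B, C 
is given by 

\[ \mathbf A\times\mathbf B+\mathbf B\times\mathbf C+\mathbf C\times\mathbf A. \]

Also, 

\[ \mathbf A\times(\mathbf B\times\mathbf C)+\mathbf B\times(\mathbf C\times\mathbf A)+\mathbf C\times(\mathbf A\times\mathbf B)=0. \]

Using the latter identity, it follows immediately that 

\[ \begin{aligned} &(\mathbf H-\mathbf A)\times(\mathbf B\times\mathbf C+\mathbf C\times\mathbf D+\mathbf D\times\mathbf B)\\ &\quad+(\mathbf H-\mathbf B)\times(\mathbf C\times\mathbf A+\mathbf A\times\mathbf D+\mathbf D\times\mathbf C)\\ &\quad+(\mathbf H-\mathbf C)\times(\mathbf D\times\mathbf A+\mathbf A\times\mathbf B+\mathbf B\times\mathbf D)\\ &\quad+(\mathbf H-\mathbf D)\times(\mathbf A\times\mathbf C+\mathbf C\times\mathbf B+\mathbf B\times\mathbf A)=0 \end{aligned} \tag{8} \]

is also an identity. By choosing H to be the point of concurrency of three of the 
altitudes, our result follows. 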
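
Identity (8) holds for arbitrary vectors, so it can be verified exactly on random integer inputs. The sketch below is a spot check with illustrative helper names; the grouping of the twelve cross products follows the four face normals as reconstructed above.

```python
import random

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def sub(u, v):
    return tuple(a - b for a, b in zip(u, v))

def add3(u, v, w):
    return tuple(a + b + c for a, b, c in zip(u, v, w))

def identity_eight(H, A, B, C, D):
    """Left-hand side of (8); the zero vector if the identity holds."""
    terms = [
        cross(sub(H, A), add3(cross(B, C), cross(C, D), cross(D, B))),
        cross(sub(H, B), add3(cross(C, A), cross(A, D), cross(D, C))),
        cross(sub(H, C), add3(cross(D, A), cross(A, B), cross(B, D))),
        cross(sub(H, D), add3(cross(A, C), cross(C, B), cross(B, A))),
    ]
    return tuple(sum(t[i] for t in terms) for i in range(3))

def random_vector(rng, size=20):
    return tuple(rng.randint(-size, size) for _ in range(3))

if __name__ == "__main__":
    rng = random.Random(3)
    print(identity_eight(*(random_vector(rng) for _ in range(5))))
```

The reason it works is visible in the code: the four face normals sum to zero, which removes H, and the remaining twelve triple products split into four instances of the identity $\mathbf A\times(\mathbf B\times\mathbf C)+\mathbf B\times(\mathbf C\times\mathbf A)+\mathbf C\times(\mathbf A\times\mathbf B)=0$. If three altitudes pass through H, three terms of (8) vanish and the fourth must vanish too.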

Although this proof is rather neat, it admittedly is not very direct. Con- 
sequently, we now give a more direct one. 

We wish to show equivalently that 
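
\[ \left.\begin{aligned} (\mathbf H-\mathbf A)\cdot(\mathbf B-\mathbf C)&=(\mathbf H-\mathbf A)\cdot(\mathbf C-\mathbf D)=0\\ (\mathbf H-\mathbf B)\cdot(\mathbf C-\mathbf D)&=(\mathbf H-\mathbf B)\cdot(\mathbf D-\mathbf A)=0\\ (\mathbf H-\mathbf C)\cdot(\mathbf D-\mathbf A)&=(\mathbf H-\mathbf C)\cdot(\mathbf A-\mathbf B)=0 \end{aligned}\right\}\ \Rightarrow\ (\mathbf H-\mathbf D)\cdot(\mathbf A-\mathbf B)=(\mathbf H-\mathbf D)\cdot(\mathbf B-\mathbf C)=0. \tag{9} \]

As in Theorem 8, we then have $(\mathbf A-\mathbf B)\cdot(\mathbf C-\mathbf D)=(\mathbf B-\mathbf C)\cdot(\mathbf D-\mathbf A)=0$. Since 
$\mathbf C-\mathbf D=(\mathbf H-\mathbf D)-(\mathbf H-\mathbf C)$, 

\[ (\mathbf H-\mathbf D)\cdot(\mathbf A-\mathbf B)=(\mathbf H-\mathbf C)\cdot(\mathbf A-\mathbf B)=0. \]

Similarly, $(\mathbf H-\mathbf D)\cdot(\mathbf B-\mathbf C)=0$. 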


THEOREM 11. If ABCD is a tetrahedron whose altitudes are concurrent in the 
point H (orthocenter), then each of the five points is the orthocenter of the tetrahedron 
formed by the other four points. 


Proof: Follows immediately from (9). 


THEOREM 12: If one altitude of a tetrahedron intersects two other altitudes, 
then the four altitudes are concurrent. 


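Proof: It follows from Theorems 8 and 9 that 

\[ (\mathbf B-\mathbf C)\cdot(\mathbf A-\mathbf D)=(\mathbf A-\mathbf C)\cdot(\mathbf B-\mathbf D)=(\mathbf A-\mathbf B)\cdot(\mathbf C-\mathbf D)=0. \]

If H denotes the point of intersection of $h_a$ and $h_b$, then 

\[ (\mathbf H-\mathbf A)\cdot(\mathbf B-\mathbf C)=(\mathbf H-\mathbf A)\cdot(\mathbf B-\mathbf D)=0, \]
\[ (\mathbf H-\mathbf B)\cdot(\mathbf A-\mathbf C)=(\mathbf H-\mathbf B)\cdot(\mathbf A-\mathbf D)=0, \]
\[ (\mathbf B-\mathbf D)\cdot(\mathbf A-\mathbf C)=(\mathbf B-\mathbf D)\cdot(\mathbf H-\mathbf C)-(\mathbf B-\mathbf D)\cdot(\mathbf H-\mathbf A)=0, \]
\[ (\mathbf A-\mathbf D)\cdot(\mathbf B-\mathbf C)=(\mathbf A-\mathbf D)\cdot(\mathbf H-\mathbf C)-(\mathbf A-\mathbf D)\cdot(\mathbf H-\mathbf B)=0. \]

Whence, $(\mathbf H-\mathbf C)\cdot(\mathbf A-\mathbf D)=(\mathbf H-\mathbf C)\cdot(\mathbf B-\mathbf D)=0$, implying that $h_a$, $h_b$ and $h_c$ are 
concurrent. Then by Theorem 10, all the altitudes are concurrent. 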


4. Tangencies. The next two theorems, which on first glance may seem un- 
related, will be shown to be essentially equivalent. 


THEOREM 13. If an ellipsoid is tangent to the four edges of a skew quadrilateral, 
then the points of tangency are coplanar [5, p. 180]. 


THEOREM 14. If four spheres touch in succession, each one touching two others 
(the number of external contacts being even), then the four points of tangency lie on a 
circle [5, p. 44]. 


We first establish Carnot’s theorem and its converse [4, p. 111]. 


THEOREM 15. If P, Q, R, S are four coplanar points on the four sides, respectively,
of a skew quadrilateral ABCD, then

$$\frac{AP}{PB}\cdot\frac{BQ}{QC}\cdot\frac{CR}{RD}\cdot\frac{DS}{SA} = 1 \tag{10}$$

and conversely (here AP is taken to be positive if P lies on the ray AB, otherwise it is
negative). (See Fig. 4.)


Fig. 4


Proof: Since P is on line AB etc., we have

$$P = aA + (1-a)B,\qquad Q = bB + (1-b)C,$$
$$R = cC + (1-c)D,\qquad S = dD + (1-d)A.$$

Since P, Q, R, S are coplanar, there must exist four constants p, q, r, s, where
$p+q+r+s = 0$, $pqrs \neq 0$, such that

$$pP + qQ + rR + sS = 0.$$

Since the origin of the vectors is taken outside the 3-space of the quadrilateral,
A, B, C, D are linearly independent. Thus the coefficients of A, B, C, D in the
latter equation are all zero, i.e.,

$$pa + s(1-d) = 0,\qquad p(1-a) + qb = 0,$$
$$q(1-b) + rc = 0,\qquad r(1-c) + sd = 0.$$

In order that this latter set of equations be consistent, it is necessary that

$$(1-a)(1-b)(1-c)(1-d) = abcd.$$

On solving for p, q, r in terms of s and substituting back in $p+q+r+s$, it follows
after some simplification that the condition is also sufficient. Since

$$AP = (1-a)\,|A-B|,\qquad PB = a\,|A-B|,$$

etc., we get the desired result.

For the converse theorem, we are given that (10) holds. Let the plane determined
by P, Q, R intersect AD in S'. Then by the previous theorem,

$$\frac{AP}{PB}\cdot\frac{BQ}{QC}\cdot\frac{CR}{RD}\cdot\frac{DS'}{S'A} = 1.$$

Whence S' coincides with S, and P, Q, R, S are coplanar.

We digress to sketch out an analytic geometry proof and a proof using 
centroids. Then we shall compare the proofs, including the synthetic one (using 
Menelaus’ theorem) which was referred to in [4], to illustrate the differences in 
simplicity and directness as mentioned in the introduction. 

For the analytic proof, we start out by coordinatizing the points A, B, C, D,
i.e., let them be $(x_i, y_i, z_i)$, where i = 1, 2, 3, 4. Points P, Q, R are then chosen
arbitrarily on the lines AB, BC, and CD, respectively. S is then determined as 
the intersection of the plane through P, Q, R with the line DA. Finally, the 
desired relation is verified. 
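
The analytic route just described can be sketched in exact rational arithmetic. This is a minimal illustration under stated assumptions, not the author's computation: the function name `carnot_product`, the helper names, and the sample coordinates are all hypothetical. It uses the parametrization $P = aA + (1-a)B$ and so on from the vector proof, so that each signed ratio such as AP/PB equals $(1-a)/a$.

```python
from fractions import Fraction as F

def sub(u, v):
    return tuple(x - y for x, y in zip(u, v))

def comb(t, u, v):
    """The point t*u + (1 - t)*v."""
    return tuple(t * x + (1 - t) * y for x, y in zip(u, v))

def det3(u, v, w):
    return (u[0] * (v[1] * w[2] - v[2] * w[1])
            - u[1] * (v[0] * w[2] - v[2] * w[0])
            + u[2] * (v[0] * w[1] - v[1] * w[0]))

def carnot_product(A, B, C, D, a, b, c):
    """Return (d, product of the four signed ratios in (10))."""
    P, Q, R = comb(a, A, B), comb(b, B, C), comb(c, C, D)
    # S = d*D + (1-d)*A = A + d*(D - A); coplanarity det(Q-P, R-P, S-P) = 0
    # is linear in d. (n1 == 0 would mean the plane PQR is parallel to DA.)
    n0 = det3(sub(Q, P), sub(R, P), sub(A, P))
    n1 = det3(sub(Q, P), sub(R, P), sub(D, A))
    d = -n0 / n1
    product = F(1)
    for t in (a, b, c, d):
        product *= (1 - t) / t      # AP/PB, BQ/QC, CR/RD, DS/SA in turn
    return d, product

# Assumed sample skew quadrilateral and parameters.
d, product = carnot_product((0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 0, 1),
                            F(1, 3), F(2, 5), F(3, 4))
print(d, product)
```

Because `Fraction` arithmetic is exact, the product can be compared with 1 directly rather than within a tolerance.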

For the centroid proof, assume that masses

$$M_a = \frac{k}{AP},\qquad M_b = \frac{k}{BP},\qquad M_c = \frac{l}{CR},\qquad M_d = \frac{l}{DR}$$

are placed at the points A, B, C, D, respectively, such that k, l satisfy

$$k\,\frac{AS}{AP} = l\,\frac{DS}{DR}.$$

These masses have been so chosen that the centroid of $M_a$ and $M_b$ is at P, the
centroid of $M_c$ and $M_d$ is at R, and the centroid of $M_a$ and $M_d$ is at S. It now
follows immediately that the centroid G of the four masses is on the segment PR.
If the centroid of $M_b$ and $M_c$ is at Q' on BC, then G is also on SQ'. Thus, PR
must intersect SQ' at G. Unless Q' and Q coincide, A, B, C, D will be coplanar,
contradicting the hypothesis. Then,

$$k\,\frac{BQ}{BP} = l\,\frac{CQ}{CR},$$

which together with the previous condition in k, l gives the desired result.

On comparing the different proofs it is seen that the analytic geometry one is 
very direct but involves a lot of arithmetic. The vector proof is somewhat less 
direct but the arithmetic is much less onerous. The computations in the centroid 
proof are few and easy but the proof requires the very indirect step of introduc- 
ing appropriate masses and using centroids. The synthetic proof (see reference) 
is also very easy provided that you invoke Menelaus’ theorem. To any decent
geometer, this would be a fairly direct step. However, to many others, it will 
not be. 

We now return to the proofs of Theorems 13 and 14. Since tangency and 
planarity are preserved under affine transformations it suffices in Theorem 13 to 
prove the result for a sphere. Also, since tangents from a point to a sphere are 
congruent, the configuration we obtain is the same as in the first figure for the 
four possible configurations, Figs. 4-7 of Theorem 14. 


Fig. 5


Fig. 6          Fig. 7


In three of the cases (Figs. 5, 6, 7), some of the points of tangency P, Q, R, S
do not lie in the interior of their corresponding segments AB, BC, CD, DA. This
occurs when a pair of corresponding spheres are tangent internally, e.g., P in
Figs. 6, 7. However, in all the cases,

$$\frac{AP}{PB} = \pm\frac{a}{b},\qquad \frac{BQ}{QC} = \pm\frac{b}{c},\qquad \frac{CR}{RD} = \pm\frac{c}{d},\qquad \frac{DS}{SA} = \pm\frac{d}{a},$$

where the number of minus signs is even. Then by the converse of Carnot’s
theorem, P, Q, R, S are coplanar. Since in Theorem 13, P, Q, R, S are on a
sphere, they also must lie on a circle. To prove the circular part for Theorem 14
requires an extension of the proof that a sufficient condition (which is also
necessary) for a plane quadrilateral ABCD to have an inscribed circle is that
AB+CD = BC+DA [9, p. 28]. Our proof is geometric as we do not have a simple
enough vectorial one. It will be in two parts, one for Figs. 4, 5 and the other for
Figs. 6, 7.

Since P, Q, R, S have been shown to be coplanar, it suffices to show also that 
they are cospherical. 

In Figs. 4, 5, AB+CD = BC+DA. Assuming without loss of generality that
AB ≥ DA, then BC ≥ CD. Points X, Y are chosen on AB and BC, respectively,
such that AX = DA, YC = CD and then XB = BY. It is to be noted that although
the configuration in Fig. 8 is taken from Fig. 4, it also applies to Fig. 5.
The only change is that R and S are then exterior points of their corresponding
segments. The perpendicular bisecting planes of DX, XY, YD will intersect in a
line through the circumcenter of △DXY and perpendicular to its plane. By the
symmetry of the configuration, each point on this perpendicular line will be
equidistant from the four sides of ABCD and also from P, Q, R, S. Thus, P, Q,
R, S are cospherical and, additionally, the skew quadrilateral has a family of
“inspheres.”


Fig. 8          Fig. 9


In Figs. 6, 7, AB+BC=CD+DA. In the case of a plane quadrilateral, this 
will turn out to be the necessary and sufficient condition for the existence of a 
circle tangent to the four sides (2 internal and 2 external points of contact). 
Since the previous argument now applies also for Fig. 9, our proof is completed. 


5. Higher dimensions. In the next four theorems, we consider n-dimensional
figures. For problems in higher dimensions, the vector approach is particularly 
useful and often a necessity. 


THEOREM 16. Corresponding to any n-dimensional simplex, there exists a skew
(n+1)-gon whose sides are congruent and parallel to the n+1 medians of the
simplex. Furthermore, the volume of the simplex spanned by the n+1 vertices of the
skew-gon is $(1+1/n)^{n-1}/n$ times the volume of the initial simplex. This generalizes
the known results for a triangle and tetrahedron [4, p. 54].


Proof: Let the n+1 vertices of the given simplex be $O, A_1, A_2, \ldots, A_n$ and
let $A_i$ denote the set of n linearly independent vectors from O to $A_i$. The volume
of the given simplex is then

$$V = [A_1, A_2, \ldots, A_n]/n! = \frac{1}{n!}
\begin{vmatrix}
a_{11} & a_{12} & \cdots & a_{1n}\\
a_{21} & a_{22} & \cdots & a_{2n}\\
\cdots & \cdots & \cdots & \cdots\\
a_{n1} & a_{n2} & \cdots & a_{nn}
\end{vmatrix}.$$

Here $(a_{i1}, a_{i2}, \ldots, a_{in})$ denotes the rectangular components of $A_i$. The n+1
medians of the simplex are given by

$$M,\qquad M - \frac{n+1}{n}A_i \quad (i = 1, 2, \ldots, n),\qquad\text{where } nM = A_1 + A_2 + \cdots + A_n.$$

The sum of these n+1 vectors is zero and consequently they form a closed skew
(n+1)-gon.

If we now choose the starting point of M as our origin, the n vectors to the
remaining vertices of the (n+1)-gon are

$$M,\quad 2M - \lambda A_1,\quad 3M - \lambda(A_1 + A_2),\quad \ldots,\quad nM - \lambda(A_1 + A_2 + \cdots + A_{n-1}),$$

where $\lambda = (n+1)/n$. The volume V' of this simplex is given by

$$n!\,V' = [M,\ 2M - \lambda A_1,\ \ldots,\ nM - \lambda(A_1 + A_2 + \cdots + A_{n-1})].$$

By elementary operations on the determinant,

$$n!\,V' = [M, \lambda A_1, \ldots, \lambda A_{n-1}] = [A_n/n, \lambda A_1, \lambda A_2, \ldots, \lambda A_{n-1}]$$

or

$$n!\,V' = \frac{\lambda^{n-1}}{n}\,[A_1, A_2, \ldots, A_n].$$
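
The volume ratio in Theorem 16 can be checked directly for small n. The following is a minimal sketch under assumptions, not the author's computation: `det` and `median_polygon_ratio` are hypothetical helper names, and the sample simplices are arbitrary. The code builds the medians from the vertices, takes partial sums to obtain the vertices of the skew polygon, and compares determinants exactly.

```python
from fractions import Fraction as F

def det(m):
    """Determinant by cofactor expansion along the first row (fine for small n)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, x in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * x * det(minor)
    return total

def median_polygon_ratio(A):
    """A: list of n vectors A_1..A_n (each of length n) from the vertex O.

    Returns V'/V, the volume of the simplex spanned by the median polygon
    divided by the volume of the original simplex.
    """
    n = len(A)
    verts = [[F(0)] * n] + [[F(x) for x in v] for v in A]
    medians = []
    for j, v in enumerate(verts):
        others = [w for i, w in enumerate(verts) if i != j]
        centroid = [sum(col) / n for col in zip(*others)]
        medians.append([c - x for c, x in zip(centroid, v)])
    # Partial sums of the first n medians are the polygon vertices other
    # than the starting point (the last median closes the polygon).
    partial, point = [], [F(0)] * n
    for m in medians[:n]:
        point = [p + q for p, q in zip(point, m)]
        partial.append(point)
    return abs(det(partial)) / abs(det([[F(x) for x in v] for v in A]))

print(median_polygon_ratio([[1, 0], [0, 1]]))                    # triangle
print(median_polygon_ratio([[2, 0, 0], [1, 3, 0], [1, 1, 4]]))   # tetrahedron
```

For n = 2 this reproduces the classical fact that the triangle formed by the medians has 3/4 of the original area.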

It is known [10, p. 139] that in order that two points in the interior or on the
boundary of a polygon be farthest apart, they must be two of the vertices that
are farthest apart. The polygon need not be convex. An extension of this to
n-dimensional polytopes is easily proved by means of the triangle inequality.


THEOREM 17. If two points in the interior or on the boundary of a polytope are
furthest apart, they must be two of the vertices which are furthest apart.


Proof: We need only consider convex polytopes. For if the result is valid for 
the convex hull of the polytope, it is also valid for the polytope. 

Now let $V_1, V_2, \ldots, V_m$ denote vectors from a common origin to the vertices
of the polytope. Then $R = \sum \lambda_i V_i$ and $R' = \sum \lambda_i' V_i$, where $\lambda_i, \lambda_i' \ge 0$,
$\sum \lambda_i = \sum \lambda_i' = 1$, will denote two vectors from the common origin to two points
within or on the boundary of the polytope. Using the triangle inequality repeatedly
and the properties of $\lambda_i$ and $\lambda_i'$, we get the following sequence of inequalities:

$$|R' - R| = \Big|\sum_i \lambda_i'(V_i - R)\Big| \le \sum_i \lambda_i'\,|V_i - R|
\le \max_i |V_i - R| = \max_i \Big|\sum_j \lambda_j (V_i - V_j)\Big|$$
$$\le \max_i \sum_j \lambda_j\,|V_i - V_j| \le \max_{i,j} |V_i - V_j|.$$

For our next theorem, we consider a special case of “the optimal location of a
warehouse” problem [11, p. 394]. Here we have to determine a point R which
minimizes $\sum w_i\,|R - V_i|$, where $V_i$ are given points and $w_i$ are given positive
weights. If we set the appropriate derivatives equal to zero, the resulting equations
are given by

$$\sum w_i\,\frac{R - V_i}{|R - V_i|} = 0$$

(a set of forces in equilibrium). It has been shown that the point R is unique. A
minor difficulty arises if R coincides with one of the $V_i$’s.

If the weights w; are equal and the given points V; correspond to the vertices 
of a regular polytope (in any dimension), then one expects intuitively from the 
symmetry of the configuration that R will correspond to the center of the poly- 
tope. We now give an extension of this which is established elementarily using 
the triangle inequality. 


THEOREM 18. Let $V_i$ denote vectors from a common origin O to n given points on
a unit sphere (of any dimension) with center O. If $\sum w_i V_i = 0$, where $w_i$ are
arbitrarily given positive weights, then

$$\min_R \sum_i w_i\,|V_i - R| = \sum_i w_i.$$


Proof: By symmetry (see Fig. 10), $|V_i - R| = |rV_i - R/r|$, where $r = |R|$. Whence,

$$\sum w_i\,|V_i - R| = \sum w_i\,|rV_i - R/r| \ge \Big|\sum (r w_i V_i - w_i R/r)\Big| = \sum w_i$$

with equality if and only if r = 0.


Fig. 10
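
A quick numerical illustration of Theorem 18 in the plane follows. It is a sketch only: the three unit vectors and the weights are an assumed example chosen so that $\sum w_i V_i = 0$, and the coarse grid search stands in for the minimization over R.

```python
import math

# Assumed example: three points on the unit circle and positive weights
# with 6*(1, 0) + 5*(-0.6, 0.8) + 5*(-0.6, -0.8) = (0, 0).
V = [(1.0, 0.0), (-0.6, 0.8), (-0.6, -0.8)]
w = [6.0, 5.0, 5.0]

def cost(R):
    """Weighted sum of distances from R to the given points."""
    return sum(wi * math.hypot(v[0] - R[0], v[1] - R[1]) for wi, v in zip(w, V))

# Coarse grid over the square [-2, 2] x [-2, 2], step 0.1.
grid = [(x / 10, y / 10) for x in range(-20, 21) for y in range(-20, 21)]
best = min(grid, key=cost)
print(best, cost(best), sum(w))
```

On this grid the minimizer is the center O and the minimum agrees with $\sum w_i$ up to rounding, as the theorem asserts.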


Our last theorem is an elementary generalization of the convexity of ellip- 
soids of revolution. The proof via vectors is almost automatic. However, a 
simpler proof for general ellipsoids follows by transforming the ellipsoid into a 
sphere by an affine transformation (which preserves convexity). 


THEOREM 19. If $A_1, A_2, \ldots, A_n$ denote arbitrary points, not necessarily
distinct, in $E_m$, then the region spanned by the points P satisfying the inequality
$PA_1 + PA_2 + \cdots + PA_n \le k$ (constant) is convex.


Proof: We first show that for any three collinear points $P_1, Q, P_2$, where Q is
strictly between $P_1$ and $P_2$,

$$\sum_{i=1}^{n} |Q - A_i| \le \max\Big\{\sum_{i=1}^{n} |P_1 - A_i|,\ \sum_{i=1}^{n} |P_2 - A_i|\Big\}. \tag{11}$$

Here $A_i$ denotes the vector from an origin O to the point $A_i$, etc.
Since Q is between $P_1$ and $P_2$, $Q = \lambda P_1 + (1-\lambda)P_2$ with $0 < \lambda < 1$. Then by the
triangle inequality,

$$|Q - A_i| = |\lambda(P_1 - A_i) + (1-\lambda)(P_2 - A_i)| \le \lambda\,|P_1 - A_i| + (1-\lambda)\,|P_2 - A_i|.$$

Whence,

$$\sum |Q - A_i| \le \lambda \sum |P_1 - A_i| + (1-\lambda) \sum |P_2 - A_i|$$

with equality if and only if the points are collinear, and (11) follows.
Now if $P_1$ and $P_2$ satisfy $\sum_i PA_i \le k$, so also does Q and thus the region is
convex. For a generalization of Theorem 19, see [12].


A PROOF OF THE NEWTON-COTES QUADRATURE
FORMULAS WITH ERROR TERM 


D. R. HAYES, University of Massachusetts, AND L. RUBIN, University of Hartford 


1. Introduction. The Newton-Cotes quadrature formulas with error term 
are well known. A survey of recent textbooks on numerical analysis, however, 
seems to indicate that rigorous proofs of the standard expressions for the error 
terms are not nearly so well known as the expressions themselves. A relatively 
complete and rigorous proof can be found, for example, in Steffensen [3]; but
the reader must first master a rather considerable amount of machinery. 
Further, Steffensen’s method does not apply so well in the case of an even num- 
ber of interpolation points as it does in that of an odd number, and additional 
difficulties must be surmounted. Our aim in this paper is to present a direct and 
elementary proof of these formulas with error term. It is hoped that the proof is 
simple enough that it can be presented to an undergraduate class in numerical 
analysis in three or four lectures. Besides a standard mean value theorem for 
integrals, we require no more mathematical background than is usually taught in 
undergraduate calculus courses. Our method applies equally well whether the 
number of interpolation points is odd or even. Further, although our main aim is 
a simplified proof, we also arrive at a slightly more general theorem than is 
sometimes stated, since we do not require any continuity assumptions for the 
derivatives of the function being integrated. 

We now introduce some conventions and notation which will be used
throughout the paper. We assume given a positive integer n, and we define k by

$$k = \begin{cases} n/2 & \text{if } n \text{ is even}\\ (n-1)/2 & \text{if } n \text{ is odd.} \end{cases} \tag{1.1}$$

We also assume given a real valued function f which is defined and differentiable
2k+2 times in some open interval containing the closed interval [a, b]. Further,
we assume that the values of f are known explicitly at the n+1 points
$a = x_0 < x_1 < \cdots < x_n = b$ which one obtains as endpoints when [a, b] is partitioned
into n equal subintervals. Let p be the unique polynomial of degree less than or
equal to n which takes the same values as f at each of the points $x_0, \ldots, x_n$.
We call p the interpolating polynomial. Set

$$A(x) = \prod_{i=0}^{2k} (x - x_i), \tag{1.2}$$

so that A is a polynomial of degree n+1 or n depending on whether n is even or
odd. The polynomial A is called the scaling function. Finally, set

$$A^*(t) = \int_a^t A(x)\,dx. \tag{1.3}$$

THEOREM 1.1. With notation and assumptions as introduced above, we have

$$\int_a^b f(x)\,dx = \int_a^b p(x)\,dx - \frac{f^{(2k+2)}(\xi)}{(2k+2)!}\int_a^b A^*(t)\,dt \tag{1.4}$$

for some ξ such that a < ξ < b.


The first integral on the right in (1.4) is easily evaluated, as p is a poly- 
nomial, and yields the Newton-Cotes quadrature formula for n+1 points. This 
formula approximates the integral of f in terms of the values of f at the points 
$x_0, x_1, \ldots, x_n$. The integral in the second expression on the right in (1.4) is
independent of f (and also easily evaluated as A* is a polynomial). This expression,
therefore, gives the error of approximation in terms of the value of the
(2k+2)-nd derivative of f at some (unknown!) point between a and b.
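
Since A* is a polynomial, the f-independent factor in (1.4) can be computed exactly. The sketch below is an illustration under assumptions, not part of the paper: it takes the nodes to be 0, 1, ..., n (unit spacing, a = 0, b = n), represents polynomials as coefficient lists in ascending order, and uses hypothetical helper names. It returns $\frac{1}{(2k+2)!}\int_a^b A^*(t)\,dt$, the constant multiplying $-f^{(2k+2)}(\xi)$ in the error term.

```python
from fractions import Fraction as F
from math import factorial

def poly_mul(p, q):
    out = [F(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_int(p):
    """Antiderivative with zero constant term (ascending coefficients)."""
    return [F(0)] + [c / (i + 1) for i, c in enumerate(p)]

def poly_eval(p, x):
    return sum(c * x ** i for i, c in enumerate(p))

def error_coefficient(n):
    """(1/(2k+2)!) * integral over [0, n] of A*(t), for nodes 0, 1, ..., n."""
    k = n // 2                      # (1.1): n/2 for even n, (n-1)/2 for odd n
    A = [F(1)]
    for i in range(2 * k + 1):      # (1.2): A(x) = prod_{i=0}^{2k} (x - i)
        A = poly_mul(A, [F(-i), F(1)])
    A_star = poly_int(A)            # (1.3) with a = 0
    return poly_eval(poly_int(A_star), F(n)) / factorial(2 * k + 2)

for n in (1, 2, 3, 4):
    print(n, error_coefficient(n))
```

With unit spacing these values are the familiar constants of the trapezoidal rule, Simpson's rule, the three-eighths rule and Boole's rule; for spacing h the constant is multiplied by $h^{2k+3}$.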

Theorem 1.1 for n > 1 is proved in section 4. Sections 2 and 3 contain the
proofs of two auxiliary results. The idea of our proof of Theorem 1.1 is basically
the same as that of Steffensen. We write f(x) = p(x) + e(x) and attempt to
estimate $\int_a^b e(x)\,dx$. Following Steffensen, we put e(x) = A(x)g(x); that is, we
define g by

$$g(x) = \frac{f(x) - p(x)}{A(x)}. \tag{1.5}$$

(This is a departure from Steffensen if n is odd.) The function g is defined wherever
f is defined except at the points $x_0, x_1, \ldots, x_{2k}$. Now f has at least two
derivatives since n ≥ 1 and we have assumed 2k+2 derivatives. Therefore, f' is
continuous, and we can define

$$g(x_i) = \lim_{x \to x_i} \frac{f(x) - p(x)}{A(x)} = \frac{f'(x_i) - p'(x_i)}{A'(x_i)} \qquad (i = 0, 1, \ldots, 2k) \tag{1.6}$$

by l’Hospital’s rule. With this definition, g is defined and continuous in an open
interval containing [a, b].

The proof in section 4 does not apply if n = 1 unless f happens to have a third
derivative. Our assumption of 2k+2 derivatives only yields two derivatives for
n = 1; and as we do not wish to make unnecessary assumptions, we note that a
proof of Theorem 1.1 with n = 1 is given as an exercise with hints in Hardy [1,
p. 330]. For the remainder of the paper, therefore, we may assume that n > 1.
This assures us that our function f has at least four derivatives. In section 2 we
need to know that g' exists, and in section 4 we need to know that g' is continuous.
If f has four derivatives, these two results follow from l’Hospital’s rule
as follows: First, if x is not one of the points $x_0, \ldots, x_{2k}$, then g is differentiable
by (1.5) and the quotient rule for derivatives; and one obtains

$$g'(x) = \frac{A(x)[f'(x) - p'(x)] - A'(x)[f(x) - p(x)]}{[A(x)]^2}. \tag{1.7}$$

Therefore, we have to show in addition that

$$\lim_{x \to x_i} \frac{g(x) - g(x_i)}{x - x_i} = \lim_{x \to x_i} \frac{A'(x_i)[f(x) - p(x)] - A(x)[f'(x_i) - p'(x_i)]}{A(x)\,A'(x_i)\,(x - x_i)}$$

exists for i = 0, 1, ..., 2k. This can be done by two applications of l’Hospital’s
rule. As f''' exists, f'' is continuous, and we have

$$g'(x_i) = \frac{A'(x_i)[f''(x_i) - p''(x_i)] - A''(x_i)[f'(x_i) - p'(x_i)]}{2[A'(x_i)]^2} \tag{1.8}$$

for i = 0, 1, ..., 2k. Therefore, g' exists. To prove g' continuous, we note first
that this is obvious from (1.7) if x is not one of the points $x_0, \ldots, x_{2k}$. We have
left, therefore, to prove that

$$\lim_{x \to x_i} g'(x) = g'(x_i) \qquad (i = 0, 1, \ldots, 2k).$$

This can be done by evaluating the limit of (1.7) as $x \to x_i$ by again two applications
of l’Hospital’s rule. One obtains the right hand side of (1.8). One
needs the continuity of f''', which of course is at hand since f has four derivatives.


2. The derivative of g. In this section, an estimate is derived for g'(x) on
[a, b]. This estimate will be needed in section 4. A similar result has been obtained
by Ralston in [2]. The reader will note that the fact that the points
$x_0, x_1, \ldots, x_{2k}$ are equally spaced in [a, b] is never used so that we actually
prove a more general theorem than is stated.


THEOREM 2.1. For each $x \in [a, b]$, there exists a number ξ with a < ξ < b such
that

$$g'(x) = \frac{f^{(2k+2)}(\xi)}{(2k+2)!}. \tag{2.1}$$


Proof. We use the standard technique of repeated application of Rolle’s
theorem. For fixed $x \in [a, b]$, define a new function F on an open interval containing
[a, b] as follows:

$$F(t) = f(t) - p(t) - A(t)\,[g(x) + (t - x)\,g'(x)]. \tag{2.2}$$

We shall need $F^{(2k+2)}$, which exists since f has 2k+2 derivatives. In fact

$$F^{(2k+2)}(t) = f^{(2k+2)}(t) - (2k+2)!\;g'(x). \tag{2.3}$$

To prove this, note that the (2k+2)-nd derivatives of p(t) and A(t) both vanish
since these functions are polynomials of degree less than 2k+2; and $A(t)\cdot(t - x)$
is a monic polynomial of degree 2k+2 and so has (2k+2)-nd derivative (2k+2)!.

Our aim now is to show that F''(t) has at least 2k+1 zeroes in the interval
[a, b]. First, by a simple calculation,

$$F'(t) = f'(t) - p'(t) - A(t)\,g'(x) - A'(t)\,[g(x) + (t - x)\,g'(x)] \tag{2.4}$$

and

$$F''(t) = f''(t) - p''(t) - 2A'(t)\,g'(x) - A''(t)\,[g(x) + (t - x)\,g'(x)]. \tag{2.5}$$

If one now differentiates the identity

$$g(t)\,A(t) = f(t) - p(t)$$

twice and then puts t = x, he will discover that

(A) F'(x) = 0 and

(B) F''(x) = 0 if x is one of the points $x_0, x_1, \ldots, x_{2k}$.
We must now consider two cases: 


CASE 1: x is not one of $x_0, x_1, \ldots, x_{2k}$. In this case, F obviously has 2k+2
roots in [a, b], namely, x itself and the points $x_0, x_1, \ldots, x_{2k}$. By Rolle’s
theorem, F' will have at least 2k+1 roots in [a, b], none equal to x (since x is one
of the 2k+2 roots of F used in the application of Rolle’s theorem). By (A)
above, F' also has x as a root and hence has 2k+2 distinct roots in [a, b]. Now
Rolle’s theorem applied to F' shows that F'' has at least 2k+1 roots in [a, b].


CASE 2: x equals one of $x_0, x_1, \ldots, x_{2k}$. In this case, F has the 2k+1 roots
$x_0, x_1, \ldots, x_{2k}$. Therefore, F' has 2k roots, none equal to x, in [a, b]. But
F'(x) = 0 by (A), so that F' has at least 2k+1 roots in [a, b]. This implies that
F'' has 2k roots, none equal to x, in [a, b]. But F''(x) = 0 by (B) in this case.
Therefore, F'' has 2k+1 roots in [a, b].

We can now prove that $F^{(2k+2)}$ has at least one root ξ in the open interval
(a, b). This follows at once from the fact that F'' has 2k+1 roots in [a, b] by
repeated application of Rolle’s theorem. If now we substitute t = ξ in (2.3), we
get our result. This completes the proof of the theorem.
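
Theorem 2.1 admits a simple exact check when $f^{(2k+2)}$ is constant. The following is an assumed example, not taken from the paper: take n = 2 (so k = 1), nodes 0, 1, 2 and $f(x) = x^4$. Then $f^{(4)} \equiv 24$, and (2.1) forces $g'(x) = 24/4! = 1$ for every x, so g must be a linear function of slope 1. The names `p`, `A` and `g` follow (1.2) and (1.5); the Lagrange form of p is a standard choice made here for convenience.

```python
from fractions import Fraction as F

nodes = [F(0), F(1), F(2)]          # n = 2, so k = 1 and 2k + 2 = 4

def f(x):
    return x ** 4

def p(x):
    """Lagrange form of the interpolating polynomial of f at the nodes."""
    total = F(0)
    for i, xi in enumerate(nodes):
        term = f(xi)
        for j, xj in enumerate(nodes):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total

def A(x):
    out = F(1)
    for xi in nodes:
        out *= (x - xi)
    return out

def g(x):
    return (f(x) - p(x)) / A(x)     # (1.5), valid away from the nodes

# Difference quotients of g between sample points that avoid the nodes.
samples = [F(1, 2), F(3, 2), F(5, 3)]
slopes = [(g(s) - g(t)) / (s - t) for s in samples for t in samples if s != t]
print(slopes)
```

Every difference quotient equals 1 exactly, in agreement with (2.1) for this f.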


3. The scaling function. The fact that the integral of the scaling function
satisfies

$$A^*(t) = \int_a^t A(x)\,dx \ge 0 \tag{3.1}$$

for all $t \in [a, b]$ is required in the next section. Our aim in this section is to prove
this result. We first introduce a change of variables which brings the integral in
(3.1) to a more manageable form. Put

$$x = h(u) = \Big(\frac{b-a}{n}\Big)u + \frac{b+a}{2}$$

if n is even and

$$x = h(u) = \Big(\frac{b-a}{n}\Big)u + \frac{ka + kb + a}{n}$$

if n is odd. Then the increasing linear map h(u) carries either the interval
[−k, k] or the interval [−k, k+1] onto [a, b] according as n is even or odd.
Therefore, if $t \in [a, b]$, we find after a little calculation that

$$A^*(t) = \int_a^t A(x)\,dx = \int_{-k}^{t_1} A(h(u))\cdot h'(u)\,du = \Big(\frac{b-a}{n}\Big)^{2k+2}\int_{-k}^{t_1} A_1(u)\,du \tag{3.2}$$

for some

$$t_1 \in \begin{cases} [-k, k] & \text{for } n \text{ even}\\ [-k, k+1] & \text{for } n \text{ odd,} \end{cases} \tag{3.3}$$

and where

$$A_1(u) = \prod_{i=-k}^{k} (u + i). \tag{3.4}$$

Of course, $A_1(u)$ is the scaling function for the special symmetric case when
$x_0 = -k$ and the points $x_0, x_1, \ldots, x_n$ are spaced one unit apart.

Now $A_1(u) \ge 0$ for $u \ge k$ as the factors in (3.4) are all nonnegative for these
values of u. Therefore, in order to establish (3.1), it suffices by (3.2) and (3.3) to
show that

$$\int_{-k}^{t_1} A_1(u)\,du \ge 0$$

for every $t_1 \in [-k, k]$. The remainder of this section is devoted to a proof of this
latter result. For notational convenience, we drop the subscript on $t_1$, as there is
no longer any danger of confusion. We also put $v_r = -k + 2r$.


LEMMA 3.1. The sign of $A_1(u)$ is determined by the expression $(-1)^s$ where s is
an integer, $0 \le s < 2k$, and $u \in (-k+s, -k+s+1)$.


Proof. It is only necessary to note from (3.4) that for $u \in (-k+s, -k+s+1)$,
there are 2k − s negative factors of $A_1(u)$, and thus the sign of $A_1(u)$ is
$(-1)^{2k-s} = (-1)^s$. This completes the proof.


LEMMA 3.2. For each integer r with $0 \le r \le (k-2)/2$, we have $\int_{v_r}^{t} A_1(u)\,du \ge 0$
whenever $t \in [-k+2r, -k+2r+2]$.


Proof. The proof is immediate from Lemma 3.1 if $t \in [-k+2r, -k+2r+1]$;
hence we can assume $t \in [-k+2r+1, -k+2r+2]$. For t in this interval we have

$$\int_{v_r}^{t} A_1(u)\,du = \int_{v_r}^{v_r+1} A_1(u)\,du + \int_{v_r+1}^{t} A_1(u)\,du.$$

This sum will be ≥ 0 if

$$\Big|\int_{v_r+1}^{t} A_1(u)\,du\Big| \le \int_{v_r}^{v_r+1} A_1(u)\,du.$$

Now,

$$\Big|\int_{v_r+1}^{t} A_1(u)\,du\Big| = \int_{v_r+1}^{t} -A_1(u)\,du$$

since for $u \in (-k+2r+1, -k+2r+2)$, $-A_1(u) > 0$ by Lemma 3.1. Thus, we
need only show

$$\int_{v_r+1}^{t} -A_1(u)\,du \le \int_{v_r}^{v_r+1} A_1(u)\,du$$

for every $t \in (-k+2r+1, -k+2r+2)$. But, since $-A_1(u) \ge 0$ for u in the above
interval, it suffices to show that

$$\int_{v_r+1}^{v_r+2} -A_1(u)\,du \le \int_{v_r}^{v_r+1} A_1(u)\,du.$$

If we make the substitution u = s+1 in the left hand integral above, we are
reduced to proving that

$$\int_{v_r}^{v_r+1} -A_1(s+1)\,ds \le \int_{v_r}^{v_r+1} A_1(s)\,ds.$$

We need only demonstrate that

$$-A_1(s+1) \le A_1(s)$$

for every $s \in (-k+2r, -k+2r+1)$. To prove this, form the ratio

$$\frac{-A_1(s+1)}{A_1(s)} = -\frac{\prod_{i=-k}^{k}(s+1+i)}{\prod_{i=-k}^{k}(s+i)} = -\frac{s+k+1}{s-k} = \frac{s+k+1}{k-s}.$$

Since $-k+2r < s < -k+2r+1$ and $r \le (k-2)/2$, it follows that $s < -1$. This
in turn implies that $2s < -1$. Adding $-s+k+1$ to both sides of this last inequality,
we find that

$$s+k+1 < k-s \qquad\text{or}\qquad \frac{s+k+1}{k-s} < 1$$

as $k-s > 0$. Thus, since $A_1(s) > 0$ for $s \in (-k+2r, -k+2r+1)$ by Lemma 3.1,
we conclude that $-A_1(s+1) < A_1(s)$. This completes the proof of Lemma 3.2.


LEMMA 3.3. We have $\int_{-k}^{t} A_1(u)\,du \ge 0$ for each $t \le 0$.


Proof. Put $t^* = t$ unless k is odd and $t > -1$, in which case put $t^* = -1$. Then
for some $r \le (k-2)/2$, we have

$$\int_{-k}^{t^*} A_1(u)\,du = \int_{-k}^{-k+2} A_1(u)\,du + \int_{-k+2}^{-k+4} A_1(u)\,du + \cdots + \int_{-k+2r}^{t^*} A_1(u)\,du,$$

where $t^* \in [-k+2r, -k+2r+2]$. By Lemma 3.2, each term in the above sum
is nonnegative. Therefore, the same is true of the sum. If k is odd and $t > -1$,
then the inequality for t follows from that for $t^*$, as $A_1(u) \ge 0$ for $u \in [-1, 0]$.
This completes the proof.

We are now in a position to establish our result.


THEOREM 3.4. The integral $\int_{-k}^{t} A_1(u)\,du$ is nonnegative for $-k \le t \le k$.


Proof. The theorem is true when $t \le 0$ by Lemma 3.3, so it will only be
necessary to demonstrate the result for $0 < t \le k$. Since $A_1(u) = u\prod_{s=1}^{k}(u^2 - s^2)$ is an
odd function, we have

$$\int_{-k}^{t} A_1(u)\,du = \int_{-k}^{-t} A_1(u)\,du.$$

Thus, since $t > 0$ implies $-t < 0$, we have $\int_{-k}^{-t} A_1(v)\,dv \ge 0$ by the results of Lemma
3.3. This completes the proof.
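
Theorem 3.4 can be spot-checked in exact arithmetic for small k. The sketch below is illustrative only: the helper names are hypothetical and the grid (steps of 1/8) is an arbitrary choice, so it samples the interval rather than proving anything. It integrates $A_1(u) = \prod_{i=-k}^{k}(u+i)$ from −k to t as a polynomial and reports the smallest value found on the grid.

```python
from fractions import Fraction as F

def poly_mul(p, q):
    out = [F(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_eval(p, x):
    return sum(c * x ** i for i, c in enumerate(p))

def integral_from_minus_k(k, t):
    """Exact integral of A_1(u) = prod_{i=-k}^{k} (u + i) over [-k, t]."""
    A1 = [F(1)]
    for i in range(-k, k + 1):
        A1 = poly_mul(A1, [F(i), F(1)])
    anti = [F(0)] + [c / (j + 1) for j, c in enumerate(A1)]
    return poly_eval(anti, F(t)) - poly_eval(anti, F(-k))

def grid_minimum(k, steps=8):
    """Smallest value of the integral over t = -k, -k + 1/steps, ..., k."""
    ts = [F(-k) + F(j, steps) for j in range(2 * k * steps + 1)]
    return min(integral_from_minus_k(k, t) for t in ts)

for k in (1, 2, 3, 4):
    print(k, grid_minimum(k))
```

The minimum on the grid is 0, attained at the endpoints t = −k and t = k, which is consistent with the theorem and with the oddness of $A_1$.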

We note one final result. Equation (3.2) and the fact that $A_1$ is an odd function
show that

$$A^*(b) = \int_a^b A(x)\,dx = 0 \tag{3.5}$$

for n even.


4. Proof of Theorem 1.1. We are now in a position to derive the standard
Newton-Cotes quadrature formulas with error term for both an odd and even
number of interpolating points.

From the definition of the essential error term g(x) we have f(x) = p(x)
+ A(x)g(x), and hence

$$\int_a^b f(x)\,dx = \int_a^b p(x)\,dx + \int_a^b [A(x)\,g(x)]\,dx. \tag{4.1}$$

The last integral of (4.1) can be integrated by parts. Thus,

$$\int_a^b [A(x)\,g(x)]\,dx = \Big[A^*(x)\,g(x)\Big]_a^b - \int_a^b [A^*(x)\,g'(x)]\,dx, \tag{4.2}$$

where A*(x) is defined by (1.3). Now A*(b) = 0 by (3.5) if n is even; and g(b) = 0
if n is odd from the definitions. Also, A*(a) = 0 by (1.3). Hence the first term of
the right member of (4.2) is zero. By the results of section 3, $A^*(t) \ge 0$ on [a, b]
and so we may apply the mean value theorem for integrals which gives

$$\int_a^b [A^*(x)\,g'(x)]\,dx = g'(X)\int_a^b A^*(x)\,dx \tag{4.3}$$

for some X, $a \le X \le b$. An estimate for g'(X) is given by Theorem 2.1. Hence,

$$\int_a^b [A^*(x)\,g'(x)]\,dx = \frac{f^{(2k+2)}(\xi)}{(2k+2)!}\int_a^b A^*(x)\,dx,$$

where a < ξ < b. Substituting this result in (4.1), we have Theorem 1.1.


This paper consists in part of results appearing in the second named author’s Master’s Thesis 
at the University of Tennessee. 


SYSTEMS OF ROADS WITH COUNTERS
ROBERT KOPP, San Diego State College 


In this paper we shall try to develop some ideas which will lead to a solution 
of problem E 1980 which appeared on page 438 of the April 1967 edition of this 
MONTHLY and was proposed by R. E. Chandler: 

E 1980: “The street plan for a certain town is a generalization of the tic-tac-toe
board with n vertical streets and m horizontal streets (with no vertical
street intersecting the end of a horizontal street and vice versa). An automobile
will start at the end of one (unspecified) street and travel to the end of another.
What is the minimum number of traffic counters (each to be placed in the middle 
of a block) necessary to determine the route of the automobile? The route is to 
have no cycles, and the counters record the time that the automobile passes so 
that the direction of the route is determined when two counters have been 
passed.” 


1. Preliminary definitions and observations. Let G be a connected graph. 
An edge of G, no portion of which, with the exception of one end-point, meets any 
other edge of G will be called an end. 

By a route of G we shall mean a path in G containing exactly two ends. 

A graph G with the property that each edge belongs to some route will be 
called a system of roads. 

In this paper we shall provide a method for determining the minimum num- 
ber of counters necessary to distinguish routes on such graphs and also the 
location of the counters on the system of roads. 

It is no loss of generality to restrict consideration to such graphs. For sup- 
pose G is an arbitrary connected graph. Remove all edges which are not part of 
some route to obtain a new graph G’. G’ is easily seen to be connected and each 
of its edges is a part of some route. Find the minimum number of counters for 
G’ and their position on G’. Then reinsert the edges deleted from G. 

We shall use the word route in two ways, one way as in E1980, and the sec- 
ond way to mean the equivalence classes consisting of pairs of routes lumped 
together without regard to direction of travel. When this is the case we shall 
italicize the word and write it route. 


DEFINITION 1. A spanning tree of a connected graph is a connected subgraph
containing all the vertices of the graph but no cycles.


DEFINITION 2. An edge of the complement of a spanning tree is called a chord.


It is known [1, p. 26] that there are e—v+1 chords associated with a 
spanning tree for a graph, where e and v are the number of edges and vertices 
respectively of the graph. The number e—v-+1 is called the nullity of the graph. 
Some other names for this are the first Betti number or the cyclomatic number. 


DEFINITION 3. A system with counters is called complete if the arbitrary route
of an automobile can be determined by inspecting the counters it has tripped.


DEFINITION 4. A system with k counters will be called minimal if
(i) it is complete,
(ii) if it is also complete with l counters, then k ≤ l.


OBSERVATION 1. If a system is complete and an edge in the system has two
counters on it, then without altering the completeness of the system one of those 
counters may either be placed on a vacant end or discarded. 

To see that this is so, consider all distinct routes passing through the given 
edge with two counters on it. Since the system is complete, all but possibly one
of these routes must have tripped at least one counter in addition to the two on 
the given edge. Consequently the only need for two counters on this edge is to 
determine the direction along the exceptional route. In the event there is no 
exceptional route one of the two counters may be discarded without altering the 
completeness of the system. 

This argument also provides a proof of our next observation. 


OBSERVATION 2. If a system is minimal and an edge has two counters on it, 
there is one and only one route containing this edge and exactly two counters. 


OBSERVATION 3. In a minimal system no edge has more than two counters. 
This follows directly from Observation 2. 


2. The fundamental principle for systems of roads with counters. In order to
prove the Fundamental Principle we shall need the following:


LEMMA 1. Suppose an edge AB, not an end, in a minimal system has a counter
on it. Then either there is a simple curve $\theta_{AB}$ in the system with ends at A and B and
no counters or else the counter on AB is used only to distinguish direction along some
unique route and so may be placed at an end.


Proof: The counter is necessary if the system is minimal. Consequently it
must be used to distinguish two routes θ and φ. These two routes have either the
same edges throughout, in which case it is easily seen by completeness all such
pairs must have at least one common end, or else there is a segment in θ which
is not on φ. In the first case the counter on AB must be used to distinguish
direction of travel and so there is at most one counter in addition to the one on
AB along θ and φ and so the counter may be placed at an end. Let us consider the
second case. By completeness we know θ and φ must both have a counter along
them besides one on AB. Since these routes are indistinguishable except by the
counter at AB, any counter on θ and not on φ could only be on AB. Now start a
route at the beginning of θ toward A and travel along until we just pass the last
counter common to θ and φ. (θ and φ must cross or one of them would have no
counter. We may suppose they cross before A.) Call this point X. Some place
between X and A, θ and φ diverge since they are routes not containing the same
edges and are distinguished only by a counter at AB. Call this point Y. Now
continue along θ to point A. Then θ and φ do not cross (by completeness) on θ
between Y and B since there can be no counters along here on either θ or φ (if so
this would obviate the need of a counter on AB). After leaving B they must join
once more at a point Z. (If they didn’t, they would go to different ends, say θ to
HK and φ to MN. But then the route from HK back on θ across BA to Y then
forward along φ to MN would have only one counter on it at AB since the
counter at AB is necessary to distinguish θ from φ.) Again by reasoning similar
to that above there is no counter on the section of θ from B to Z. Consequently
the path A to Y along θ, Y to Z along φ and Z to B along θ is the simple curve
$\theta_{AB}$.
It is an immediate consequence of the lemma that we may arrange a minimal
system (with at least two ends), if we so desire, so that both

(1) Given any two nodes A and B there is a simple curve $\theta_{AB}$ with no
counters joining A and B.

(2) Each end has a counter and the only counters distinguishing direction 
are on ends. 

To see (1), simply observe that the system is connected so it is possible to get
from A to B by a finite number of edges


    $AA_1,\; A_1A_2,\; \ldots,\; A_nB.$


But by the lemma we may assume either $A_kA_{k+1}$ has no counter on it or else we
may bypass it with a curve $\theta_{A_kA_{k+1}}$ without counters. The curves $\theta_{A_kA_{k+1}}$ and
$\theta_{A_{k+1}A_{k+2}}$ cannot intersect except at $A_{k+1}$ or in an arc, because of completeness of
the system. (2) is an immediate consequence of (1).

The possibility of arranging a minimal system so (1) and (2) are satisfied 
will be called The Fundamental Principle of Systems of Roads with Counters, and 
we shall henceforth assume that all minimal systems are so arranged. (Note: if a 
system is minimal and has more than two ends there must be a counter at every 
end.) 


THEOREM 1. The minimum number of counters required to distinguish the
routes in a system of roads S is equal to the number of ends of the system plus the
nullity of the system. If a counter is placed on each end of S and on each chord for
some fixed spanning tree we obtain a minimal system.


Proof: We know we may assume a minimal system S has a counter at each
end. In addition there must be a counter on at least one edge of every cycle. Let
$\Gamma$ be the set of edges without counters. Then $\Gamma$ is a tree of the system S and thus
the number of edges with counters must be at least equal to the nullity of S.
Thus the number of counters must be at least equal to the number of ends plus
the nullity of S.

Conversely, let T be a spanning tree of S and put a counter on each end of S
and on each chord of T (hence, number of counters = number of ends plus
nullity). Then S is complete. For if these counters did not distinguish two routes
$R_1$ and $R_2$ then the set of lines belonging to either $R_1$ or $R_2$ but not both contains
a cycle without counters. Yet this is a subset of T, which is a contradiction.


3. Solution to E1980. In Figure 1 (p. 1076) we have an $m \times n$ system and in
Figure 2 a spanning tree. From this we see the nullity of the system is
$(n-1)(m-1)$. There are $2n+2m$ ends in the system. Consequently a minimal system
must have $2n+2m+(n-1)(m-1) = (n+1)(m+1)$ counters. Such a system is
illustrated in Figure 3.
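
The count in Theorem 1 can be checked mechanically. The sketch below is only an illustration under an assumption: since Figure 1 is not available here, the $m \times n$ system is modeled as an n-by-m grid of junctions with one pendant end road attached at each boundary position (giving $2n+2m$ ends). The function names `grid_system`, `nullity`, and `minimum_counters` are hypothetical.

```python
def grid_system(n, m):
    """Return (inner_edges, end_edges) for an assumed n-by-m grid road system."""
    inner = []
    for i in range(n):
        for j in range(m):
            if i + 1 < n:
                inner.append(((i, j), (i + 1, j)))
            if j + 1 < m:
                inner.append(((i, j), (i, j + 1)))
    ends = []
    for j in range(m):  # pendant roads leaving the top and bottom rows
        ends.append((("end", "top", j), (0, j)))
        ends.append((("end", "bottom", j), (n - 1, j)))
    for i in range(n):  # pendant roads leaving the left and right columns
        ends.append((("end", "left", i), (i, 0)))
        ends.append((("end", "right", i), (i, m - 1)))
    return inner, ends


def nullity(edges):
    """Number of chords of a spanning forest, counted with union-find."""
    parent = {}

    def find(v):
        parent.setdefault(v, v)
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    chords = 0
    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            chords += 1  # this edge closes a cycle
        else:
            parent[ru] = rv
    return chords


def minimum_counters(n, m):
    """Theorem 1: number of ends plus nullity."""
    inner, ends = grid_system(n, m)
    return len(ends) + nullity(inner + ends)
```

Under this model `minimum_counters(n, m)` agrees with the closed form $(n+1)(m+1)$ for small grids.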
Acknowledgements. I should like to thank Professors Edgar Howard, Daniel Saltz, and Nick
Morez for the suggestions they made. I am also most grateful to the referee for considerably
simplifying this paper.

Note. Problem E 1980 was solved by others after Professor Kopp’s article was accepted. See the
October MONTHLY, page 883. Editor.

COMPLEMENTS AND COMMENTS
DAVID DRASIN, Purdue University


1. The end of another volume of Mathematical and Classroom Notes 
prompts the annual review of correspondence based on material we have pub- 
lished. I wish to thank those readers who have communicated the information 
below. 


2. Elementary mathematics (material covered normally during the first
two undergraduate years). W. C. Waterhouse has observed that R. P. Boas’s
proof of L’Hospital’s rule (Nov. 1969, p. 1051) is essentially the same as one given
by F. Lettenmeyer in J. Reine Angew. Math., 174 (1935) 246. 

M. V. Subbarao in the September 1968 MONTHLY, p. 772, gave an elementary 
demonstration of the fact that if $n$ is a natural number, then $\sqrt{n}$ is either an
integer or irrational. At the end of the paper, he asked whether the method could 
be extended to the kth roots, for k>2. In fact, I. Niven and E. A. Maier had 
surmounted this difficulty in a paper which appeared in MATHEMATICS 
MAGAZINE, 1964, pp. 208-210. 


3. Algebra. Richard Singer, in the December 1969 issue, p. 1131, gave proofs 
of the standard theorems of Gauss and Eisenstein, using the natural morphism 
$Z[X] \to Z_p[X]$. That proof of Gauss’s theorem also appeared in Richard A.
Dean’s book Elements of Abstract Algebra. Dean comments that Singer’s proof
of the irreducibility criterion is less general than the traditional one, in that the
last line of Singer’s proof does not go through for an arbitrary UFD. For if D
is a UFD and $p$ is a prime in D, then $D_p = D/(p)$ need not be a UFD; a common
counterexample is $D = Z[X]$ and $p = X^2+5$.
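
To see why this quotient fails to be a UFD, one can identify $Z[X]/(X^2+5)$ with the numbers $a + b\sqrt{-5}$ and use the multiplicative norm $a^2 + 5b^2$. The following is a minimal sketch of the standard argument (the helper names are hypothetical, not from Dean or Singer): 6 factors in two ways, and no element has norm 2 or 3, so none of the factors can be split further.

```python
def norm(a, b):
    """Norm of a + b*sqrt(-5), the class of a + bX in Z[X]/(X^2 + 5)."""
    return a * a + 5 * b * b


def mul(u, v):
    """Multiply (a + b*sqrt(-5)) * (c + d*sqrt(-5)) using sqrt(-5)^2 = -5."""
    (a, b), (c, d) = u, v
    return (a * c - 5 * b * d, a * d + b * c)


def has_element_of_norm(target):
    """Search the finitely many candidates with a^2 + 5 b^2 == target."""
    bound = int(target ** 0.5) + 1
    return any(
        norm(a, b) == target
        for a in range(-bound, bound + 1)
        for b in range(-bound, bound + 1)
    )


# Two factorizations of 6.
six_a = mul((2, 0), (3, 0))    # 2 * 3
six_b = mul((1, 1), (1, -1))   # (1 + sqrt(-5)) * (1 - sqrt(-5))

# The factors have norms 4, 9, 6, 6; a proper divisor would need norm 2 or 3.
irreducible_factors = not has_element_of_norm(2) and not has_element_of_norm(3)
```

Since the units are exactly the elements of norm 1 (namely ±1), the two factorizations are genuinely different.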

Since C. E. Linderholm’s article “A group epimorphism is surjective” (Feb. 
1970, p. 176) appeared, the author discovered that W. Burgess had already 
published this fact in the Canadian Math. Bulletin, 1965, pp. 759-769. 

Peter Yff calls attention to some facts about finite groups which may have 
been overlooked. Presumably the results are due to A. Speiser, since they appear 
in his book Die Theorie der Gruppen von endlicher Ordnung, (3rd ed., Dover,
1943). Part (i) of Theorem 1 in “A generalization of Frobenius’s theorem” by 
H. G. Bray is covered in Speiser’s book (pp. 112-113), albeit not stated as a 
theorem; the fact also appears on p. 62 of I. N. Herstein’s text Topics in Algebra. 
The remainder of Theorem 1 is covered by the following general result which 
appeared on p. 117 of Speiser’s book: If H is a subgroup of index n in a finite
group G, and if N is the largest normal subgroup of G contained in H, then the index
of N in H is a divisor of n!.

Many of the results in J. A. Eagon’s paper “Finitely generated domains over 
Jacobson semi-simple rings are Jacobson semi-simple” were obtained earlier 
by R. Gilmer (Pacific Journal, 1966, pp. 275-284). 

Gilmer also observes that it is possible to give many examples of irreducible 
polynomials which are reducible mod $p$ for all $p$; M. A. Lee had given some in
the December 1969 issue, p. 1125. For example (see article by W. J. Guerrier,
Jan. 1968, p. 46) if $m$ is a positive integer such that the multiplicative group of
units of $Z/(m)$ is not cyclic, then the mth cyclotomic polynomial $F_m$ is irreducible
in $Z[X]$ but reducible in $(Z/(p))[X]$ for each prime $p$. Those positive integers
for which the multiplicative group of units of Z/(m) is cyclic are known; cf. 
exercises 4-6 of vol. 2 of van der Waerden’s Modern Algebra (English ed. pp. 
114-115). 
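
As a concrete instance, take m = 8: the units of $Z/(8)$ form a non-cyclic group, and the 8th cyclotomic polynomial is $x^4+1$, which is irreducible over the integers. The sketch below (a brute-force search with a hypothetical function name, not Gilmer's or Guerrier's argument) looks for a factorization of $x^4+1$ into two monic quadratics modulo a prime p by comparing coefficients.

```python
def quadratic_factorization_mod_p(p):
    """Return (a, b, c, d) with (x^2+ax+b)(x^2+cx+d) == x^4+1 (mod p), else None."""
    for a in range(p):
        c = (-a) % p  # the x^3 coefficient a + c must vanish
        for b in range(p):
            for d in range(p):
                if ((b + d + a * c) % p == 0          # x^2 coefficient
                        and (a * d + b * c) % p == 0  # x coefficient
                        and (b * d) % p == 1):        # constant term
                    return a, b, c, d
    return None


SMALL_PRIMES = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31]
factorizations = {p: quadratic_factorization_mod_p(p) for p in SMALL_PRIMES}
```

For every prime in the list the search succeeds; for instance, modulo 5 one has $x^4+1 = (x^2+2)(x^2+3)$.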

The note by R. Datko and V. Seshadri entitled “A characterization and a
canonical decomposition of Hurwitzian matrices,” vol. 77, August-September
1970, pp. 732-733, is similar to a result of Olga Taussky Todd which appeared
in the Jour. of Math. Anal. and Appl. (vol. 2, #1, Feb. 1961, pp. 105-107).
Prof. Todd’s result proves that every real stable matrix is unitarily similar to a
matrix of the form $(-I+S)P$, where S is skew symmetric and P is a diagonal
matrix with positive elements. Thus in the case of real matrices, since the matrix
$SP$ is similar to the matrix $P^{1/2}SP^{1/2}$, the results of Datko and Seshadri
follow as a consequence. The extension to the complex case is routine. The 
method of proof used in the two papers is different, except that the starting 
point is a theorem of A. M. Lyapunov (see, e.g., Gantmacher, F. R., Theory of 
Matrices, vol. II). 


4. Topology. P. M. Rice reports that there is a gap in the proof of Lemma 1
of his paper “A topological characterization of the real numbers” (Feb. 1969, 
p. 184); indeed he has a counterexample. The main result of the paper is, of 
course, correct; however, the short proof presented by Rice now seems in 
question. 

Theorem 2 of N. Levine’s article “Dense topologies” (Oct. 1968, pp. 847-852) 
is also incorrect, but for typographical reasons; the statement of the theorem 
should be “every dense subspace of a D-space is a D-space.” The slip was pointed 
out to us by Sabah A. Ghullam. 

Some time ago (March 1967, pp. 261-266) A. Wilansky wrote “Between $T_1$
and $T_2$.” The main results of J. E. Joseph’s article (Dec. 1969, p. 1125) are con-
tained in Wilansky’s note. 


5. Number theory. In the May 1970 issue, pp. 510-512, G. J. Simmons
showed that given $r$ and $N$, there exist infinitely many integers $m \ge r$ such that


    (A)  $\left(\binom{m}{r},\, N\right) = 1.$


However, it is possible to completely characterize those m for which (A) holds,
using the article “On periodicities of certain sequences of residues” of W. F.
Trench, which appeared in this MONTHLY, August-September 1960, pp. 652-656.
Trench offers the following procedure. Define $N_0 = \prod_{i=1}^{n} p_i$, where $p_1, \ldots, p_n$
are the distinct prime factors of N. Then (A) holds if and only if


    $\left(\binom{m}{r},\, N_0\right) = 1,$


which is in turn equivalent to


    $\binom{m}{r} \equiv g \pmod{N_0}$, where $(g, N_0) = 1$.


Lemma 1 of Trench’s paper states that the sequence of residues


    $\binom{m}{r} \bmod N_0, \qquad m = \ldots, -1, 0, 1, \ldots$

is periodic with fundamental period

    $P = \prod_{i=1}^{n} p_i^{s_i},$


where $s_1, \ldots, s_n$ are integers such that $p_i^{s_i-1} \le r < p_i^{s_i}$ $(1 \le i \le n)$. Therefore an
integer m satisfies (A) if and only if it is of the form $m = m_0 + kP$, where k is an
integer, $r \le m_0 \le P-1$ and


    $\left(\binom{m_0}{r},\, p_i\right) = 1 \quad (1 \le i \le n).$
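
The characterization can be tried out numerically. The sketch below is an illustration only: it assumes $r \ge 1$, restricts attention to $m \ge r$ (so that the ordinary binomial coefficient applies), and uses hypothetical helper names. It computes the period P from the prime factors of N and compares condition (A) with membership of m mod P in the set of admissible residues $m_0$.

```python
from math import comb, gcd


def prime_factors(n):
    """Distinct prime factors of n by trial division."""
    factors, p = [], 2
    while p * p <= n:
        if n % p == 0:
            factors.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        factors.append(n)
    return factors


def trench_period(r, N):
    """P = product of p**s over primes p | N, where p**(s-1) <= r < p**s."""
    P = 1
    for p in prime_factors(N):
        q = p
        while q <= r:
            q *= p
        P *= q
    return P


def admissible_residues(r, N):
    """All m0 with r <= m0 <= P - 1 and gcd(C(m0, r), N) == 1."""
    P = trench_period(r, N)
    return {m0 for m0 in range(r, P) if gcd(comb(m0, r), N) == 1}


def satisfies_A(m, r, N):
    """Condition (A): the binomial coefficient C(m, r) is prime to N."""
    return gcd(comb(m, r), N) == 1
```

For example, with r = 5 and N = 12 the period is $P = 2^3 \cdot 3^2 = 72$, and m = 77 satisfies (A) because $77 \equiv 5 \pmod{72}$ and $\binom{5}{5} = 1$.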


6. Analysis. In 1947, R. Arens (Bull. AMS, p. 623) proved that a normed
algebra in which $\|xy\| = \|x\|\,\|y\|$ is either the reals, complexes, or quaternions.
This generalises the celebrated Gelfand-Mazur theorem and contains as a special 
case the note of Howard A. Seid (March 1970, p. 282); this reference was noted 
by Enzo R. Gentile. 

Gerald A. Heuer observes that a paper which he and some undergraduates 
published in the April 1965 MONTHLY, pp. 370-373, contains those results in
J. E. Nyman’s note “An application of diophantine approximation” (July 1969, 
pp. 668-671). Later, Heuer himself (April 1966, pp. 378-379) improved his 
earlier results. 

R. P. Boas calls my attention to the article “Limits of integrals” by R. P. 
Agnew (Duke Math. J., March 1942, pp. 10-19). Agnew’s result allows an immediate
generalization of the theorem of my paper “An Application of Tauberian
Methods to a Problem in Series” (Feb. 1970, pp. 152-156). For a suitable
function $f(t)$ $(1 \le t < \infty)$, I let


    $W(x) = \int_1^x f(t)\,dt$


and consider consequences of the assumption that $\{W(\sigma x) - W(x)\}$ tends to a
limit (as $x \to \infty$) for each fixed $\sigma > 1$. Agnew shows that my deductions can be
derived even under the apparently weaker hypotheses that $\{W(\sigma x) - W(x)\}$
tends to a limit (as $x \to \infty$) for a set of $\sigma$ of positive Lebesgue measure or even
for two appropriate values of $\sigma$.

A problem left open in “Nets and sequences, an example” was later resolved 
by the same author, W. M. Priestly, in a paper which has recently appeared in 
the Proceedings of the A. M.S. (Feb. 1970). 

An example of a series $\sum f_n(x)$ which converges absolutely and uniformly,
but, for some rearrangement, fails to converge uniformly, was given by D. J.
Harris and B. L. D. Thorp in the September 1969 issue, pp. 801-2. P. Turán
reports that a related result has appeared in Matematikai Lapok Fasc., 1 (1953)
p. 36; namely that there exist analytic functions $\varphi_n(z)$ in $\{|z| < 1\}$ with the
property that $\sum \varphi_n(z)$ converges absolutely but not uniformly. The examples
were given by Ákos Császár and Cathleen Rényi. Rényi’s example was particularly
simple: $\varphi_n(z) = \tfrac12(1-z)\,[\tfrac12(1+z)]^n$.

And finally, we record that the theorem given by Roger Horn (Jan. 1970, 
pp. 65-66) on the Wronskian test for linear independence has a long past. One 
reference, supplied by John Baxley, is “On the Wronskian tests for linear dependence”
by Maxime Bôcher (Annals of Math., Ser. 2, v. 17 (1915-16), pp. 167-
168). Bôcher himself references earlier papers, and indicates that Peano was
aware that for a family of analytic functions, nonvanishing of the Wronskian
becomes a necessary and sufficient test for linear independence.


EVALUATION OF SOME INTEGRALS BY CONTOUR INTEGRATION 


M. LUTZKY, U.S. Naval Ordnance Laboratory


In this note we evaluate some real integrals by contour integration, using a 
method involving the integration of certain logarithmic functions over portions 
of the unit circle. We shall use the fact that on the unit circle 


(1)  $\log(1+z) = \log(2\cos\tfrac12\theta) + \tfrac12 i\theta, \quad (-\pi < \theta < \pi),$
and
(2)  $\log(1-z) = \log(2\sin\tfrac12\theta) + \tfrac12 i(\theta-\pi), \quad (0 < \theta < 2\pi).$


For our first case, we integrate the function $\log(1+z)/iz$ about the closed
contour consisting of the real axis between 0 and 1, the imaginary axis between
0 and $i$, and that portion of the unit circle lying in the first quadrant. Since there
are no singularities within the contour, Cauchy’s theorem yields


(3)  $i\int_0^1 \log(1+x)\,dx/x = \int_0^{\pi/2} \{\log(2\cos\tfrac12\theta) + \tfrac12 i\theta\}\,d\theta + i\int_0^1 \log(1+iy)\,dy/y.$


We have
(4)  $\log(1+iy) = \tfrac12\log(1+y^2) + i\arctan(y),$


and a simple change of variable shows that
(5)  $\int_0^1 \log(1+y^2)\,dy/y = \tfrac12\int_0^1 \log(1+x)\,dx/x.$
Using (4) and (5), we find that the imaginary part of (3) yields
(6)  $\int_0^1 \log(1+x)\,dx/x = \pi^2/12,$
while the real part gives the result
(7)  $\int_0^{\pi/2} \log(2\cos\tfrac12\theta)\,d\theta = \int_0^1 (\arctan y)\,dy/y.$

Similarly, we can integrate $\log(1-z)/iz$ about the same contour, adding an
indentation at $z=1$ to exclude the singular point. The integral around the
indentation goes to zero as the indentation shrinks, so that Cauchy’s theorem, (2),
and (6) yield


(8)  $\int_0^1 \log(1-x)\,dx/x = -\pi^2/6$
and
(9)  $\int_0^{\pi/2} \log(2\sin\tfrac12\theta)\,d\theta = -\int_0^1 (\arctan y)\,dy/y.$


The results (6) and (8) are well known, although they are not usually arrived 
at by contour integral methods. Furthermore, upon addition of (7) and (9), 
another familiar result is obtained: 


(10)  $\int_0^{\pi/2} \log(2\sin\theta)\,d\theta = 0.$

For our next example, we integrate $[\log(1-z)]^p/iz$ for p=1, 2, and 3 about
the unit circle, indented at $z=1$. The integral around the indentation vanishes
in the limit, and Cauchy’s theorem gives


(11)  $\int_0^{2\pi} [\log(2\sin\tfrac12\theta) + \tfrac12 i(\theta-\pi)]^p\,d\theta = 0.$


When p=1, the real part of (11) yields an integral equivalent to (10). When
p=2, relation (10) and the imaginary part of (11) yield


(12)  $\int_0^{2\pi} \theta\log(2\sin\tfrac12\theta)\,d\theta = 0,$
while the real part gives
(13)  $\int_0^{\pi} [\log(2\sin\tfrac12\theta)]^2\,d\theta = \pi^3/12.$
When p=3, relation (13) and the imaginary part of (11) yield
(14)  $\int_0^{2\pi} \theta\,[\log(2\sin\tfrac12\theta)]^2\,d\theta = \pi^4/6.$
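
The closed forms (6), (8), (10), (13), and (14) are easy to sanity-check numerically. The following sketch is not part of the contour argument; it simply applies a composite midpoint rule (chosen here because it never evaluates the integrand at the endpoints, where the logarithms are singular) and compares with the stated values. The names `midpoint`, `L`, and `checks` are hypothetical.

```python
import math


def midpoint(f, a, b, n=200_000):
    """Composite midpoint rule on [a, b]; endpoints are never evaluated."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def L(theta):
    """log(2 sin(theta/2)), the real part of log(1 - z) on the unit circle."""
    return math.log(2.0 * math.sin(0.5 * theta))


# Each entry is (numerical value, closed form from the text).
checks = {
    "(6)": (midpoint(lambda x: math.log1p(x) / x, 0.0, 1.0), math.pi ** 2 / 12),
    "(8)": (midpoint(lambda x: math.log1p(-x) / x, 0.0, 1.0), -math.pi ** 2 / 6),
    "(10)": (midpoint(lambda t: math.log(2.0 * math.sin(t)), 0.0, math.pi / 2), 0.0),
    "(13)": (midpoint(lambda t: L(t) ** 2, 0.0, math.pi), math.pi ** 3 / 12),
    "(14)": (midpoint(lambda t: t * L(t) ** 2, 0.0, 2 * math.pi), math.pi ** 4 / 6),
}

for label, (numeric, exact) in checks.items():
    print(label, round(numeric, 4), round(exact, 4))
```

Because of the logarithmic endpoint singularities the midpoint rule converges slowly, so agreement should only be expected to a few decimal places.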
Unlike the previous cases, however, the real part of (11) with p=3 does not
yield a particularly simple result. After using (10) and (12) to simplify the
resulting expression, we end up with


(15)  $4\int_0^{2\pi} [\log(2\sin\tfrac12\theta)]^3\,d\theta = 3\int_0^{2\pi} \theta^2\log(2\sin\tfrac12\theta)\,d\theta.$

Integrals of the form $\int_0^{\pi} [\log(2\sin\tfrac12\theta)]^p\,d\theta$ have been studied by Bowman [1]
and Beumer [2] using real-variable methods, and it can be shown that they are
expressible in terms of the Riemann zeta function of integral argument. For
instance, if we substitute the Fourier series


(16)  $\log(2\sin\tfrac12\theta) = -\cos\theta - \tfrac12\cos 2\theta - \cdots$


into the right hand side of (15), we obtain
(17)  $2\int_0^{\pi} [\log(2\sin\tfrac12\theta)]^3\,d\theta = -3\pi\sum_{k=1}^{\infty} k^{-3} = -3\pi\zeta(3).$


To introduce our last example, we point out that the integrands of (14) and
(12) have the form $\theta f(\cos\theta, \sin\theta)$. Clearly, the efficacy of the method in dealing
with integrals of this type arises from the fact that the imaginary parts of (1)
and (2) are linear functions of $\theta$. For our final example, we study another integral
of this kind in which, however, the function $f(\cos\theta, \sin\theta)$ is not logarithmic.
We define $g(z) = a(z-1)(z+1)/[i(z-a)(az-1)]$ for $-1<a<1$, and integrate
$\log(1+z)g(z)/z$ about the unit circle, indented to exclude the point $z=-1$. The
only contribution to the integral comes from the simple pole at $z=a$, where the
residue is $-i\log(1+a)$; consequently Cauchy’s theorem yields, using (1):


(18)  $\int_{-\pi}^{\pi} [\log(2\cos\tfrac12\theta) + \tfrac12 i\theta]\,g(e^{i\theta})\,d\theta = -2\pi i\log(1+a).$


It is easily shown that g(z) is real-valued on the unit circle; specifically,


(19)  $g(e^{i\theta}) = -\dfrac{2a\sin\theta}{a^2 - 2a\cos\theta + 1}.$


Therefore, the imaginary part of (18) gives the integral


(20)  $\int_{-\pi}^{\pi} \dfrac{\theta\sin\theta}{a^2 - 2a\cos\theta + 1}\,d\theta = \dfrac{2\pi}{a}\log(1+a).$
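
Formula (20) can also be checked numerically for particular values of a. The sketch below is a plain midpoint-rule comparison, assuming $0 < |a| < 1$; the function names are hypothetical. It reads (20) as the integral of $\theta\sin\theta/(a^2 - 2a\cos\theta + 1)$ over $(-\pi, \pi)$ with value $(2\pi/a)\log(1+a)$.

```python
import math


def lhs_20(a, n=20_000):
    """Midpoint-rule value of the integral in (20) over (-pi, pi)."""
    h = 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        t = -math.pi + (i + 0.5) * h
        total += t * math.sin(t) / (a * a - 2.0 * a * math.cos(t) + 1.0)
    return h * total


def rhs_20(a):
    """Closed form (2*pi/a) * log(1 + a), valid for 0 < |a| < 1."""
    return 2.0 * math.pi / a * math.log1p(a)


for a in (0.5, -0.3):
    print(a, lhs_20(a), rhs_20(a))
```

As a rough consistency check, expanding both sides for small a gives $2\pi - \pi a + \cdots$ in each case.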


This integral previously was treated by complex integration by Lindelöf [3],
who integrated the function $z/(a - e^{-iz})$ around the rectangular contour with
vertices at $-\pi$, $\pi$, $\pi + iR$, and $-\pi + iR$. His method is somewhat more cumbersome
than the one presented here, involving as it does the evaluation of subsidiary
integrals along the sides of the contour, as well as requiring the passage to the
limit $R \to \infty$.
EXISTENCE OF FOUR CONCURRENT NORMALS
TO A SMOOTH CLOSED CURVE


NARSINGH DEO, California Institute of Technology, and M. S. KLAMKIN, Ford Motor Company


In a rather elegant lecture on calculus, Guggenheimer [1] establishes, among
other related theorems, the existence of a point from which at least four nor- 
mals can be drawn to an oval. Two such points are the mass centroid and the 
curvature centroid. Chakerian and Stein [2] have shown that another four 
normal point is the perimeter centroid. The latter proof is by a simple continu- 
ity argument whereas the proofs given by Guggenheimer cannot (at least up to 
now) be obtained the same way. In this note, we give a simpler intuitive geo- 
metric proof for the existence of a four normal point which can be used in ele- 
mentary courses dealing with continuity methods in convexity or geometry 
(e.g., a la Yaglom and Boltyanskii [3]). 

For an oval (smooth, convex curve), it is well known that there exists at 
least one largest chord and, furthermore, this chord is perpendicular to the 
curve at each end. 


FIG. 1


We now show that the midpoint O of a largest chord AC has the desired
property. If ABC in Fig. 1 is circular, our proof is over. If ABC is not circular, 
then there must exist points whose distance from O is either larger or smaller than 
AO. The point (or points) P which maximize or minimize the radial distance 
OP will be such that OP is normal to the curve. Geometrically, if we consider 
the family of concentric circles centered at O, there will exist at least one circle 
which is tangent to open arc ABC. Since a similar argument applies to the other 
open arc CDA, our proof is completed. 

By a similar argument, one can show that another four normal point is the 
midpoint of a chord of minimal width. 

For the case of C? closed nonconvex curves, a largest chord which is per- 
pendicular to the curve at each end still exists. Consequently, the midpoint O 
will still be a four normal point provided that it falls inside the curve. If O falls 
on the boundary, then possibly only three distinct normals can be drawn from 
O. However, there are then at least four points on the curve such that the normals
at these points are concurrent (note that if the curve touches itself at O,
this point is counted twice). If O falls outside the curve, it is possible that two 
of the normals are on the same ray. Whether or not these two normals are to be 
considered distinct is a matter of definition. 


FIG. 2


For the case (Fig. 2) when O falls on the curve and the curve is also simple 
and contains no circular or straight segments, then for points O’ of AC, suffi- 
ciently close to O, there exists a normal from O’ to the open arc ABC. For the 
case illustrated, being sufficiently close means that if OE is normal to the curve
and OE < OA, then O’E must be < O’A. If OE > OA, then O’E must be > O’C
(and in this case the extension of circular arc DE will intersect the open arc
EBC). Also, there exists a normal from O’ to the portion of the arc ADC contained
in a circle of center O’ and radius OO’ + ε, but here O’ must satisfy
OO’ < OA/2. Hence, we have a segment of four normal points. The same also
applies for an oval not containing any circular arcs. It now follows easily 
that the circle is the only oval with a unique “four-normal” point. Note that if 
the midpoint O of the maximum length chord AC turns out to be the only point 
on AC which is a four-normal point (e.g., if ABC was a semi-circle), then we 
would consider points on the minimum width chord. 


Acknowledgment: The authors are grateful to Professor Guggenheimer for his suggestions 
improving the style of the original note as well as extending its scope. 
A COMBINATORIAL LEMMA INVOLVING A DIVERGENCE
CRITERION FOR SERIES OF POSITIVE TERMS 


K. A. Post, Technological University Eindhoven, The Netherlands 

In this paper we shall give sufficient conditions for a mapping f of the set of
positive integers into itself in order to ensure that a series $\sum_{n=1}^{\infty} a_n$ diverges
whenever $0 < a_n \le a_{n+1} + a_{f(n)}$ for all n.

This problem came up around 1964 when Graham and Jewett showed that
$f(n) = n(n+1)$ $(n = 1, 2, 3, \ldots)$ has the desired property. They were also able
to show roughly that a sufficiently regular function f that grows “as fast as” $\tfrac12 n^2$
also must have this property. This result is not published since the method
they used was rather complicated (private communication). Later on, Erdős
pointed out that the condition $f(n) \ge n^2/(2+\varepsilon)$ ($\varepsilon$ being a fixed positive number)
is not sufficient for divergence, by taking $a_n = 1/(n \log n\,(\log\log n)^{1+\delta})$ for
sufficiently small $\delta = \delta(\varepsilon) > 0$. Around 1965 D. J. Newman showed that the condition
$f(n) = n^2$ $(n = 1, 2, 3, \ldots)$ implies divergence.

We shall prove the following theorem: 

THEOREM. Let f be a mapping of the set of positive integers into itself satisfying
the difference condition


(D)  $f(n+1) - f(n) \ge n+1 \quad (n = 1, 2, 3, \ldots).$
Then any series $\sum_{n=1}^{\infty} a_n$, for which
(1)  $0 \le a_n \le a_{n+1} + a_{f(n)} \quad (n = 1, 2, 3, \ldots)$
is divergent or $a_n = 0$ for all $n \ge 2$.
The proof of this theorem is based essentially upon the following lemma:


LEMMA. Let f satisfy the difference condition (D), and let k be an integer $\ge 3$.
Consider the tree


                       k                           level 0
                  /         \
               k+1           f(k)                  level 1
              /   \         /     \
           k+2   f(k+1)  f(k)+1   f(f(k))          level 2
            .  .  .  .  .  .  .  .  .  .  .  .


Then for every $t \ge 1$ all of the $2^t$ numbers at level t are distinct.
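
The tree in the lemma is easy to generate level by level: each number n on one level contributes the two numbers n+1 and f(n) to the next. The sketch below is an illustration with hypothetical function names, using the particular choice $f(n) = n(n+1)/2$, which satisfies (D) with equality; it is one admissible f, not the only one.

```python
def f(n):
    """A sample map satisfying (D): f(n+1) - f(n) = n + 1."""
    return n * (n + 1) // 2


def tree_levels(k, depth, f=f):
    """Levels 0..depth of the tree with root k; each n has children n+1 and f(n)."""
    level = [k]
    levels = [level]
    for _ in range(depth):
        nxt = []
        for n in level:
            nxt.append(n + 1)
            nxt.append(f(n))
        level = nxt
        levels.append(level)
    return levels


def all_levels_distinct(k, depth):
    """True if every level t <= depth consists of 2**t distinct numbers."""
    return all(
        len(set(level)) == 2 ** t
        for t, level in enumerate(tree_levels(k, depth))
    )
```

With root k = 3 the first levels are [3], [4, 6], [5, 10, 7, 21]. The hypothesis $k \ge 3$ matters: for this f and k = 2, level 1 is [3, 3].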
Proof of the Lemma. From the difference condition (D) on f it follows that for
every n


(2)  $f(n) \ge \sum_{j=1}^{n} j = \tfrac12 n(n+1).$


Hence, $f(k) > k+1$, so that level 1 contains two distinct numbers. Now let t be
the smallest number such that level t contains a number, q say, twice. Then
level $t-1$ contains two numbers, r and s say, such that


(3)  $r+1 = f(s) = q.$


Now we trace back in the tree from r up to the root k. Each element in this path
is obtained from the preceding one by subtraction of unity or by taking inverse
f-image. Since by (D) we have $f(s) - f(s-1) \ge s$, each of the first $s-1$ steps in
this back-tracing process must be subtraction of unity. On the other hand, the
back-tracing process starting from the element s on level $t-1$ must lead to the
root k in at most $s-k$ steps, so that r is actually deduced from k by successive
addition of unity in the tree. In other words,


(4)  $r = k+t-1.$
We also have $s \ge k+t-1$, so that by (2)

    $f(s) \ge \tfrac12 (k+t-1)(k+t) > k+t = r+1,$
which contradicts (3); hence, the lemma is proved.


Proof of the theorem. Suppose $\{a_n\}_{n=2}^{\infty}$ is not identically zero. Let $a_j$ be the
first element $\ne 0$ in the sequence $\{a_n\}_{n=2}^{\infty}$. Then, by (1), the sequence contains an element
$a_k > 0$ $(k \ge 3)$. Repeated application of (1) implies


    $0 < a_k \le a_{k+1} + a_{f(k)} \le \cdots \le \sum_{m \in \text{level } t} a_m,$

where the sum is taken over all indices m occurring in level t of the tree with
root k. (The notation is defined in the lemma.) The smallest index in level t
equals $k+t$, so that apparently $a_k$ may be majorized by a finite sum of distinct
$a_m$ with indices m greater than an arbitrarily chosen number N. For example,


    $0 < a_k \le a_{k_1} + a_{k_2} + \cdots + a_{k_l} \quad (N < k_1 < k_2 < \cdots < k_l).$


Now we repeat the same process with any of the numbers $a_{k_1}, \ldots, a_{k_l}$, obtaining
$a_{p_1}, \ldots, a_{p_q}$ $(k_l < p_1 < p_2 < \cdots < p_q)$, such that

    $\sum_{i=1}^{l} a_{k_i} \le \sum_{j=1}^{q} a_{p_j},$

and so on.


Finally, we get infinitely many disjoint blocks of $\{a_n\}_{n=1}^{\infty}$, each having a sum
at least $a_k$, so that $\sum_{n=1}^{\infty} a_n = \infty$.
REMARKS. (a) With a slight modification of the proofs we may show that a
weaker version of the difference condition, viz.,


    $f(n+1) - f(n) \ge n - C \quad (n \ge N)$


(N and C being fixed) is already sufficient for divergence of $\sum a_n$.

(b) It should be noticed that for any increasing function g there exists a
function $f \ge g$ and a nonvanishing convergent series $\sum_{n=1}^{\infty} a_n$ such that
$0 \le a_n \le a_{n+1} + a_{f(n)}$ for all n. To illustrate this we take a monotonic sequence
$\{n_i\}_{i=1}^{\infty}$ that tends to infinity sufficiently rapidly, and define


    $a_n = 2^{-i}$     if $n = n_i$ $(i = 1, 2, 3, \ldots)$,
    $a_n = 2^{-i-1}$   if $n = n_i + 1$ $(i = 1, 2, 3, \ldots)$,
    $a_n = 0$          otherwise.


Finally, we take $f(n) = n_{i+1}$ $(n_{i-1}+1 < n \le n_i+1)$. So we must conclude that a
difference condition like (D) seems to play the central role in the theorem.


A NOTE ON HADAMARD PRODUCTS 
T. E. Davis, Kansas State Teachers College 
The Hadamard product of two $n \times n$ matrices $A = \|a_{ij}\|$ and $B = \|b_{ij}\|$ is
$A \circ B = \|a_{ij}b_{ij}\|$. The following result is given in [2] and generalized to normal
matrices in [1]. We give here a new proof based on tensor products.


THEOREM. Let A and B be positive semi-definite Hermitian matrices with
characteristic roots $\alpha_1 \le \alpha_2 \le \cdots \le \alpha_n$ and $\beta_1 \le \beta_2 \le \cdots \le \beta_n$ resp. Then $A \circ B$ is
also positive semi-definite Hermitian, and each of its characteristic roots $\lambda$ satisfies


    $\alpha_1\beta_1 \le \lambda \le \alpha_n\beta_n.$


Proof. The tensor product $A \otimes B = \|a_{ik}b_{jl}\|$ (with row index (i, j) and column
index (k, l)) is Hermitian with characteristic roots all $n^2$ products $\alpha_i\beta_j$. Since
$\alpha_1 \ge 0$ and $\beta_1 \ge 0$, the smallest of these is $\alpha_1\beta_1$ and the largest is $\alpha_n\beta_n$. Now $A \circ B$
is a principal submatrix of $A \otimes B$, so it is (obviously) also Hermitian and (by the
minimax principle) its characteristic roots $\lambda$ lie in the interval $\alpha_1\beta_1 \le \lambda \le \alpha_n\beta_n$.
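
The key step, that $A \circ B$ sits inside $A \otimes B$ as the principal submatrix on the index pairs (i, i), can be seen directly in a small computation. The sketch below uses plain nested lists and hypothetical helper names; the 2×2 eigenvalue formula is included only so that the bound can be checked on one real symmetric example.

```python
import math


def kron(A, B):
    """Tensor product with row index (i, j) -> i*n + j and entry a_ik * b_jl."""
    n = len(A)
    return [
        [A[i][k] * B[j][l] for k in range(n) for l in range(n)]
        for i in range(n)
        for j in range(n)
    ]


def hadamard(A, B):
    """Entrywise product a_ij * b_ij."""
    n = len(A)
    return [[A[i][j] * B[i][j] for j in range(n)] for i in range(n)]


def diagonal_principal_submatrix(K, n):
    """Keep the rows and columns of K indexed by the pairs (i, i)."""
    idx = [i * n + i for i in range(n)]
    return [[K[r][c] for c in idx] for r in idx]


def eigenvalues_sym_2x2(M):
    """Eigenvalues (ascending) of a real symmetric 2x2 matrix."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    disc = math.sqrt(tr * tr - 4 * det)
    return ((tr - disc) / 2, (tr + disc) / 2)


A = [[2, 1], [1, 2]]   # positive definite, characteristic roots 1 and 3
B = [[2, 1], [1, 1]]   # positive definite
H = hadamard(A, B)
same = diagonal_principal_submatrix(kron(A, B), 2) == H
```

For this pair the roots of $A \circ B$ are $3 \pm \sqrt{2}$, which lie between $\alpha_1\beta_1$ and $\alpha_n\beta_n$ as the theorem asserts.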

The generalization in [1] to normal matrices also follows this easy path once 
you show that each characteristic root of a principal submatrix of a normal 
matrix A lies in the convex hull of the roots of A. 
RESEARCH PROBLEMS


EDITED BY VICTOR KLEE


In this Department the Monthly presents easily stated research problems dealing with notions
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied
by relevant references (if any are known to the author) and by a brief description of known
partial results. Material should be sent to Victor Klee, Department of Mathematics, University
of Washington, Seattle, WA 98105.


A PROBLEM IN GRAPH COLORING 


BRANKO GRÜNBAUM, Michigan State University
(Presently: Univ. of Washington, Seattle) 


A graph is said to be colorable by k colors if its set of nodes can be partitioned
into k disjoint subsets such that no edges connect nodes in the same subset, and
is said to be k-chromatic if it is colorable by k colors but not by $k-1$ colors. In
recent years considerable attention has been devoted to the following problem:
Given positive integers k and n, with $k \ge 3$ and $n \ge 3$, do there exist graphs which
are k-chromatic while containing no circuits of length $\le n$?

The following results have been obtained: Zykov [1949] and Mycielski
[1955] proved the existence of k-chromatic graphs of arbitrarily large k having
no circuits of length $\le n = 3$. Descartes [1954] and Kelly-Kelly [1954] obtained
the same result for n=5, and Nešetřil [1966] for n=7. Erdős [1959]
established by probabilistic methods the existence for arbitrary k and n, while
Lovász [1968] gave direct constructions for such graphs. The graphs constructed
in all those papers are “very large” and, in particular, contain nodes of
valence large in comparison to k.

It is of some interest to note that the existence of k-chromatic graphs with
no circuits of length $\le n$ was recently (Taylor [1969]) used to answer in the
negative a question in model theory posed by Mycielski [1964].

On the other hand, a result of Brooks [1941] asserts: If all nodes of a connected
graph G have valence at most k, where $k \ge 3$, then either G is k-colorable or
else G is the complete graph with k+1 nodes.

Some years ago (see Erdős [1964]) observations like the above led me to
formulate the following conjecture:


CONJECTURE. If $k \ge 3$ and $n \ge 3$ are integers, there exist k-chromatic, k-valent
graphs G(k, n) that contain no circuits of length $\le n$.


The progress in settling the conjecture seems to have been very slow. Graphs
G(3, n) are easy to construct, for example from circuits with an odd number of
edges and from 3-valent graphs with girth $\ge n$ (concerning such graphs see, for
example, Sachs [1964]). Recently Chvátal [1970] constructed a 12-node graph
G(4, 3). In the following lines I shall describe a 25-node graph G=G(4, 4) I
found in 1963. As far as I know, no other cases of the conjecture have been
decided.
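
For small graphs the three requirements in the conjecture (valence, chromatic number, shortest circuit) can be verified by brute force. The sketch below does this for the Petersen graph, which is not mentioned in the article and is used here only as a familiar illustration: it is 3-valent, 3-chromatic, and has no circuit of length less than 5, so it is a graph of type G(3, 4) in the notation above. All function names are hypothetical.

```python
from collections import deque
from itertools import product


def is_colorable(nodes, edges, k):
    """Try every assignment of k colors; feasible only for small graphs."""
    for colors in product(range(k), repeat=len(nodes)):
        c = dict(zip(nodes, colors))
        if all(c[u] != c[v] for u, v in edges):
            return True
    return False


def chromatic_number(nodes, edges):
    """Smallest k for which a proper k-coloring exists."""
    k = 1
    while not is_colorable(nodes, edges, k):
        k += 1
    return k


def girth(nodes, edges):
    """Length of a shortest circuit, via breadth-first search from every node."""
    adj = {v: set() for v in nodes}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    best = float("inf")
    for s in nodes:
        dist, parent = {s: 0}, {s: None}
        queue = deque([s])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    parent[w] = u
                    queue.append(w)
                elif parent[u] != w:  # a non-tree edge closes a circuit
                    best = min(best, dist[u] + dist[w] + 1)
    return best


# Petersen graph: outer 5-circuit, five spokes, inner pentagram.
PETERSEN_NODES = list(range(10))
PETERSEN_EDGES = (
    [(i, (i + 1) % 5) for i in range(5)]
    + [(i, i + 5) for i in range(5)]
    + [(i + 5, (i + 2) % 5 + 5) for i in range(5)]
)
```

The same functions could in principle be applied to the 25-node graph described below once its edge list is written out, although the exhaustive coloring search grows quickly with the number of nodes.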
FIG. 1


The graph G (see Figure 1) may be described as follows: Out of the 25 nodes
of G, 20 nodes and the edges connecting them form the net of a regular dodeca- 
hedron (heavy edges in Figure 1). It is well known that there are 10 quad- 
ruples of vertices of the dodecahedron which are the vertices of a regular tetra- 
hedron. These ten quadruples fall into two sets of five quadruples each, the 
quadruples of each set covering simply the vertices of the dodecahedron. One 
such set is represented in the heavily drawn part of G by nodes bearing the same 
designation. The graph G is obtained by taking five additional nodes (the “outer” 
ones in Figure 1) and joining each of them to the four nodes in one of the quad- 
ruples. Thus constructed, G is obviously 4-valent, and contains no circuit with 
less than 5 edges. It is also easily checked that the action of the group of
automorphisms of G on the five outer nodes coincides with that of the alternating
group.


FIG. 2


In order to show that G is not 3-colorable, assume that a 3-coloring of G is 
given; then there are two possibilities to be considered for the outer five nodes: 

(i) some three of them have the same color; 

(ii) two nodes have one color, two another, the last node having the third 
color. 

In case (i) we may, without loss of generality, assume that the three nodes in 
question are those marked by two circles in Figure 2; then the nodes in the paths 
indicated by heavy edges are colored by the remaining two colors. But then for 
at least one of the edges indicated by dotted lines, both nodes incident to the 
edge must have the same color, that of the outer three nodes. 

In case (ii) we may assume, again without loss of generality, that the outer 
nodes are colored by 1, 2, 3 as indicated in Figure 3. Then, if the node indicated 
by two circles is colored 2, we arrive at a contradiction by following the arrow
leading from it to the left, there being no color available for the node indicated
by the black disc. If, on the other hand, the two-circle node is colored 1 we reach
a contradiction by following the arrow to the right.


FIG. 3


Thus G is not 3-colorable, and we have established that it is a graph of type G(4, 4).


The author is indebted to Professor R. A. Duke for many helpful suggestions. 
CLASSROOM NOTES


EDITED BY DAVID DRASIN 


Manuscripts for this Department should be sent to David Drasin, Division of Mathematical 
Sciences, Purdue University, Lafayette, IN 47907. 


ON THE SIMILARITY OF PARTIALLY ORDERED SETS 


ALEXANDER ABIAN, Iowa State University 


It is well known and easy to prove [1] that for linearly ordered sets $(R, \le)$
and $(S, \le')$, if there exists a one-to-one mapping $\varphi$ from $R$ onto $S$ which
preserves order (i.e., $x \le y$ implies $\varphi(x) \le' \varphi(y)$), then $(R, \le)$ is
similar to $(S, \le')$, since the converse mapping $\varphi^{-1}$ of $\varphi$ also preserves order.

It is a curious fact that in the case of partially ordered sets $(P, \le)$ and
$(Q, \le')$, even a stronger hypothesis does not imply that $(P, \le)$ is similar to
$(Q, \le')$. For instance, as shown in the example below (suggested by Daryl R.
Fischer), if there exists a one-to-one mapping $f$ from $P$ onto $Q$ such that $f$
preserves order and if there exists a one-to-one mapping $g$ from $Q$ onto $P$ such
that $g$ also preserves order, then it is not necessary that $P$ be similar to $Q$,
i.e., it is not necessary that there exist a one-to-one mapping $h$ from $P$ onto $Q$
such that both $h$ and its converse $h^{-1}$ preserve order.

Example 1. Let $(P, \le)$ and $(Q, \le')$ be partially ordered sets represented in
the diagram below, where $p_i < p_j$ if and only if there exists an upward connecting
line from $p_i$ to $p_j$. Similarly, $q_i <' q_j$ if and only if there exists an upward
connecting line from $q_i$ to $q_j$. Otherwise, the distinct elements are not comparable.


[Diagram of $(P, \le)$: figure not recoverable from the source.]

Now, let $f$ be a mapping from $(P, \le)$ into $(Q, \le')$ defined by:

    $f(p_i) = q_i$ for $i = 0, 1, 2, 3, 4, \dots$.


Clearly, f is one-to-one onto and order preserving. 
Next, let $g$ be a mapping from $(Q, \le')$ into $(P, \le)$ defined by:

    $g(q_1) = p_2$ and $g(q_3) = p_0$

and

    $g(q_i) = p_{i+2}$ for $i = 0, 2, 4, 5, 6, \dots$.


Clearly, g is one-to-one onto and order preserving. 
[Diagram of $(Q, \le')$: figure not recoverable from the source.]


However, $P$ is not similar to $Q$ since in $(P, \le)$ there exists an element,
namely $p_2$, which has one and only one successor, namely $p_0$, whereas in $(Q, \le')$
there exists no element with one and only one successor.


Again it is a curious fact that if $h$ is a one-to-one order preserving mapping
from a partially ordered set $(R, \le)$ onto itself then the converse $h^{-1}$ of $h$ is not
necessarily order preserving, i.e., $h$ is not necessarily a similarity mapping. This
is shown by the example below.


Example 2. Let $(R, \le)$ be a partially ordered set represented in the diagram
below, where $r_i < r_j$ if and only if there exists an upward connecting line from $r_i$ to
$r_j$. Otherwise, the distinct elements are not comparable.


[Diagram of $(R, \le)$: figure not recoverable from the source.]


Now, let $h$ be a mapping from $(R, \le)$ into itself defined by:

    $h(r_1) = r_0$ and $h(r_3) = r_2$

and

    $h(r_i) = r_{i+2}$ for $i = 0, 2, 4, 5, 6, \dots$.


Clearly, $h$ is one-to-one onto and order preserving.


However, $h^{-1}$ is not order preserving since $r_0 < r_2$ whereas $h^{-1}(r_0) = r_1$ and
$h^{-1}(r_2) = r_3$, and $r_1$ and $r_3$ are not comparable. Thus, indeed $h$ is not a similarity
mapping.

In contrast to the above two examples, we have the following two theorems 
for the case of finite partially ordered sets. 


THEOREM 1. Let $(R, \le)$ be a finite partially ordered set and let $h$ be a one-to-one
order preserving mapping from $R$ into itself. Then $h$ is a similarity mapping from
$R$ onto itself.


Proof. Since $h$ is one-to-one and $R$ is finite it is clear that $h$ is onto. Thus, to
prove the theorem it is enough to show that if $h(x) < h(y)$ then $x < y$. Let $S$ be the
set of all the ordered pairs $(u, v)$ with $u \in R$ and $v \in R$ such that $u < v$, i.e.,

(1)    $S = \{(u, v) \mid u \in R \text{ and } v \in R \text{ and } u < v\}$.

Consider the mapping $h^*$ from $S$ into $S$ defined by:

    $h^*(u, v) = (h(u), h(v))$.

Since $h$ is one-to-one, obviously, $h^*$ is one-to-one, and since $S$ is finite we see
that $h^*$ is one-to-one from $S$ onto $S$. Now, if $h(x) < h(y)$ then by (1) we have
$(h(x), h(y)) \in S$. However, since $h^*$ is onto, for some $(u, v) \in S$ we must have:

(2)    $(h(x), h(y)) = h^*(u, v) = (h(u), h(v))$.

However, since $h$ is one-to-one we see that $x = u$ and $y = v$. But then from (2) and
(1) it follows that $u < v$ and hence $x < y$, as desired.
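
The statement of Theorem 1 can be checked by brute force on a small example. The following is only an illustrative sketch: the poset (the divisors of 12 under divisibility) and all function names are assumptions chosen for the illustration, not part of the note.

```python
from itertools import permutations

def is_order_preserving(h, leq, elems):
    """True if x <= y implies h[x] <= h[y] for all pairs of elements."""
    return all(leq(h[x], h[y]) for x in elems for y in elems if leq(x, y))

def check_theorem_1(elems, leq):
    """Check that every one-to-one order preserving self-map of a finite
    poset has an order preserving converse (i.e. is a similarity)."""
    for image in permutations(elems):
        h = dict(zip(elems, image))          # a one-to-one map of elems onto itself
        if not is_order_preserving(h, leq, elems):
            continue
        inverse = {v: k for k, v in h.items()}
        if not is_order_preserving(inverse, leq, elems):
            return False                     # would contradict Theorem 1
    return True

# Hypothetical test poset: divisors of 12 ordered by divisibility.
DIVISORS = [1, 2, 3, 4, 6, 12]

def divides(a, b):
    return b % a == 0

print(check_theorem_1(DIVISORS, divides))
```

Finiteness is essential here; Example 2 shows that the same check would fail on a suitable infinite poset.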


THEOREM 2. Let $P$ and $Q$ be finite partially ordered sets and let $f$ be a one-to-one
order preserving mapping from $P$ into $Q$ and $g$ be a one-to-one order preserving
mapping from $Q$ into $P$. Then $P$ is similar to $Q$.


Proof. Since $P$ and $Q$ are finite and $f$ and $g$ are one-to-one, we see that $f$ as
well as $g$ is an onto mapping. Clearly, to prove that $P$ is similar to $Q$ it is enough
to show that the converse $f^{-1}$ of $f$ is order preserving. However,

(3)    $f^{-1} = f^{-1} \circ (g^{-1} \circ g) = (f^{-1} \circ g^{-1}) \circ g = (g \circ f)^{-1} \circ g$.

But since $f$ and $g$ are one-to-one onto and order preserving, we see that the
composite $g \circ f$ is a one-to-one order preserving mapping from $P$ onto $P$. But then
from Theorem 1, it follows that $g \circ f$ is a similarity mapping from $P$ onto itself
and consequently its converse $(g \circ f)^{-1}$ is order preserving. On the other hand,
since $g$ is also order preserving we see that the composite $(g \circ f)^{-1} \circ g$ is order
preserving. But then, from (3) it follows that $f^{-1}$ is order preserving, as desired.


The author thanks Daryl R. Fischer for the short proof of Theorem 1. 
ON CHARACTERIZATION OF UNIFORMIZABLE SPACES
J. V. MICHALOWICZ, The Catholic University of America


In most topology books which mention uniform spaces, uniformizability is
characterized in terms of complete regularity. More precisely, let us say that a
space $X$ is $T_{3a}$ if it has the property that for each $x \in X$ and closed $A$ not
containing $x$, there is a continuous map $\varphi: X \to I$ which has $\varphi(x) = 1$ and vanishes on $A$;
we call $X$ completely regular if it is both $T_{3a}$ and Hausdorff. Then we have:


THEOREM 1. A topological space is a separated uniformizable space iff it is
completely regular.


THEOREM 2. A topological space is uniformizable iff it is $T_{3a}$.


An exception is Dugundji [1], which gives an introduction to uniform spaces 
that is very brief but worth building on since it presents uniform structures both 
in terms of entourages and uniformizing families and clearly shows their equiv- 
alence. It is the purpose of this note to show how the above theorems can 
easily be appended to that treatment. 

The usual first example of a separated uniform space is provided by a 
metric space or, more generally, a gauge space (that is, a space whose topology is 
induced by a separating family of gauges). The main result on uniform spaces in 
[1] is that every separated uniformizable space is a gauge space, which yields 
Theorem 1 since the gauge spaces are precisely the completely regular spaces. 

We show that with the addition of one definition and one simple proposition 
we can likewise infer Theorem 2. We define the “semi gauge space” (for want of a 
better term) exactly like the gauge space but with the requirement that the 
family of gauges be separating omitted. By the same development as before, we 
deduce that the uniformizable spaces are precisely the semi gauge spaces and 
that any semi gauge space is $T_{3a}$. The proof in [1] for the converse of the second
statement in the completely regular case relies on embedding a completely
regular space in a parallelotope, and cannot be used for the $T_{3a}$ case. However,
the required converse can also be proved in a manner that is in fact applicable to
the $T_{3a}$ case, for we have:


PROPOSITION. Every $T_{3a}$ space is a semi gauge space.


Proof. Let $Y$ be a $T_{3a}$ space and $\mathcal{T}$ its topology, and $I^*$ the set of all
continuous maps $f: Y \to I$. For each $f \in I^*$, $d_f: Y \times Y \to I$ defined by
$d_f(x, y) = |f(x) - f(y)|$ is a gauge in $Y$. Then $D = \{d_f \mid f \in I^*\}$ is a family of gauges
which induces $\mathcal{T}$ since the family of all balls
$B(y; d_f, \varepsilon) = \{x \mid d_f(x, y) = |f(x) - f(y)| < \varepsilon\}$
forms a basis for $\mathcal{T}$. Indeed, each $B(y; d_f, \varepsilon) = f^{-1}(\,]f(y) - \varepsilon, f(y) + \varepsilon[\,)$ is open in $\mathcal{T}$
since $f$ is continuous. And if $x \in U \subset Y$, where $U$ is open in $\mathcal{T}$, then the $T_{3a}$
property gives a $\varphi \in I^*$ with $\varphi(x) = 0$ and $\varphi(y) = 1$ for $y \in CU$ so that $x \in B(x; d_\varphi, 1) \subset U$.
Thus $Y$ is a semi gauge space.
AFTER THE DELUGE
D. A. Moran, Michigan State University 


The purpose of this note is to provide a somewhat simpler proof than that 
given by Professor Marston Morse of his elementary theorem about pits, peaks, 
and passes on the sphere [1]. It should be recalled that Professor Morse views a 
positive, real-valued, bounded, differentiable function on the sphere as an alti- 
tude, measured from the center of some hypothetical spherical planet. Critical 
points of the function then correspond to pits (=minima), peaks (=maxima), 
and passes (=saddle points of index —1) on the planet. It is assumed that no 
critical points more complicated than these three types ever occur, and that no 
two of these singularities occur at precisely the same altitude. Our viewpoint is 
essentially the same as this, but we start with the following slightly different 


ADDITIONAL HYPOTHESIS: No pass is as high as a peak, or as low as a pit.


This hypothesis is easily fulfilled, if we agree to drill deep holes at the bottom
of each pit, and raise tall flagpoles atop each peak. Let $N_0$, $N_1$, and $N_2$ be the
number of pits, passes, and peaks.

Now let rain begin to fall on the planet which represents the sphere.
Immediately $N_0$ lakes are created. As the water level rises to the altitude of a pass,
a lake can merge with itself, creating an island, or else a lake can merge with
another lake, resulting in a net decrease by 1 in the number of lakes. When the
water level has risen to inundate every pass, but is not yet as high as any peak,
the number of islands will be

    1 + number of island-increasing passes,

and the number of lakes will be

    $N_0$ - number of lake-decreasing passes.

The number of lakes less the number of islands is therefore

    $N_0 - N_1 - 1$.

On the other hand, at this point in time there is clearly one lake and $N_2$ islands,
so

    $N_0 - N_1 - 1 = 1 - N_2$,

proving Morse's generalization of the theorem of Euler.
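
As a worked restatement (added here for illustration, using only the counts $N_0$, $N_1$, $N_2$ of pits, passes, and peaks defined above), the final equality rearranges into the familiar alternating sum:

\[
N_0 - N_1 - 1 = 1 - N_2 \quad\Longleftrightarrow\quad N_0 - N_1 + N_2 = 2 .
\]

For instance, a planet with a single pit, a single peak, and no passes gives $1 - 0 + 1 = 2$.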
Written while the author was partially supported by NSF grant GP 8962. 
In conclusion, we have found the seminars to be an asset to our departmental
objectives and felt that other Universities might be interested in implementing a 
similar program. 


PROBLEMS AND SOLUTIONS 


EDITED By EMory P. STARKE 


ASSOCIATE EDITORS: JOSHUA BARLAZ, GRATTAN P. MURPHY. COLLABORATING EDITORS:
LEONARD CARLITZ, GULBANK D. CHAKERIAN, HASKELL COHEN, ISRAEL N. HERSTEIN,
MURRAY S. KLAMKIN, DANIEL J. KLEITMAN, ROGER C. LYNDON, MARVIN MARCUS,
ALBERT WILANSKY, OSWALD WYLER, and UNIVERSITY OF MAINE PROBLEMS GROUP:
GEORGE S. CUNNINGHAM, CLAYTON W. DODGE, HOWARD W. EVES, WILLIAM R. GEIGER,
CHARLES A. GREEN, ERIC S. LANGFORD, PHILIP M. LOCKER, JOHN C. MAIRHUBER,
EDWARD S. NORTHAM and WILLIAM L. SOULE, JR.


All problems (both elementary and advanced) proposed for inclusion in this Department 
should be sent to E. P. Starke, 1000 Kensington Ave., Plainfield, NJ 07060. Proposers of 
problems are urged to enclose any solutions or information that will assist the editors.
Ordinarily, problems in well-known textbooks and results in generally accessible sources are not
appropriate for this Department. No solutions (except those accompanying proposals) should 
be sent to Professor Starke. 


ELEMENTARY PROBLEMS 


Solutions of Elementary Problems should be sent to Problems Group, Mathematics Depart- 
ment, University of Maine, Orono, ME 04473. To facilitate their consideration, solutions of 
Elementary Problems in this issue should be typed (with double spacing) and should be mailed 
before March 31, 1971. Contributors (in the United States) who desire acknowledgment of 
receipt of their solutions are asked to enclose self-addressed stamped postcards. 


An asterisk (*) means neither the proposer nor the editors supplied a solution.


E 2265.* Proposed by N. M. Dongre, Sydenham College, India 


Let a regular star polygon be constructed by dividing a circle into $n$ equal
parts and drawing the chords joining alternate points of division. Each of the $n$
chords will carry four points of intersection. It is desired to assign the integers
$1, 2, \dots, 2n$ to the $2n$ points of intersection so as to have a magic star polygon
(i.e. the sum of the four numbers on each chord is constant, see Problem E 2092
[1969, 557]). Prove that a necessary condition for the existence of a magic
star polygon is that $n > 5$. Is this condition sufficient?


E 2266. Proposed by Leon Gerber, St. John's University, Brooklyn, N.Y.


If the lines joining the vertices of a simplex $\{A_i\}_0^n$ to its centroid $M$ meet the
circumsphere again in the points $\{B_i\}_0^n$, then

(a) $A_iM \cdot MB_i = \sum A_iA_j^2/(n+1)^2$, $\quad i, j = 0, \dots, n$, $i < j$.
(b) $\sum A_iM/MB_i = n + 1$.


E 2267.* Proposed by Emilia Mycielska, University of California, Berkeley


Given a permutation $a_0, a_1, \dots, a_n$ of the sequence $0, 1, \dots, n$; a
transposition of $a_i$ with $a_j$ is called legal if $a_i = 0$ for $i > 0$, and $a_{i-1} + 1 = a_j$. The
permutation $a_0, a_1, \dots, a_n$ is called regular if after a number of legal transpositions
it becomes $1, 2, \dots, n, 0$. For which numbers $n$ is the permutation
$1, n, n-1, \dots, 3, 2, 0$ regular?
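
Since at most one legal transposition is available at any moment (the zero can only be exchanged with the successor of its left neighbour), the process can be simulated directly for small $n$. The sketch below is only an experimental aid, not a solution; it assumes the permutation in question reads $1, n, n-1, \dots, 3, 2, 0$, and the function names are hypothetical.

```python
def run_legal_transpositions(perm):
    """Apply legal transpositions until none is available (or a state repeats)."""
    a = list(perm)
    n = len(a) - 1
    seen = set()
    while tuple(a) not in seen:
        seen.add(tuple(a))
        i = a.index(0)
        # No legal move if 0 is in front, or its left neighbour is already n.
        if i == 0 or a[i - 1] == n:
            break
        j = a.index(a[i - 1] + 1)   # position of the successor of the left neighbour
        a[i], a[j] = a[j], a[i]
    return a

def is_regular(perm):
    """True if the process ends in 1, 2, ..., n, 0."""
    n = len(perm) - 1
    return run_legal_transpositions(perm) == list(range(1, n + 1)) + [0]

def problem_permutation(n):
    """The permutation 1, n, n-1, ..., 3, 2, 0 (assumed reading of the problem)."""
    return [1] + list(range(n, 1, -1)) + [0]

for n in range(1, 17):
    print(n, is_regular(problem_permutation(n)))
```

The target arrangement is itself a dead end (the zero is last and preceded by $n$), so checking the final state is enough.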


E 2268. Proposed by Leonard Carlitz, Duke University 


An arithmetic function $f(n)$ is completely multiplicative if $f(ab) = f(a)f(b)$ for
all positive integers $a$, $b$. Show that

\[ \sum_{ab=n} f(a)f(b) = \delta(n) f(n) \qquad (n = 1, 2, \dots), \]

where $\delta(n)$ is the number of divisors of $n$, if and only if $f(n)$ is completely
multiplicative.
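
The identity is easy to test numerically for particular functions. The sketch below is illustrative only: it compares a completely multiplicative function ($n \mapsto n^2$) with one that is multiplicative but not completely so (Euler's totient). The helper names are assumptions made for this example.

```python
from math import gcd

def divisor_count(n):
    """Number of positive divisors of n."""
    return sum(1 for d in range(1, n + 1) if n % d == 0)

def divisor_pair_sum(f, n):
    """Sum of f(a) * f(b) over all ordered factorizations ab = n."""
    return sum(f(a) * f(n // a) for a in range(1, n + 1) if n % a == 0)

def satisfies_identity(f, limit):
    """Check sum_{ab=n} f(a)f(b) == (number of divisors of n) * f(n) for n <= limit."""
    return all(divisor_pair_sum(f, n) == divisor_count(n) * f(n)
               for n in range(1, limit + 1))

def square(n):
    return n * n                      # completely multiplicative

def euler_phi(n):
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)  # not completely multiplicative

print(satisfies_identity(square, 60))     # identity holds
print(satisfies_identity(euler_phi, 60))  # identity fails (already at n = 4)
```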


E 2269.* Proposed by S. R. Conrad, Bayside, New York. 


A shuffled deck of ordinary playing cards is dealt out in the manner of the 
French gambling game of “treize” [cf. Rouse Ball, Mathematical Recreations and 
Essays, p. 325]. The dealer deals out the first 13 cards as he calls the numbers 
1, 2,---, 13 [J, Q, K counting as 11, 12, 13, respectively]. He repeats the 
procedure three more times, without replacement, exhausting the deck. What is 
the probability of (a) no “match”? (b) exactly k matches? (c) at least k matches? 
[A match is defined as calling out the number n while dealing card n.]


E 2270. Proposed by Peter Yff, American University of Beirut, Lebanon 


Let $a$, $b$, $c$ be elements of a group $G$ of odd order. If $a^2b^2 = c^2$, prove that $ab$
and $c$ are in the same coset of the commutator subgroup $K$.


SOLUTIONS OF ELEMENTARY PROBLEMS 
An Estimate of an Integral 


E 2211 [1970, 79]. Proposed by M. D. Landau, Lafayette College 


An exercise in Rudin, Principles of Mathematical Analysis states 


\[ \int_x^{x+1} \sin t^2\,dt < 2/x \quad \text{for } x > 0. \]


Can this bound be improved to some smaller multiple of 1/x? 
Solution by J. Gillis and M. Shimshoni, Weizmann Institute of Science,
Israel. The bound can be replaced by $1/x$ and this is best possible. Indeed we
show that

\[ (1)\qquad x\left|\int_x^{x+1} \sin t^2\,dt\right| < 1 \quad \text{for } x > 0, \]

and

\[ (2)\qquad \limsup_{x\to\infty}\; x\left|\int_x^{x+1} \sin t^2\,dt\right| = 1. \]

Integration by parts yields

\[ (*)\qquad \int_x^{x+1} \sin t^2\,dt = \left[-\frac{\cos t^2}{2t}\right]_x^{x+1} - \frac12\int_x^{x+1} \frac{\cos t^2}{t^2}\,dt, \]

and hence

\[ \left|\int_x^{x+1} \sin t^2\,dt\right| < \frac12\left(\frac1x + \frac1{x+1}\right) + \frac12\int_x^{x+1} \frac{dt}{t^2} = \frac1x. \]

To prove (2) it will suffice to show that, given $\varepsilon > 0$, there exist arbitrarily large
$x$ such that $\cos x^2 > 1 - \varepsilon$, $\cos(x+1)^2 < -1 + \varepsilon$. It will then follow from (*) that

\[ \left|\int_x^{x+1} \sin t^2\,dt\right| > \frac{1-\varepsilon}{2}\left(\frac1x + \frac1{x+1}\right) + O\!\left(\frac1{x^2}\right), \]

and so

\[ x\left|\int_x^{x+1} \sin t^2\,dt\right| > 1 - \varepsilon + O\!\left(\frac1x\right). \]

But $\varepsilon$ was arbitrary and $x$ could be arbitrarily large, and hence

\[ \limsup_{x\to\infty}\; x\left|\int_x^{x+1} \sin t^2\,dt\right| \ge 1. \]

This, combined with (1), yields (2). The existence of such $x$ will follow if we show
that, for any $\delta > 0$, we can find positive integers $m$, $n$ such that $|x_n^2 - 2m\pi| < \delta$,
where $x_n$ is defined by $2x_n + 1 = (2n+1)\pi$. For any $n$, let $y_n = (8\pi)^{-1}\{(2n+1)\pi - 1\}^2$
and we have to prove that the fractional part of $y_n$ can be arbitrarily
small. However, for any fixed positive integer $h$, we have
$z_n = y_{n+h} - y_n = nh\pi + \tfrac12 h[(h+1)\pi - 1]$ and the set $\{z_n\}$ is well known to be equidistributed, and the
same is therefore true of the set $\{y_n\}$ by van der Corput's theorem [Acta Math.
56 (1931), 373].

Also solved by Meher Baba, D. M. Bloom, R. P. Boas, R. J. Dickson, W. O. Egerland, S. C.
Ferry, Doug Hanto, Norman Locksley, O. P. Lossers (Netherlands), E. F. Schmeichel, Sid Spital,
St. Olaf College Students, M. K. Vamanamurthy, Konrad Victor (Israel), L. E. Ward, Sr., and
P. H. Young.
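
The bound $x\left|\int_x^{x+1}\sin t^2\,dt\right| < 1$ can be spot-checked numerically. The following is a rough sketch using composite Simpson integration; the step count and the sample points are arbitrary choices made for this illustration.

```python
import math

def integral_sin_t2(x, steps=20000):
    """Composite Simpson approximation of the integral of sin(t^2) over [x, x+1].
    `steps` must be even."""
    h = 1.0 / steps
    total = math.sin(x * x) + math.sin((x + 1.0) ** 2)
    for k in range(1, steps):
        t = x + k * h
        total += (4 if k % 2 else 2) * math.sin(t * t)
    return total * h / 3.0

def scaled_value(x):
    """The quantity x * |integral|, which the solution shows is below 1."""
    return x * abs(integral_sin_t2(x))

for x in (0.5, 1.0, 5.0, 20.0, 50.0):
    print(x, scaled_value(x))
```

Values close to 1 occur only for special large $x$, namely those for which $\cos x^2$ is near 1 and $\cos (x+1)^2$ is near $-1$.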
An Average Sequence
E 2212 [1970, 79]. Proposed by G. E. Peterson, University of Missouri at St. 
Louis 


If $\{s_n\}$ and $\{t_n\}$ are sequences such that $\sum_{k=1}^{\infty} |t_k|$ converges and
$\lim_{n\to\infty} \left(\sum_{k=1}^{n} s_k t_k\right)/s_n$ exists and is nonzero, then $\lim_{n\to\infty} s_n$ exists and is nonzero.


Solution by R. J. Dickson, Lockheed Palo Alto Research Laboratory. Set
$r_n = \left(\sum_{k=1}^{n} s_k t_k\right)/s_n$; then $s_{k-1}r_{k-1}/(s_k r_k) = 1 - (t_k/r_k)$. Let $m$ be chosen large enough
so that $r_n \ne 0$, $r_n \ne t_n$ for $n \ge m$. Then $\prod_{k=m+1}^{n} (1 - t_k/r_k) = s_m r_m/(s_n r_n)$. Letting
$n \to \infty$, the conclusion follows if the product on the left converges. By a standard
criterion, it suffices to have $\sum_{k=m+1}^{\infty} |t_k/r_k| < \infty$ and this follows from $\sum_{k=1}^{\infty} |t_k| < \infty$,
since $r_n \to r \ne 0$.
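
The telescoping identity at the heart of this solution, $\prod_{k=m+1}^{n}(1 - t_k/r_k) = s_m r_m/(s_n r_n)$, can be confirmed exactly with rational arithmetic. The sequences below ($s_k = 1 + 1/k$, $t_k = 2^{-k}$) are arbitrary sample data chosen for this sketch, not taken from the solution.

```python
from fractions import Fraction

def check_product_identity(s, t, m, n):
    """Verify prod_{k=m+1}^{n} (1 - t_k/r_k) == s_m r_m / (s_n r_n) exactly,
    where r_k = (sum_{j<=k} s_j t_j) / s_k. Lists are 1-indexed (index 0 unused)."""
    partial = [Fraction(0)] * (n + 1)
    for k in range(1, n + 1):
        partial[k] = partial[k - 1] + s[k] * t[k]
    r = [None] + [partial[k] / s[k] for k in range(1, n + 1)]
    product = Fraction(1)
    for k in range(m + 1, n + 1):
        product *= 1 - t[k] / r[k]
    return product == s[m] * r[m] / (s[n] * r[n])

# Hypothetical sample sequences.
N = 12
S = [None] + [1 + Fraction(1, k) for k in range(1, N + 1)]
T = [None] + [Fraction(1, 2 ** k) for k in range(1, N + 1)]

print(check_product_identity(S, T, 2, N))
```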

Also solved by M. T. Bird, L.S. Bosanquet (England), Steven Ferry, M. L. J. Hautus (Nether- 
lands), Ellen Hertz, D. E. Manes, E. F. Schmeichel, Simeon Reich (Israel), Sid Spital, Mark Yu, 
and the proposer. 

Quadrilaterals with the Nagel Property 

E 2213 [1970, 79]. Proposed by H. Demir, Middle East Technical University, 
Turkey 

Let us say that a (planar) polygon has the Nagel property if the lines through 
the vertices of the polygon and bisecting the perimeter of the polygon are con- 
current. It is known that all triangles have the Nagel property and that not all 
quadrilaterals have the property. Determine the simple nondegenerate quadri- 
laterals that have the Nagel property. 


Solution by the editor based on the proposer's solution. Let $ABCD$ be a
quadrilateral having the Nagel property. Let each of the lines $AA'$, $BB'$, $CC'$, $DD'$
bisect the perimeter and pass through the Nagel point $N$. Let $AB = a$, $BC = b$,
$CD = c$, $DA = d$. We may suppose $a + b \ge c + d$ and $b + c \ge a + d$. It then follows
that $C'$ lies on segment $AB$, $B'$ on $CD$, and $A'$ and $D'$ on $BC$ (Fig. 1). Set $BD' = u$,
$A'C = v$, $CB' = w$, and $a + b + c + d = 2s$. Since $BC + CB' = s = D'C + CD$, then
$DB' = u$. Similarly $AC' = v$, $BC' = w$ and $D'A' = d$. We distinguish three cases.

FIG. 1

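CASE 1. $u \ne 0$ and $v \ne 0$. Thinking of $AA'$ and $DD'$ meeting at $N$, we see that
a necessary and sufficient condition for $CC'$ to pass through $N$ is

\[ \frac{AC'}{C'B} \cdot \frac{BC}{CA'} \cdot \frac{A'N}{NA} = -1, \quad \text{that is,} \quad \frac{A'N}{NA} = \frac{w}{b}, \]

by Menelaus' theorem (with directed segments) applied to triangle $ABA'$ cut by line $CNC'$. Similarly,
$BB'$ passes through $N$ if and only if

\[ \frac{DB'}{B'C} \cdot \frac{CB}{BD'} \cdot \frac{D'N}{ND} = -1, \quad \text{that is,} \quad \frac{D'N}{ND} = \frac{w}{b}. \]

Hence $AD$ is parallel to $D'A'$, and since these segments are also equal, it follows
that $ADA'D'$ is a parallelogram. The diagonals $AA'$ and $DD'$ then bisect each
other, so $AN = NA'$ and $b = w$. But $b + w = s$, so $2w = a - b + c + d = 2b$, from which
we obtain

\[ b = \frac{a + c + d}{3}. \]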


Hence ABCD is a trapezoid such that the longer of its two parallel bases is the 
arithmetic mean of its other three sides. Reversing the argument of this para- 
graph shows that every such trapezoid has the Nagel property. For example, 
the trapezoid with vertices (0, 0), (19, 0), (6, 24), (0, 24) has N=(7, 12). 

CASE 2. $u = v = 0$. Then $a + b = c + d = s$ and $a + d = b + c = s$, so $2a + b + d = 2c + b + d$.
Hence $a = c$ and $b = d$, so $ABCD$ is a parallelogram. Clearly every
parallelogram has the Nagel property.

CASE 3. $u = 0$ and $v \ne 0$ (or vice versa). Let $AC$ and $BD$ meet at $M$ (Fig. 2).
Now $a + d = b + c = s$, so $d + v + w = C'B + BC = CD + DA + AC' = c + d + v$, whence
$c = w$. Applying Menelaus' theorem to triangle $ABA'$ cut by $CNC'$ and to
triangle $AA'C$ cut by $BNM$, we obtain

\[ \frac{v}{c} \cdot \frac{b}{-v} \cdot \frac{A'N}{NA} = -1, \qquad \frac{A'N}{NA} = \frac{c}{b}, \]

\[ \frac{AN}{NA'} \cdot \frac{-d}{b} \cdot \frac{CM}{MA} = -1, \qquad \frac{CM}{MA} = \frac{b}{d} \cdot \frac{A'N}{NA} = \frac{b}{d} \cdot \frac{c}{b} = \frac{c}{d}. \]

Since $M$ divides side $CA$ of triangle $DAC$ in the ratio $c/d$ of the adjacent sides,
then $DM$ bisects angle $D$. Hence $\angle ADM = \angle MDC = \alpha$. Applying the law of
cosines to triangles $ABD$ and $CBD$ and letting $BD = f$, we have

\[ a^2 = d^2 + f^2 - 2df\cos\alpha, \]
\[ b^2 = c^2 + f^2 - 2cf\cos\alpha, \]

so

\[ a^2 - b^2 = d^2 - c^2 - 2f(d - c)\cos\alpha. \]

FIG. 2

Assuming $d \ge c$, let $d = c + e$, so $b = c + v + e$ and $a = c + v$. Now
$a^2 - (a + e)^2 = (c + e)^2 - c^2 - 2ef\cos\alpha$, which simplifies to

\[ ef\cos\alpha = ce + ae + e^2. \]

If $e \ne 0$, then $\cos\alpha = (c + a + e)/f = (c + b)/f$, an impossibility since $c + b > f$. Hence
$e = 0$, so $c = d$ and $a = b$. The figure is therefore a kite. By symmetry, every kite
has the Nagel property.


Three other correspondents mentioned the kite and the parallelogram. 


Interior Diagonals of a Polygon 


E 2214 [1970, 79]. Proposed by M. S. Klamkin, Ford Scientific Laboratory 
and B. Ross Taylor, York High School. 


It is intuitive that every simple $n$-gon ($n > 3$) possesses at least one interior
diagonal. For a simple $n$-gon what is the least number of diagonals which, except
for their endpoints, lie wholly in its interior?
Solution by Anders Bager, Hjørring, Denmark. The two tangents from a point
$P$ outside a circle touch in points $A$ and $B$. Connect $A$ and $B$ with a broken
line consisting of $n - 2$ chords succeeding each other along the smaller arc from
$A$ to $B$. Join $P$ to $A$ and $B$ to obtain a simple $n$-gon with exactly $n - 3$ inner
diagonals (all issuing from $P$).

The number $n - 3$ is minimal. This is trivially so if $n = 3$. Suppose it true for
some $n$ and consider an arbitrary simple $(n+1)$-gon. From this cut off a triangle
such that two sides are sides of the $(n+1)$-gon, and the third side an inner
diagonal. This is always possible and leaves a simple $n$-gon which, by assumption,
has at least $n - 3$ inner diagonals. Hence the $(n+1)$-gon has at least $(n - 3) + 1
= (n + 1) - 3$ inner diagonals. Thus the assertion of the problem is true by
induction.
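
The tangent construction can be tried out numerically. The sketch below builds the polygon for a unit circle with $P$ at distance 2 from the centre (both arbitrary choices) and counts inner diagonals by elementary floating-point tests; all names are hypothetical and the geometric predicates are a simple illustration rather than a robust computational-geometry routine.

```python
import math

def cross(o, a, b):
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def properly_cross(p, q, r, s):
    """True if segments pq and rs cross at a point interior to both."""
    return cross(p, q, r) * cross(p, q, s) < 0 and cross(r, s, p) * cross(r, s, q) < 0

def on_open_segment(p, q, v, eps=1e-12):
    """True if v lies strictly between p and q on the segment pq."""
    if abs(cross(p, q, v)) > eps:
        return False
    dot = (v[0] - p[0]) * (q[0] - p[0]) + (v[1] - p[1]) * (q[1] - p[1])
    return eps < dot < (q[0] - p[0]) ** 2 + (q[1] - p[1]) ** 2 - eps

def inside(poly, pt):
    """Ray-casting point-in-polygon test with a half-open rule at vertices."""
    x, y = pt
    result = False
    for k in range(len(poly)):
        (x1, y1), (x2, y2) = poly[k], poly[(k + 1) % len(poly)]
        if (y1 > y) != (y2 > y) and x1 + (y - y1) * (x2 - x1) / (y2 - y1) > x:
            result = not result
    return result

def count_inner_diagonals(poly):
    """Count diagonals lying, except for their endpoints, inside the simple polygon."""
    n = len(poly)
    edges = [(poly[k], poly[(k + 1) % n]) for k in range(n)]
    count = 0
    for i in range(n):
        for j in range(i + 2, n):
            if i == 0 and j == n - 1:
                continue                      # adjacent vertices, not a diagonal
            p, q = poly[i], poly[j]
            if any(properly_cross(p, q, r, s) for r, s in edges):
                continue
            if any(on_open_segment(p, q, v) for v in poly):
                continue
            if inside(poly, ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)):
                count += 1
    return count

def tangent_fan_polygon(n, dist=2.0):
    """P outside the unit circle, then n-1 points on the smaller arc from A to B."""
    theta = math.acos(1.0 / dist)             # tangent points lie at angles +theta, -theta
    arc = [(math.cos(theta - 2 * theta * k / (n - 2)),
            math.sin(theta - 2 * theta * k / (n - 2))) for k in range(n - 1)]
    return [(dist, 0.0)] + arc

for n in (4, 6, 9):
    print(n, count_inner_diagonals(tangent_fan_polygon(n)))
```

For comparison, a convex $n$-gon has all $n(n-3)/2$ of its diagonals in its interior.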


Also solved by M. Deakin (New Guinea), R. B. Eggleton (Australia), Michael Goldberg, 
M. G. Greening (Australia), Norman Miller, E. F. Schmeichel, Roger Weitzenkamp, William 
Wernick, W. G. Wild, Joseph Zaks, and the proposers. 

Eggleton establishes the result that a simple $n$-gon has precisely $n - 3$ inner diagonals if and only
if no two of its diagonals intersect.


Packing Circles into Quadrilaterals 
E 2215 [1970, 192]. Proposed by Michael Goldberg, Washington, D.C. 


Find the shapes of the quadrilaterals of least area $K$ which enclose three given
circles of radii $1$, $b$, $b$.


Solution by Thomas Hughes, Arlington, Texas. The formulas for K as a 
function of b are as follows (compare the corresponding figures) : 


(1) $0 < b \le (\sqrt2 - 1)/(\sqrt2 + 1)$, $\quad K = 4$.
(2) $(\sqrt2 - 1)/(\sqrt2 + 1) \le b \le 0.415$, $\quad K = (1 + b)^2/(\sqrt{b}\,(1 - b))$.
(3) $0.415 \le b \le 0.926$,
$K = (1 + b + \sqrt{1 + 2b})\bigl((1 + b + \sqrt{1 + 2b})\cot A + 2\cot(90^\circ - \tfrac12 A)\bigr)$,
where $A = B + C$, $B = \operatorname{Arcsin}((1 - b)/(1 + b))$, $C = \operatorname{Arccos}(b/(1 + b))$.


(4) 09265681, K = (d+ 6+ 1) tan 6+ 3 cot @, 
where d = (56? +8b+/6+26+1)"/2, 0=45° +2 Arcsin((1—b)/d). 
(5) 1<o85 22-1, K = 20(3b+ 2Vb+ (b(2 — 5))"). 


(6) 2/2—-18 6 S 3.783, 

K = b(6b + 4/b + 2v/b(2 — 5)/(b — 1) + (6 — 1)/2V/)). 
(7) 3.783 Sb54, K = 4b(b + 2). 
(8) 4 <b, K = 86%. 


Also solved by the proposer. 


A Comparison of Integrals 
E 2216 [1970, 192]. Proposed by M.S. Klamkin, Ford Scientific Laboratory 
Which of the two integrals

\[ \int_0^1 x^x\,dx, \qquad \int_0^1\!\!\int_0^1 (xy)^{xy}\,dx\,dy \]

is larger?


Solution by R. A. Groeneveld, Mount Holyoke College. Making the
substitution $u = xy$, the second integral may be written

\[ \int_0^1\!\!\int_u^1 \frac{u^u}{x}\,dx\,du = -\int_0^1 u^u (\log u)\,du. \]

Since

\[ \int_0^1 u^u (1 + \log u)\,du = u^u \Big|_0^1 = 0, \]

the two stated integrals are equal.
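
A quick numerical comparison is consistent with this conclusion. The following sketch uses the midpoint rule for both integrals; the grid sizes are arbitrary and the function names are introduced only for this illustration.

```python
def single_integral(steps=2000):
    """Midpoint-rule approximation of the integral of x^x over [0, 1]."""
    h = 1.0 / steps
    return h * sum(((k + 0.5) * h) ** ((k + 0.5) * h) for k in range(steps))

def double_integral(steps=400):
    """Midpoint-rule approximation of the integral of (xy)^(xy) over the unit square."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        for j in range(steps):
            u = x * (j + 0.5) * h
            total += u ** u
    return total * h * h

print(single_integral())
print(double_integral())
```

Both approximations should agree, to within the discretization error, with the computed value 0.78343051 quoted for the single integral.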


Also solved by D. S. Adorno, Christian Andersen (Denmark), K. F. Anderson, J. Beeman & B. 
Dimsdale, J. W. Burns, Joe Chance, J. A. Coatimundi, Steven Ferry, N. J. Fine, B. A. Fusaro, J. 
Gillis (Israel), M. F. Gillis, David Gootkind, C. B. Grosch, Judith R. Gumerman, Seymour Haber, 
Ellen Hertz, R. A. Horn, D. G. Kabe, M. M. Klein, E. F. Knapp, H. S. M. Kruijer (Netherlands), 
J. R. Kuttler, Harry Lass, J. J. Leeson & D. L. Sherry, H. S. Lieberman, J. G. Mauldon, Simeon 
Reich (Israel), Valerie Rindone, E. F. Schmeichel, Michael Shimshoni (Israel), J. S. Shipman, 
G. A. Stoops, St. Olaf College Students, E. W. Trost (Switzerland), M. K. Vamanamurthy, 
M. M. Wells, J. E. Wilkins, Jr., and the proposer. 

J. Gillis proves the following generalization: Define

\[ I_r = \int_0^1\!\!\int_0^1 \cdots \int_0^1 (x_1 x_2 \cdots x_r)^{x_1 x_2 \cdots x_r}\,dx_1\,dx_2 \cdots dx_r, \]

$r = 1, 2, \dots$. Then $I_1 = I_2 < I_3 < I_4 < \cdots$, and $\lim_{r\to\infty} I_r = 1$. Klein reports the computer value of
$I_1$ is 0.78343051.
ADVANCED PROBLEMS


All solutions of Advanced Problems should be sent to J. Barlaz, Rutgers—The State Uni- 
versity, New Brunswick, NJ 08903. Solutions of Advanced Problems in this issue should be 
typed (with double spacing) on separate, signed sheets and should be mailed before March 31, 
1971. Contributors (in the United States) who desire acknowledgement of receipt of their
solutions are asked to enclose self-addressed, stamped postcards. 


An asterisk (*) means neither the proposer nor the editors supplied a solution. 


5765.* Proposed by Simeon Reich, Israel Institute of Technology, Haifa 
On p. 325 of Dugundji, Topology, the following result is stated: A is a
mation retract of B over X if and only if A is a retract of B and B is deformable 


into A over X. 
Is this true? 


5766. Proposed by Morris Newman, National Bureau of Standards 

Let $\alpha$, $\beta$ be complex numbers linearly independent over the reals. Let $g(z)$
be an entire function, and suppose that there are constants $a$, $b$ such that for all
$z$, $g(z + \alpha) = a\,g(z)$, $g(z + \beta) = b\,g(z)$. Prove that constants $A$, $B$ exist such that
$g(z) = A \exp(Bz)$.


5767. Proposed by Howard Jacobowitz, New York University 


Let $g(t)$ be a real valued function on the open unit interval. The Cauchy
criterion can be expressed in the following form:

\[ \lim_{t\to0}\ \sup_{0 < t_1, t_2 < t} \{g(t_1) - g(t_2)\} = 0 \]

implies that $g(t)$ has a finite limit as $t \to 0$.
Prove or disprove the following modification:

\[ \lim_{t\to0}\ \sup_{t < t_1, t_2 < 2t} \{g(t_1) - g(t_2)\} = 0 \]

implies that $g(t)$ has a finite limit as $t \to 0$.


5768.* Proposed by Peter Flor, University of Vienna, Austria 


Minkowski's singular monotone function $M(x)$ is defined as follows: $M(0) = 0$,
$M(1) = 1$; if $a = p/q$ and $b = p'/q'$ are rational numbers in $[0, 1]$ such that
$pq' - p'q = 1$, and if $c = (p + p')/(q + q')$, then $M(c) = \tfrac12[M(a) + M(b)]$. This
defines $M(x)$ for every rational $x \in [0, 1]$; the function can then be extended by
continuity to all of $[0, 1]$. Obviously $0$, $\tfrac12$, and $1$ are fixed points of $M(x)$. Prove
that there are exactly two further fixed points, $d_1 = 0.4203723\cdots$ and
$d_2 = 1 - d_1$. Decide whether they are rational.
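
The quoted value of $d_1$ can be reproduced numerically. The sketch below evaluates $M$ by repeatedly taking mediants (halving the value interval at each step) and then locates the fixed point in $(0.4, 0.45)$ by bisection; the bracketing interval, the iteration counts, and the function names are assumptions made for this illustration.

```python
def minkowski(x, depth=60):
    """Approximate M(x) for x in [0, 1] by descending through mediants.
    At each step the value of M at the mediant is the mean of the endpoint values."""
    p, q, r, s = 0, 1, 1, 1        # current fractions a = p/q and b = r/s
    lo, hi = 0.0, 1.0              # M(a) and M(b)
    for _ in range(depth):
        mp, mq = p + r, q + s      # mediant c = (p + r)/(q + s)
        mid = (lo + hi) / 2        # M(c)
        if x * mq < mp:            # x lies to the left of c
            r, s, hi = mp, mq, mid
        else:                      # x lies at or to the right of c
            p, q, lo = mp, mq, mid
    return (lo + hi) / 2

def fixed_point(lo=0.4, hi=0.45, iterations=50):
    """Bisection on M(x) - x, using M(0.4) < 0.4 and M(0.45) > 0.45."""
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if minkowski(mid) < mid:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

print(fixed_point())
```

Such a computation says nothing, of course, about whether the fixed point is rational.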


5769. Proposed by L. W. Shapiro, Howard University 


Show that a finite simple group has no irreducible representation over the 
complex numbers of degree two. 
5770.* Proposed by J. P. Jones, University of Calgary


Does there exist a function $\psi = \psi(x, y)$ continuous on some domain in $R^2$
with the following property? For each real valued $\theta(x)$ continuous on a connected
domain, there exists a $y_0$ such that $\psi(x, y_0)$ and $\theta(x)$ have the same domain and
agree there.


SOLUTIONS OF ADVANCED PROBLEMS 


Product of Algebraic Numbers 
5674 [1969, 565; 1970, 661]. Proposed by L. Carlitz, Duke University 


It is proved in problem 5542 [1968, 1021] that the following statement is
incorrect: If $a$, $b$ are algebraic over $F$ of degree $m$ and $n$ respectively and if $m$ and
$n$ are relatively prime, then $ab$ is algebraic over $F$ of degree $mn$.

Show that the statement is correct when $F = GF(q)$, $q = p^r$, $p$ prime, $r \ge 1$.


II. Solution by R. A. Moore, Southern Illinois University. Let $F$ be any field
with $F(a)$ and $F(b)$ normal separable extensions of $F$ of degrees $m$ and $n$
respectively. If $(m, n) = 1$, then Galois Theory yields the following facts:

(i) $\deg(F(a, b)/F) = m \cdot n$.

(ii) The Galois groups $G(F(a, b)/F(ab))$ and $G(F(b)/F(b) \cap F(ab))$ are
isomorphic.

It follows that $\deg(F(a, b)/F(ab))$ divides $\deg(F(b)/F) = n$.

In like manner, $\deg(F(a, b)/F(ab))$ divides $m$. Since $(m, n) = 1$, $F(a, b)
= F(ab)$. This includes the special case $F = GF(q)$.


Editorial Note. The author of Solution I [1970, 661] reports that his solution is incorrect.
Irving Guest has provided a counterexample to the claim that $ab$ is a root of unity of order
l.c.m. $(\alpha, \beta)$.


A Set Hierarchy 


5697 [1969, 1074]. Proposed by Frederick Hammer, Paine College, Augusta, 
Georgia 


A set is transitive if $x \in y \in S$ implies $x \in S$, disjoint if $x, y \in S$ implies
$x \cap y = \emptyset$ or $x = y$. The Axiom of Regularity states that $x \ne \emptyset$ implies the existence of
$y \in x$ with $y \cap x = \emptyset$. Show that in each finite cardinality there is exactly one
transitive and disjoint set, and further, that there is only one transitive and
disjoint infinite set, assuming the Axiom of Regularity.


Solution by W. G. McArthur, Shippensburg (Pa.) State College. Let $N_1 = \emptyset$ and
for $k$ a natural number greater than or equal to 2, let $N_k = \{N_{k-1}\}$. Let $A_0 = \emptyset$,
$A_1 = \{N_1\}$, and in general, $A_k = \{N_1, N_2, \dots, N_k\}$. Finally, let $A_\omega = \bigcup_{k\in\omega} A_k$.
Evidently each $A_k$ and $A_\omega$ is transitive and disjoint. Furthermore, card $A_k
= k$ and card $A_\omega = \aleph_0$.

We shall first prove, by induction, that if $S$ is a transitive disjoint set and
card $S \ge k$, then $N_k \in S$ ($k \ge 1$). For $k = 1$, suppose $N_1 = \emptyset$ is not an element of $S$.
By the Axiom of Regularity, there is an $x \in S$ such that $x$ and $S$ are disjoint.
Since $x \ne \emptyset$, there is $y \in x$. By transitivity of $S$, $y$ is also an element of $S$. Thus,
$y$ is in $S \cap x$, a contradiction, so that $N_1 \in S$. Now assume that the condition is
true for a fixed $k$ and suppose that card $S \ge k + 1$. By the inductive hypothesis,
$A_k \subset S$. Let $T = S - A_k$, and note that $T \ne \emptyset$. Thus, by the Axiom of Regularity,
there is $x \in T$ such that $x \cap T = \emptyset$. Since $x \ne \emptyset$, there is $y \in x$. Then $y \in S - T$ and
hence $y \in A_k$. By disjointness of $S$, $y = N_k$, and we must have $x = \{N_k\} = N_{k+1}$.

Now, let $S$ be a finite transitive disjoint set with card $S = k$; by the
preceding we have $S = A_k$. On the other hand, if $S$ is an infinite transitive disjoint
set, then $A_\omega \subset S$. If $A_\omega \ne S$, then by the Axiom of Regularity, there is an
element $x$ in $S - A_\omega$ such that $x$ and $S - A_\omega$ are disjoint. Then $x \ne \emptyset$ and there is
a $y$ in $x$; by transitivity, $y$ is in $A_\omega$. Hence there is a $k$ such that $y = N_k$. But
$\{N_k\} = N_{k+1} \in A_\omega \subset S$. This contradicts the disjointness of $S$. Thus $S = A_\omega$ and
the proof is complete.
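
The finite sets $A_k$ are easy to model with nested frozensets, which makes the two defining properties directly checkable. The following is an illustrative sketch only; the function names are hypothetical.

```python
def N(k):
    """N_1 is the empty set and N_k = {N_{k-1}} for k >= 2."""
    s = frozenset()
    for _ in range(k - 1):
        s = frozenset({s})
    return s

def A(k):
    """A_k = {N_1, ..., N_k}; A_0 is the empty set."""
    return frozenset(N(j) for j in range(1, k + 1))

def is_transitive(S):
    """x in y in S implies x in S."""
    return all(x in S for y in S for x in y)

def is_disjoint(S):
    """Distinct members of S have empty intersection."""
    return all(x == y or not (x & y) for x in S for y in S)

for k in range(7):
    print(k, len(A(k)), is_transitive(A(k)), is_disjoint(A(k)))

# For contrast, the von Neumann ordinal 3 = {0, 1, 2} is transitive but not disjoint,
# since 1 = {0} and 2 = {0, 1} share the element 0.
ZERO = frozenset()
ONE = frozenset({ZERO})
TWO = frozenset({ZERO, ONE})
THREE = frozenset({ZERO, ONE, TWO})
print(is_transitive(THREE), is_disjoint(THREE))
```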


Also solved by M. L. Laplaza (Puerto Rico), J. M. Plotkin, and the proposer. 


Inner R-Derivations 


5707 [1970, 84]. Proposed by W. A. Vasconcelos, Rutgers—The State Uni- 
versity 


Let R be an integral domain and G a finite group. Assume the characteristic 
of R does not divide | G| . Prove that each R-derivation of R[G] is inner. 


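Solution by Barbara Osofsky, Rutgers-The State University. Let $\delta$ be an
$R$-derivation of $R[G]$ and let $\{c_i \mid 1 \le i \le m\}$ be the distinct conjugate classes of
$G$, $h_i \in c_i$, $d_i = \sum_{g \in c_i} g \in R[G]$. Let $x = \sum_{h \in G} h\,\delta(h^{-1}) = \sum_{g \in G} \alpha_g g$, $\alpha_g \in R$. Let
$y = x - \sum_{i=1}^{m} \alpha_{h_i} d_i$. Then $x$ and $y$ induce the same inner derivation, and the
coefficient $\beta_{h_i}$ in $y = \sum_{g \in G} \beta_g g$, $\beta_g \in R$, must be 0.

Now, for all $g \in G$,

\[ g^{-1} x g = \sum_{h \in G} g^{-1} h\,\delta(h^{-1}) g = \sum_{h \in G} g^{-1} h\,[\delta(h^{-1} g) - h^{-1}\delta(g)] \]
\[ = \sum_{h \in G} g^{-1} h\,\delta(h^{-1} g) - \sum_{h \in G} g^{-1}\delta(g) = x - |G|\,g^{-1}\delta(g). \]

So $|G|\,\delta(g) = gx - xg = gy - yg$ for all $g \in G$.

Let $k \in G$. $k = g h_i g^{-1}$ for some $g$ and $i$. Then the coefficient of $g h_i$ in $gy$ is
zero, and the coefficient of $g h_i$ in $|G|\,\delta(g)$ is divisible by $|G|$, so that the
coefficient of $g h_i = kg$ in $yg$, namely $\beta_k$, is divisible by $|G|$. Hence $\delta$ is the inner
derivation induced by $(1/|G|)\,y \in R[G]$.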


Also solved by G. A. Elliott, E. R. Gentile (Argentina), M. G. Greening (Australia), Miguel 
Torres (Spain), and the proposer. 
Separable Extension Fields
5708 [1970, 84]. Proposed by C. W. Avery, San Jose State College 


Let $K$ be a finite extension of the field $k$, complete in a non-Archimedean
valuation. Let $\bar{K}$ and $\bar{k}$ denote the residue class fields. Problem 16, p. 129 of
P. J. McCarthy, Algebraic Extensions of Fields asserts that $K$ is separable over
$k$ if $\bar{K}$ is separable over $\bar{k}$. Disprove.


Solution by Robert Gilmer, Florida State University. Let $F$ be a field of
characteristic $p \ne 0$, let $X$ be an indeterminate over $F$, let $K$ be the quotient field
of the formal power series ring $F(X)[[Y]]$, and let $k$ be the quotient field of
$F(X)[[Y^p]]$. If $v$ is a valuation on $K$ associated with the valuation ring
$F(X)[[Y]]$, and if $w$ is similarly defined on $k$ for $F(X)[[Y^p]]$, then $K$ and $k$
are complete under $v$ and $w$, respectively. However, $K/k$ is purely inseparable
of degree $p$, while $\bar{K} = \bar{k} = F(X)$.


Also solved by Louis Dernier, Harley Flanders (Israel), Harald Niederreiter, T. F. Zelman, 
and the proposer. 


A Linear Combination of Normally Distributed Variables 


5709 [1970, 85]. Proposed by W. A. J. Luxemburg, California Institute of 
Technology 


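For all x > 0, determine 

$$\lim_{n\to\infty} \pi^{-n/2} \int\cdots\int_{D_n(x)} \exp\bigl(-(t_1^2 + \cdots + t_n^2)\bigr)\, dt_1 \cdots dt_n,$$

where 

$$D_n(x) = \Bigl\{(t_1, \ldots, t_n) : \Bigl|\frac{t_1}{\sqrt{1}} + \frac{t_2}{\sqrt{2}} + \cdots + \frac{t_n}{\sqrt{n}}\Bigr| < x\Bigr\}.$$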


Solution by J. C. Tanner, Crowthorne, Berkshire, Great Britain. For a fixed 
n, the problem may be reformulated as follows: If $x_1, x_2, \ldots, x_n$ are 
independently and normally distributed with zero mean and variance 1/2, what is 
the probability that 

$$\Bigl|\frac{x_1}{\sqrt{1}} + \frac{x_2}{\sqrt{2}} + \cdots + \frac{x_n}{\sqrt{n}}\Bigr| < x\,?$$

Denoting the left hand side by |y|, y is normally distributed with zero mean 
and variance 

$$\sigma_n^2 = \frac{1}{2}\Bigl(1 + \frac{1}{2} + \frac{1}{3} + \cdots + \frac{1}{n}\Bigr),$$

and therefore the probability that |y| < x is $2F(x/\sigma_n) - 1$, where 

$$F(X) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{X} e^{-t^2/2}\, dt$$

is the normal integral. As n tends to infinity, $\sigma_n$ tends to infinity, and the 
probability tends to 2F(0) − 1 = 0. 
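
The closed form in this solution is easy to evaluate numerically. The sketch below is an illustration only. It takes the variance to be σ_n² = (1/2)(1 + 1/2 + ... + 1/n) and uses the standard relation 2F(z) − 1 = erf(z/√2). The function names `sigma_n` and `prob_abs_less` are invented for the example.

```python
import math

def sigma_n(n):
    """Standard deviation of y, assuming variance (1/2) * (1 + 1/2 + ... + 1/n)."""
    harmonic = sum(1.0 / i for i in range(1, n + 1))
    return math.sqrt(0.5 * harmonic)

def prob_abs_less(x, n):
    """P(|y| < x) = 2 F(x / sigma_n) - 1, written with the error function."""
    return math.erf(x / (sigma_n(n) * math.sqrt(2.0)))

if __name__ == "__main__":
    for n in (1, 10, 100, 1000, 10000):
        print(n, prob_abs_less(1.0, n))
```

Because the harmonic sum grows only like log n, the printed probabilities fall toward 0 very slowly. This agrees with the stated limit but shows that a numerical experiment alone would not make the limit obvious.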


Also solved by D. R. Breach (New Zealand), L. E. Clarke (England), Michael Deakin (New 
Guinea), E. J. Dudewicz, Peter Ennis, Joe Flowers, David Gootkind, J. R. Gumerman, D. A. 
Hejhal, N. L. Johnson, Morton Kupperman, O. P. Lossers (Netherlands), M. F. Neuts, G. S. 
Rogers, E. F. Schmeichel, and the proposer. 


A Special Topology for the Integers 
5712 [1970, 85]. Proposed by Dan Marcus, York University, Toronto 


Is it possible to topologize the integers in such a way that the connected 
sets are the sets of consecutive integers? Generalize to the lattice points of 
n-space. 


Solution by Cleveland State University Problem Solving Group, Frank Wyse, 
Advisor. Let L denote the set of lattice points in n-space. For x, y in L, we write 
xRy if and only if they differ by 1 in one coordinate and agree in all of the others. 
For x, y elements of a subset $S \subset L$, we write $x E_S y$ if and only if there is a 
sequence $x = x_1, x_2, \ldots, x_m = y$ (where m may be any positive integer) such that 
$x_i \in S$ for each index i and $x_i R x_{i+1}$ for i < m. We say $S \subset L$ is a set of consecutive 
points of L if $x E_S y$ whenever x and y are in S. 

Now for any $x \in L$, let $U_x = \{y \in L : y = x \text{ or } yRx\}$ if the sum of the 
coordinates of x is even; and let $U_x = \{x\}$ otherwise. Then $x \ne y$ and z in $U_x \cap U_y$ 
imply $U_z = \{z\}$, so that $\{U_x : x \in L\}$ is a base for a topology on L such that 
each $x \in L$ has a smallest neighborhood, namely $U_x$. Let L have this topology. 

Now suppose C is a nonempty connected subset of L. Choose $x_0 \in C$, let 
$A_1 = \{y \in C : y E_C x_0\}$, and let $A_2 = C - A_1$. Then $x \in A_i$ implies $U_x \cap C \subset A_i$, so 
that $A_1$ and $A_2$ are both open in C. Hence $A_2 = \emptyset$, and $C = A_1$ is a set of 
consecutive points of L. 

Conversely, suppose C is a set of consecutive points of L. Let A and B be 
disjoint nonempty sets whose union is C. Choose $x \in A$, $y \in B$ and a sequence 
$x = x_1, x_2, \ldots, x_m = y$ such that $x_i \in C$ for all i and $x_i R x_{i+1}$ for i < m. Then, for 
some i we have $x_i \in A$ and $x_{i+1} \in B$, so that either $x_i \in U_{x_{i+1}}$, in which case B is 
not open in C, or $x_{i+1} \in U_{x_i}$, in which case A is not open in C. Hence, C is 
connected. 
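
The construction can be tested by brute force on a small window of the lattice. The sketch below is an illustration under stated assumptions, not part of the published solution: it takes U_x to be x together with its lattice neighbours when the coordinate sum of x is even, and {x} otherwise, and it uses the fact that a set A is open in the subspace C exactly when U_x ∩ C lies in A for every x in A. All function names are invented for the example.

```python
from itertools import combinations

def neighbours(x):
    """Lattice points differing from x by 1 in exactly one coordinate."""
    for i in range(len(x)):
        for step in (-1, 1):
            yield x[:i] + (x[i] + step,) + x[i + 1:]

def smallest_nbhd(x):
    """U_x: x plus its neighbours if the coordinate sum is even, else {x}."""
    if sum(x) % 2 != 0:
        return {x}
    return {x, *neighbours(x)}

def is_open_in(a, c):
    """A is open in the subspace C iff U_x & C is inside A for each x in A."""
    return all(smallest_nbhd(x) & c <= a for x in a)

def is_connected(c):
    """True unless C splits into two nonempty sets that are both open in C."""
    c = frozenset(c)
    pts = sorted(c)
    if not pts:
        return True
    first, rest = pts[0], pts[1:]
    for size in range(len(rest)):          # keeps B = C - A nonempty
        for extra in combinations(rest, size):
            a = frozenset((first,) + extra)
            if is_open_in(a, c) and is_open_in(c - a, c):
                return False
    return True

def is_consecutive(c):
    """True if any two points of C are joined by a chain of neighbours in C."""
    pts = set(c)
    if not pts:
        return True
    start = next(iter(pts))
    seen, stack = {start}, [start]
    while stack:
        x = stack.pop()
        for y in neighbours(x):
            if y in pts and y not in seen:
                seen.add(y)
                stack.append(y)
    return seen == pts

def agree_on_window(points):
    """Compare the two notions on every nonempty subset of a finite window."""
    return all(
        is_connected(sub) == is_consecutive(sub)
        for size in range(1, len(points) + 1)
        for sub in combinations(points, size)
    )
```

For instance, `agree_on_window([(i,) for i in range(-3, 4)])` compares the two notions on every nonempty subset of seven consecutive integers, and the same call on a 3-by-3 block of lattice points covers a planar case. A finite check of this kind supports the argument but does not replace it.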


Also solved by M. E. Ballotti, J. L. Bryant & R. W. Gilmer, F. A. Delahan, Ellen Hertzmark, 
H. C. Kranzer, Michael McCoy, P. J. Saavedra, E. F. Schmeichel, Konrad Victor (Israel), and 
the proposer. 
Limits, The Definite Integral as a Limit, S. F. 
Dice, 212-213. 

Morse, Marston. See Bott, Raoul. 

Systems of Numeration, John Colburn Associates, 
Inc., MANMOHAN S. ARORA, 667. 

What is Area? 

Area Under a Curve. 

The Definite Integral, DONALD CROWE, 94-95. 

What is Mathematics and How do we Teach it? 
K. O. May, 213. 


NEWS AND NOTICES 
PERSONAL ITEMS 
110-113, 224-225, 330, 427-429, 552-553, 676-679, 797-798, 916, 1035, 1134, 
GENERAL INFORMATION 


An appeal for preservation of archival materials 
by the CBMS Advisory Committee on 
History, 110-112. 

Canadian Mathematical Bulletin acquires new 
dress, 1135. 

CRICISAM Calculus Project, 916. 

Fellowship and research opportunities in the 
mathematical sciences, 1036. 

Hans Rademacher Instructorships, 429. 

International Federation for Information Pro- 
cessing (IFIP), 799. 

NCTM Annual Meeting, 916. 

Notre Dame Journal of Formal Logic—Off- 
prints of articles available, 799. 

NSF Regional Research Conferences in the 
mathematical sciences, 1136. 


NSF seeks proposals for regional conferences in 
the mathematical sciences, 330. 

Professional training in mathematics, 1134. 

Publication costs, DANIEL ZELINSKY, 112. 

Rocky Mountain Journal of Mathematics, 224- 
225. 

Science education news, 798. 

6-th International Cybernetics Congress, 679. 

Steele Prizes for expository papers, 799. 

Symposia information center, 225. 

Symposium on statistical decision theory and 
related topics, 799. 

University of Illinois—two-year fellowship for 
prospective junior college teachers, 113. 
University of Texas—eighth annual symposium 
on biomathematics and computer science 
in the life sciences, 225. 


NECROLOGY 


Baldwin, J. W., 797. 
Barrow, D. F., 798. 
Bartels, W. H., 798. 
Bershad, M. A., 679. 
Blackwell, R. C., 798. 
Brown, E. G., 798. 
Camp, E. J., 798. 
Church, Randolph, 553. 
Conkling, R. M., 679. 
Conwell, G. M., 798. 
Elrick, R. M., 798. 
Evans, C. F., 798. 
Feller, William, 798. 
Fort, Tomlinson, 798. 
Freese, Frances, 553. 
Goldrich, Abraham, 110. 
Griffin, F. L., 429. 
Harrison, J. H., 1035. 
Hendricks, P. M., Jr., 553. 
Hoffman, Walter, 798. 
Holst, J. J., 679. 
Hughes, H. H., 798. 
Kaplan, Sidney, 798. 
Lasley, J. W., 110. 
Lee, F. E., 798. 
Lomberg, R. A., 798. 
Lund, J. L., 679. 
March, H. W., 679. 
McKissick, R. J., 679. 
Mehlberg, Josephine J., 679. 
Moser, Leo, 553. 
Moy, Shu-Teh, 553. 
Nyswander, J. A., 1035. 
Odin, Eugene, 1035. 
Oldenburger, Rufus, 553. 
Payne, C. K., 553. 
Priebe, R. D., 798. 
Sanders, S. T., 916. 
Scavella, A. J., 429. 
Sierpinski, Waclaw, 679. 
Townsend, B. B., 798. 
Vance, G. T., 798. 
Whyburn, G. T., 798. 
Worthington, Euphemia, 798. 
REPORTS AND ANNOUNCEMENTS OF THE ASSOCIATION 
AND ITS SECTIONS 


MEETINGS AND ANNOUNCEMENTS OF THE ASSOCIATION 


Academic members elected into the Association, 
H. L. ALDER, 441, and 1047-1048. 

Acknowledgment, 1137. 

Announcement of Lester R. Ford Awards, H. L. 
ALDER, 801-802. 

Fifty-first Summer Meeting of the Association, 
H. L. ALDER, 1036-1046. 

Fifty-third Annual Meeting of the Association, 
H. L. ALDER, 430-436. 

New procedures for mathematical sciences 
employment register, 1048-1049. 


New Sectional Governors of the Association, 
A. B. WILLCOX, 802. 

Officers and Committees as of February 1, 1970, 
436-441. 

Report of the treasurer for the year 1969, E. A. 
CAMERON, 802-803. 

Request for information from CEM, 225. 

Summer group flights to Europe, 1049. 

The 1970 William Lowell Putnam Mathematical 
Competition, 917. 


MEETINGS OF ITS SECTIONS 


Allegheny Mountain, May 1970, M. R. WOODARD, 
925-926. 

Florida, March 1970, F. L. CLEAVER, 917-918. 

Indiana, November 1969, M. J. MANSFIELD, 
442. May 1970, M. J. MANSFIELD, 926. 

Iowa, April 1970, B. E. GILLAM, 921. 

Kentucky, May 1970, A. S. HOWARD, 926-927. 

Maryland-District of Columbia-Virginia, October 
1969, DENNY GULICK, 441-442. April 
1970, DENNY GULICK, 922. 

Metropolitan New York, April 1970, GERMANA 
GLIER, 922-923. 

Michigan, April 1970, H. T. SLABY, 923. 

Nebraska, April 1970, HENRY M. COX, 923-924. 

New Jersey, November 1969, J. K. RECKZEH, 
442-443. May 1970, JOHN RECKZEH, 927. 

North Central, October 1969, WARREN THOMSEN, 
331. May 1970, W. J. THOMSEN, 1136-1137. 

Northeastern, November 1969, G. W. BEST, 
799-800. 


Northern California, February 1970, G. L. 
ALEXANDERSON, 800. 

Ohio, October 1969, FOSTER BROOKS, 442. May 
1970, FOSTER BROOKS, 928. 

Oklahoma-Arkansas, March 1970, E. K. McLACHLAN, 
800-801. 

Pacific Northwest, August 1969, E. A. MAIER, 
331. 

Philadelphia, November 1969, A. E. FILANO, 
443. 

Southeastern, March 1970, B. F. BRYANT, 918-919. 

Southern California, March 1970, D. H. POTTS, 
919-920. 

Southwestern, April 1969, S. T. KAO, 113. 
March 1970, R. W. BALL, 920-921. 

Texas, April 1970, J. C. BRADFORD, 924-925. 

Upper New York State, November 1969, P. 
SCHAEFER, 443. May 1970, PAUL SCHAEFER, 
928-929. 

Wisconsin, May 1970, R. W. CHRISTENSEN, 929. 


